Memory budget · 8 GB
Best local LLMs for 8GB
8GB is not a single ceiling. An 8GB Mac and an 8GB Laptop each leave a different amount free for model weights, so the largest model you can run changes with the memory type, not just the number.
- Usable range
- 5–5.5 GB
- Models that fit
- 24
- Memory types
- 2
- Top pick
- 4B
What 8GB actually gives you
Usable figures are sourced per device (tap a card for the full profile). Verdicts below use Q4_K_M, the community-default quant.
Runs comfortably on the most capable 8GB setup (Apple M1 (8GB), ~5.5 GB usable) at ~3.8 GB. Check it against your exact device on its model page.
Models ranked for 8GB
- Gemma 3 4B4B · ~3.8 GB at Q4_K_M · Elo 1303
- Qwen3 4B4B · ~3.8 GB at Q4_K_M
- Phi-3.5-mini 3.8B3.82B · ~3.7 GB at Q4_K_M
- Phi-4-mini 3.8B3.8B · ~3.8 GB at Q4_K_M
- Qwen2.5-VL 3B3.75B · ~4.4 GB at Q4_K_M
- Qwen2.5 3B3.09B · ~3.3 GB at Q4_K_M
- Qwen2.5 Coder 3B3.09B · ~3 GB at Q4_K_M
- Llama 3.2 3B3B · ~3.2 GB at Q4_K_M · Elo 1166
- SmolLM3 3B3B · ~3 GB at Q4_K_M
- Gemma 2 2B2.61B · ~2.9 GB at Q4_K_M
- Granite 3.1 2B2.53B · ~2.8 GB at Q4_K_M
- SSarvam-1 2B2B · ~2.7 GB at Q4_K_M
- SmolLM2 1.7B1.7B · ~2.2 GB at Q4_K_M · Elo 1114
- Qwen3 1.7B1.7B · ~2.4 GB at Q4_K_M
- Qwen2.5 1.5B1.54B · ~2.2 GB at Q4_K_M
- Qwen2.5 Coder 1.5B1.54B · ~2 GB at Q4_K_M
- TLTinyLlama 1.1B1.1B · ~1.8 GB at Q4_K_M
- Llama 3.2 1B1B · ~1.8 GB at Q4_K_M · Elo 1110
- Gemma 3 1B1B · ~1.8 GB at Q4_K_M
- Qwen3 0.6B0.6B · ~1.5 GB at Q4_K_M
- Qwen2.5 0.5B0.494B · ~1.5 GB at Q4_K_M
- Qwen2.5 Coder 0.5B0.494B · ~1.4 GB at Q4_K_M
- SmolLM2 360M0.362B · ~1.2 GB at Q4_K_M
- SmolLM2 135M0.135B · ~1 GB at Q4_K_M
Each chip links to the full breakdown for that model on a real 8GB device. "Tight" means it fits but with little headroom, close other apps.
The ceiling, per memory type
Apple M1 (8GB) (~5.5 GB usable)
Runs up to Qwen3 4B (4B) comfortably at Q4_K_M. Larger models either sit tight or spill past the ~5.5 GB it can give a model.
8GB RAM Laptop (CPU/iGPU only) (~5 GB usable)
Runs up to Qwen3 4B (4B) comfortably at Q4_K_M. Larger models either sit tight or spill past the ~5 GB it can give a model.
8GB phones & tablets
Phones report 8GB too, but iOS/Android reserve more and the runtimes differ. Their usable pool is smaller:
Too large for any 8GB device
FAQ
How much of 8GB can a model actually use?
It depends on the memory type. Apple unified memory: about 5.5 GB (Apple M1 (8GB)); System RAM (CPU only): about 5 GB (8GB RAM Laptop (CPU/iGPU only)). The rest is reserved for the OS, display and runtime overhead.
What is the best local LLM for 8GB?
Gemma 3 4B (4B) is the strongest model that runs comfortably at Q4_K_M on the most capable 8GB setup (Apple M1 (8GB), ~5.5 GB usable). On a tighter 8GB device the ceiling is lower, shown per row above.
Sources
Memory figures are estimates at Q4_K_M with a small context. See methodology.