Memory budget · 8 GB

Best local LLMs for 8GB

8GB is not a single ceiling. An 8GB Mac and an 8GB Laptop each leave a different amount free for model weights, so the largest model you can run changes with the memory type, not just the number.

Usable range: 5–5.5 GB
Models that fit: 37
Memory types: 2
Top pick: 5.1B

What 8GB actually gives you

Apple unified memory ~5.5 GB usable on Apple M1 (8GB) Comfortable ceiling: 5.1B System RAM (CPU only) ~5 GB usable on 8GB RAM Laptop (CPU/iGPU only) Comfortable ceiling: 4B

Usable figures are sourced per device (tap a card for the full profile). Verdicts below use Q4_K_M, the community-default quant.

Top pick for 8GB Q4_K_M

Gemma 4 E2B 5.1B

Runs comfortably on the most capable 8GB setup (Apple M1 (8GB), ~5.5 GB usable) at ~4.4 GB. Check it against your exact device on its model page.

Models ranked for 8GB

Biggest that fits first Mac · Laptop

Gemma 4 E2B
5.1B · ~4.4 GB at Q4_K_M

Mac Yes Laptop Tight
Gemma 3 4B
4B · ~3.8 GB at Q4_K_M · Elo 1303

Mac Yes Laptop Yes
Qwen3 4B
4B · ~3.8 GB at Q4_K_M

Mac Yes Laptop Yes
NE
Nemotron 3 Nano 4B
4B · ~3.9 GB at Q4_K_M

Mac Yes Laptop Yes
Phi-3.5-mini 3.8B
3.82B · ~3.7 GB at Q4_K_M

Mac Yes Laptop Yes
Phi-4-mini 3.8B
3.8B · ~3.8 GB at Q4_K_M

Mac Yes Laptop Yes
Phi-4-mini-reasoning
3.8B · ~3.6 GB at Q4_K_M

Mac Yes Laptop Yes
Qwen2.5-VL 3B
3.75B · ~4.4 GB at Q4_K_M

Mac Yes Laptop Tight
Granite 4.1 3B
3.4B · ~3.3 GB at Q4_K_M

Mac Yes Laptop Yes
Qwen2.5 3B
3.09B · ~3.3 GB at Q4_K_M

Mac Yes Laptop Yes
Qwen2.5 Coder 3B
3.09B · ~3 GB at Q4_K_M

Mac Yes Laptop Yes
Llama 3.2 3B
3B · ~3.2 GB at Q4_K_M · Elo 1166

Mac Yes Laptop Yes
SmolLM3 3B
3B · ~3 GB at Q4_K_M

Mac Yes Laptop Yes
OP
Apple OpenELM 3B
3B · ~3 GB at Q4_K_M

Mac Yes Laptop Yes
Gemma 2 2B
2.61B · ~2.9 GB at Q4_K_M

Mac Yes Laptop Yes
Granite 3.1 2B
2.53B · ~2.8 GB at Q4_K_M

Mac Yes Laptop Yes
S
Sarvam-1 2B
2B · ~2.7 GB at Q4_K_M

Mac Yes Laptop Yes
SmolLM2 1.7B
1.7B · ~2.2 GB at Q4_K_M · Elo 1114

Mac Yes Laptop Yes
Qwen3 1.7B
1.7B · ~2.4 GB at Q4_K_M

Mac Yes Laptop Yes
Qwen2.5 1.5B
1.54B · ~2.2 GB at Q4_K_M

Mac Yes Laptop Yes
Qwen2.5 Coder 1.5B
1.54B · ~2 GB at Q4_K_M

Mac Yes Laptop Yes
MI
MiniCPM-V 4.6
1.3B · ~1.6 GB at Q4_K_M

Mac Yes Laptop Yes
LF
LFM2 1.2B
1.17B · ~2.5 GB at Q4_K_M

Mac Yes Laptop Yes
LF
LFM2.5 1.2B
1.17B · ~1.8 GB at Q4_K_M

Mac Yes Laptop Yes
LF
LFM2.5 1.2B Thinking
1.17B · ~1.8 GB at Q4_K_M

Mac Yes Laptop Yes
TL
TinyLlama 1.1B
1.1B · ~1.8 GB at Q4_K_M

Mac Yes Laptop Yes
OP
Apple OpenELM 1.1B
1.1B · ~1.7 GB at Q4_K_M

Mac Yes Laptop Yes
Llama 3.2 1B
1B · ~1.8 GB at Q4_K_M · Elo 1110

Mac Yes Laptop Yes
Gemma 3 1B
1B · ~1.8 GB at Q4_K_M

Mac Yes Laptop Yes
LF
LFM2 700M
0.742B · ~1.9 GB at Q4_K_M

Mac Yes Laptop Yes
Qwen3 0.6B
0.6B · ~1.5 GB at Q4_K_M

Mac Yes Laptop Yes
Qwen2.5 0.5B
0.494B · ~1.5 GB at Q4_K_M

Mac Yes Laptop Yes
Qwen2.5 Coder 0.5B
0.494B · ~1.4 GB at Q4_K_M

Mac Yes Laptop Yes
SmolLM2 360M
0.362B · ~1.2 GB at Q4_K_M

Mac Yes Laptop Yes
LF
LFM2 350M
0.354B · ~1.4 GB at Q4_K_M

Mac Yes Laptop Yes
Gemma 3 270M
0.27B · ~1.1 GB at Q4_K_M

Mac Yes Laptop Yes
SmolLM2 135M
0.135B · ~1 GB at Q4_K_M

Mac Yes Laptop Yes

Each chip links to the full breakdown for that model on a real 8GB device. "Tight" means it fits but with little headroom, close other apps.

The ceiling, per memory type

Apple M1 (8GB) (~5.5 GB usable)

Runs up to Gemma 4 E2B (5.1B) comfortably at Q4_K_M. Larger models either sit tight or spill past the ~5.5 GB it can give a model.

8GB RAM Laptop (CPU/iGPU only) (~5 GB usable)

Runs up to Nemotron 3 Nano 4B (4B) comfortably at Q4_K_M. Larger models either sit tight or spill past the ~5 GB it can give a model.

8GB phones & tablets

Phones report 8GB too, but iOS/Android reserve more and the runtimes differ. Their usable pool is smaller:

iPhone 15 Pro ~4.5 GB iPhone 16 ~4.5 GB iPhone 16 Pro ~4.5 GB Generic Android Phone (8GB RAM) ~4.5 GB iPhone 17 ~4.5 GB

Too large for any 8GB device

Mistral 7B 7B Qwen2.5 7B 7B DeepSeek-R1-Distill-Qwen 7B 7B Qwen2.5 Coder 7B 7B Llama 3.1 8B 8B Qwen3 8B 8B DeepSeek-R1-Distill-Llama 8B 8B Gemma 3n E4B 8B Gemma 4 E4B 8B DeepSeek-R1-0528-Qwen3-8B 8.19B Qwen2.5-VL 7B 8.29B LFM2.5 8B-A1B 8.3B Granite 4.1 8B 8.8B Gemma 2 9B 9B GLM-4 9B 9B GLM-4-9B-0414 9B Nemotron Nano 9B v2 9B Ornith 1.0 9B 9B Falcon3 10B 10B Llama 3.2 Vision 11B 10.7B Gemma 3 12B 12B Gemma 4 12B 12B Mistral Nemo 12B 12.2B Phi-4 14B 14B Qwen2.5 14B 14B Qwen3 14B 14B DeepSeek-R1-Distill-Qwen 14B 14B Qwen2.5 Coder 14B 14B Phi-4-reasoning 14B DeepSeek-V2-Lite 16B gpt-oss 20B 21B ERNIE 4.5 21B-A3B 21B Mistral Small 3 24B 24B Sarvam-M 24B 24B Mistral Small 3.1 24B 24B Magistral Small 24B Devstral Small 24B LFM2 24B-A2B 24B Gemma 4 26B-A4B 26.5B Gemma 2 27B 27B Gemma 3 27B 27B Qwen3.6 27B 27.8B Granite 4.1 30B 28.9B Sarvam-30B 30B Nemotron 3 Nano 30B-A3B 30B Nemotron Cascade 2 30B-A3B 30B GLM-4.7-Flash 30B Qwen3 30B-A3B 30.5B Qwen3-Coder 30B-A3B 30.5B North Mini Code 1.0 30.5B Qwen2.5 32B 32B Qwen3 32B 32B DeepSeek-R1-Distill-Qwen 32B 32B Qwen2.5 Coder 32B 32B Granite 4.0 H Small 32B GLM-4-32B-0414 32B EXAONE 4.0 32B 32B OLMo 2 32B Instruct 32B Granite 4.0 H Small 32B Olmo 3.1 32B Instruct 32B Gemma 4 31B 32.7B Laguna XS 2.1 33.4B Yi 1.5 34B 34B Falcon-H1-34B-Instruct 34B Qwen-AgentWorld 35B-A3B 34.7B Command R 35B 35B Ornith 1.0 35B 35B Seed-OSS 36B Instruct 36B Qwen3.6 35B-A3B 36B Mixtral 8x7B 46.7B Llama-3.3-Nemotron-Super-49B-v1 49B Llama 3.3 70B 70B Qwen2.5 72B 72B Hunyuan-A13B-Instruct 80B Sarvam-105B 105B GLM-4.5-Air 106B Llama 4 Scout 109B Command A 111B gpt-oss 120B 117B Laguna S 2.1 118B Nemotron 3 Super 120B-A12B 120B dots.llm1 142B Qwen3 235B A22B 235B DeepSeek-V4-Flash 284B GLM-4.6 357B Llama 4 Maverick 400B MiniMax M3 428B MiniMax-M1-80k 456B Qwen3-Coder 480B-A35B Instruct 480B Nemotron 3 Ultra 550B-A55B 550B DeepSeek R1 671B DeepSeek V3 671B DeepSeek-R1-0528 671B GLM-5.2 744B Kimi K2 Instruct 1000B Kimi K2.6 1000B Kimi K2.7 Code 1000B DeepSeek-V4-Pro 1600B

FAQ

How much of 8GB can a model actually use?

It depends on the memory type. Apple unified memory: about 5.5 GB (Apple M1 (8GB)); System RAM (CPU only): about 5 GB (8GB RAM Laptop (CPU/iGPU only)). The rest is reserved for the OS, display and runtime overhead.

What is the best local LLM for 8GB?

Gemma 4 E2B (5.1B) is the strongest model that runs comfortably at Q4_K_M on the most capable 8GB setup (Apple M1 (8GB), ~5.5 GB usable). On a tighter 8GB device the ceiling is lower, shown per row above.

Sources

Memory figures are estimates at Q4_K_M with a small context. See methodology.