Model family · 14 sizes

Gemma: which size runs locally?

Gemma comes in 14 sizes, from 0.27B to 32.7B parameters. The strongest one tracked here, Gemma 3 27B, scores an LMArena Elo of 1366. Below is each size with its Q4_K_M weight size, the memory it needs, and the hardware that runs it.

Sizes: 14
Smallest: 0.27B
Largest: 32.7B
Runs from: 8GB

The Gemma lineup

Smallest first · Q4_K_M · min memory

"Needs" is the sourced minimum memory for Q4_K_M with a small context. Larger context needs more.
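To see why a larger context needs more memory, here is a rough sketch of how the key/value cache grows with context length. It assumes a plain full-attention transformer with a 16-bit cache. The layer count, head count, and head size below are hypothetical placeholders, not the configuration of any Gemma model, and models that use sliding-window or other cache-saving attention will need less.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_tokens, bytes_per_value=2):
    """Approximate KV cache size for a full-attention transformer.

    The leading 2 accounts for storing both keys and values.
    bytes_per_value=2 assumes a 16-bit cache (an assumption, not a Gemma default).
    """
    return 2 * n_layers * n_kv_heads * head_dim * context_tokens * bytes_per_value


# Hypothetical architecture for illustration only (NOT a real Gemma config).
LAYERS, KV_HEADS, HEAD_DIM = 32, 8, 128

small = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, context_tokens=4096)
large = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, context_tokens=32768)

print(f"4k context:  {small / 2**30:.2f} GiB")
print(f"32k context: {large / 2**30:.2f} GiB")
```

Under these assumptions the cache grows linearly with context, so an 8x longer context costs 8x the cache memory on top of the weights.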

Which Gemma fits your memory

Every memory budget below runs at least one Gemma size. Largest that fits at each budget:

8GB: Gemma 4 E2B (5.1B), best case on Apple M1 (8GB)
16GB: Gemma 4 12B (12B), best case on Nvidia GeForce RTX 4080 (16GB)
24GB: Gemma 4 31B (32.7B), best case on Nvidia GeForce RTX 4090 (24GB)
32GB: Gemma 4 31B (32.7B), best case on Nvidia GeForce RTX 5090 (32GB)
48GB: Gemma 4 31B (32.7B), best case on Apple M5 Pro (48GB)
64GB: Gemma 4 31B (32.7B), best case on Apple M4 Max (64GB)
128GB: Gemma 4 31B (32.7B), best case on Apple M5 Max (128GB)
256GB: Gemma 4 31B (32.7B), best case on Apple M3 Ultra (256GB)

Best case means the most capable device at that size (usually a discrete GPU). A Mac at the same size sits roughly one rung lower; see the per-size breakdown on each memory budget page.

FAQ

Which Gemma size should I run locally?

Pick the largest size your memory allows. In the best case, 8GB runs up to Gemma 4 E2B, 16GB runs up to Gemma 4 12B, and 24GB or more (32GB, 48GB, 64GB, 128GB, 256GB) runs up to Gemma 4 31B. Smaller sizes run faster and leave headroom for context.
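The rule above can be sketched as a small lookup. This is a minimal illustration, not the site's own logic: the tiers are copied from the best-case figures on this page, and the function name `largest_gemma_for` is made up for the example. Budgets between tiers fall back to the next tier down, which is a conservative assumption.

```python
# Best-case tiers from this page: (memory budget in GB, largest Gemma that fits).
# Budgets above 24GB all map to the same model, so 24 is the last tier needed.
BEST_CASE_TIERS = [
    (8, "Gemma 4 E2B"),
    (16, "Gemma 4 12B"),
    (24, "Gemma 4 31B"),
]


def largest_gemma_for(memory_gb):
    """Return the largest listed Gemma for a memory budget, or None below 8GB."""
    choice = None
    for tier_gb, model in BEST_CASE_TIERS:  # tiers are sorted smallest first
        if memory_gb >= tier_gb:
            choice = model
    return choice


for budget in (8, 12, 16, 24, 64):
    print(f"{budget}GB -> {largest_gemma_for(budget)}")
```

These are best-case picks with a small context; stepping down one size leaves more room for longer prompts.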

What is the smallest Gemma model?

The smallest is Gemma 3 270M, at 0.27B parameters. It suits phones and 8GB machines.

What is the largest Gemma model and what does it need?

The largest is Gemma 4 31B, at 32.7B parameters. Its Q4_K_M weights come to about 18.32 GB. It fits a high-memory desktop GPU or Mac, starting from the 24GB tier in the best case.
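As a sanity check on that figure, the two numbers quoted for Gemma 4 31B imply an average number of bits stored per weight. The sketch below is only arithmetic on the page's own values; it assumes 1 GB means 10^9 bytes and ignores any non-weight data in the file.

```python
def implied_bits_per_weight(file_size_gb, params_billions):
    """Average bits per parameter implied by a quantized file size.

    Assumes 1 GB = 10**9 bytes (an assumption; the page does not say GB vs GiB).
    """
    total_bits = file_size_gb * 1e9 * 8
    total_params = params_billions * 1e9
    return total_bits / total_params


# Figures quoted on this page for Gemma 4 31B at Q4_K_M.
bpw = implied_bits_per_weight(file_size_gb=18.32, params_billions=32.7)
print(f"{bpw:.2f} bits per weight")  # prints "4.48 bits per weight"
```

A result somewhat above 4 bits is expected for a 4-bit quantization, since the format also stores scaling data alongside the weights.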

Sources

Memory figures are estimates at Q4_K_M. See methodology.