# Llama 4 Scout: RAM and VRAM requirements

> Llama 4 Scout is a 109B-parameter Llama 4 model (Mixture-of-Experts, 17B active per token). At Q4_K_M it needs about **64.2 GB** to run and fits on **3 of 39** tracked devices. Minimum to run: Apple M4 Max (128GB).

Last validated: 2026-06-15. Sources: Ollama, HuggingFace GGUF repos, vendor specs.

## Memory by quantization
| Quant | On disk | To run (4k context) |
| --- | --- | --- |
| Q4_K_M | 60.87 GB | ~64.2 GB |

Memory = weights + KV cache + ~0.8 GB runtime overhead, and varies ±15% with context length.
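
The formula above can be checked against the table with a few lines of arithmetic. This is a minimal sketch, not a measurement: the KV cache figure is back-derived by assuming that whatever remains after weights and overhead is KV cache, and `memory_range` is a hypothetical helper that simply applies the stated ±15% band.

```python
# Figures taken from the table above (Q4_K_M, 4k context).
WEIGHTS_GB = 60.87   # size on disk
OVERHEAD_GB = 0.8    # approximate runtime overhead
TOTAL_4K_GB = 64.2   # approximate memory needed to run

# Assumption: everything left after weights and overhead is KV cache.
KV_CACHE_4K_GB = TOTAL_4K_GB - WEIGHTS_GB - OVERHEAD_GB


def memory_range(total_gb: float, spread: float = 0.15) -> tuple[float, float]:
    """Return the (low, high) band implied by +/- spread around total_gb."""
    return total_gb * (1 - spread), total_gb * (1 + spread)


low, high = memory_range(TOTAL_4K_GB)
print(f"Implied KV cache at 4k context: {KV_CACHE_4K_GB:.2f} GB")
print(f"Stated +/-15% band: {low:.1f} to {high:.1f} GB")
```

The low end of the band falls below the weight size alone, so the upper end is the more useful planning number when choosing hardware for longer contexts.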

## Will it run on my device?
- **Apple M1 (8GB)** (8 GB): No, not enough memory
- **Generic Android Phone (8GB RAM)** (8 GB): No, not enough memory
- **iPhone 17 Pro** (12 GB): No, not enough memory
- **Nvidia GeForce RTX 4080 (16GB)** (16 GB): No, not enough memory
- **Google Pixel 10 Pro** (16 GB): No, not enough memory
- **Nvidia GeForce RTX 4090 (24GB)** (24 GB): No, not enough memory
- **Apple M4 Pro (48GB)** (48 GB): No, not enough memory
- **Apple M3 Ultra (256GB)** (256 GB): Yes, it runs — room for Q8_0
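
The verdicts above amount to comparing each device's memory with the ~64.2 GB requirement. A rough sketch of that check follows; `DEVICES` and `fits` are illustrative names, and the comparison assumes the full nominal memory is available to the model, which is optimistic because the operating system and other applications also need memory.

```python
REQUIRED_GB = 64.2  # Q4_K_M at 4k context, from the table above

# Nominal memory (GB) for the devices listed above.
DEVICES = {
    "Apple M1 (8GB)": 8,
    "Generic Android Phone (8GB RAM)": 8,
    "iPhone 17 Pro": 12,
    "Nvidia GeForce RTX 4080 (16GB)": 16,
    "Google Pixel 10 Pro": 16,
    "Nvidia GeForce RTX 4090 (24GB)": 24,
    "Apple M4 Pro (48GB)": 48,
    "Apple M3 Ultra (256GB)": 256,
}


def fits(device_gb: float, required_gb: float = REQUIRED_GB) -> bool:
    """Optimistic check: assumes all nominal memory is usable by the model."""
    return device_gb >= required_gb


for name, gb in DEVICES.items():
    verdict = "Yes, it runs" if fits(gb) else "No, not enough memory"
    print(f"{name} ({gb} GB): {verdict}")
```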

Full table of all 39 devices: https://localmodel.run/model/llama-4-scout

## How to run
Quickest path: `ollama run llama4:scout`. On Mac, LM Studio (ships MLX) is fastest; on Linux, Ollama for chat or vLLM to serve; on Windows, LM Studio or Ollama.
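
Once the model has been pulled with Ollama, it can also be called from a script through Ollama's local HTTP API. The sketch below assumes Ollama is running on its default port (11434) and that the `llama4:scout` tag is installed; `build_request` and `generate` are illustrative helper names, not part of Ollama.

```python
import json
import urllib.request

# Ollama's default local endpoint; adjust if the server runs elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "llama4:scout") -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str) -> str:
    """Send the prompt and return the generated text (requires a running server)."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `print(generate("Summarize Mixture-of-Experts in one sentence."))` prints the model's reply.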

## Details
- Parameters: 109B (MoE, 17B active per token)
- Default context: 128k tokens
- License: Llama 4 Community (commercial use: conditional)
- Released: 2025-04
- HuggingFace: 368,521 downloads/mo, 1,304 likes

## FAQ
### How much VRAM or RAM does Llama 4 Scout need?
About 64.2 GB at Q4_K_M with a 4k context (60.87 GB of weights + KV cache + overhead).
### Can Llama 4 Scout run on a laptop?
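Only on a high-memory one. At Q4_K_M, Llama 4 Scout needs about 64.2 GB, so the machine needs at least that much memory available to the model, for example a high-memory Mac. A single 24 GB GPU is not enough.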
### Can I use Llama 4 Scout commercially?
Conditionally. The Llama 4 Community License is free for services under 700M monthly active users (MAU) and carries use restrictions.

Sources: https://ollama.com/library/llama4, https://ollama.com/library/llama4/tags, https://huggingface.co/unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF, https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E-Instruct
More: https://localmodel.run/model/llama-4-scout