# Sarvam-1 2B: RAM and VRAM requirements

> Sarvam-1 2B is a 2-billion-parameter model from Sarvam AI. At Q4_K_M it needs about **2.7 GB** to run and fits on **39 of 39** tracked devices. Minimum to run: Apple M1 (8GB).

Last validated: 2026-06-15. Sources: Ollama, HuggingFace GGUF repos, vendor specs.

## Memory by quantization
| Quant | On disk | To run (4k context) |
| --- | --- | --- |
| Q4_K_M | 1.55 GB | ~2.7 GB |
| Q8_0 | 2.69 GB | ~3.8 GB |
| FP16 | 5.05 GB | ~6.1 GB |

Memory = weights + KV cache + ~0.8 GB runtime overhead, and varies ±15% with context length.
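
The formula above can be sketched in Python. The source does not publish KV-cache sizes, so the sketch backs them out from the table (run total minus weights minus the 0.8 GB overhead); treat those values as an inference. Applying the ±15% figure to the whole run total is also an assumption about how the range is meant, and the function names are illustrative only.

```python
# Minimal sketch of: memory = weights + KV cache + runtime overhead.
RUNTIME_OVERHEAD_GB = 0.8  # overhead figure quoted in the text

# quant -> (on-disk weights in GB, approximate run total in GB at 4k context),
# copied from the table above
QUANTS = {
    "Q4_K_M": (1.55, 2.7),
    "Q8_0": (2.69, 3.8),
    "FP16": (5.05, 6.1),
}


def implied_kv_cache_gb(quant: str) -> float:
    """KV cache implied by the table: total - weights - overhead.

    This is an inference from the published totals, not a measured value.
    """
    weights, total = QUANTS[quant]
    return round(total - weights - RUNTIME_OVERHEAD_GB, 2)


def run_memory_range_gb(quant: str, spread: float = 0.15) -> tuple[float, float]:
    """Low/high memory budget, assuming the +/-15% applies to the run total."""
    _, total = QUANTS[quant]
    return total * (1 - spread), total * (1 + spread)


for name in QUANTS:
    low, high = run_memory_range_gb(name)
    print(f"{name}: KV cache ~{implied_kv_cache_gb(name)} GB, "
          f"budget {low:.1f} to {high:.1f} GB")
```

The implied KV cache shrinks slightly from Q4_K_M to FP16, which suggests the published totals are rounded rather than computed from a single fixed cache size.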

## Will it run on my device?
- **Apple M1 (8GB)** (8 GB): Yes, it runs — room for Q8_0
- **Generic Android Phone (8GB RAM)** (8 GB): Yes, it runs — room for Q8_0
- **iPhone 17 Pro** (12 GB): Yes, it runs — room for FP16
- **Nvidia GeForce RTX 4080 (16GB)** (16 GB): Yes, it runs — room for FP16
- **Google Pixel 10 Pro** (16 GB): Yes, it runs — room for FP16
- **Nvidia GeForce RTX 4090 (24GB)** (24 GB): Yes, it runs — room for FP16
- **Apple M4 Pro (48GB)** (48 GB): Yes, it runs — room for FP16
- **Apple M3 Ultra (256GB)** (256 GB): Yes, it runs — room for FP16

Full table of all 39 devices: https://localmodel.run/model/sarvam-1-2b
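
One way to reproduce the verdicts in the list above is to compare each quant's run footprint against the share of device memory a model can realistically use. The source does not say what share it assumes; the 0.7 below is a hypothetical value, chosen only because it yields the same results as the list (Q8_0 on 8 GB devices, FP16 on 12 GB and up). `largest_quant_that_fits` is an illustrative name, not part of any tool.

```python
from typing import Optional

# Run footprints at 4k context from the quantization table, largest first
RUN_GB = [("FP16", 6.1), ("Q8_0", 3.8), ("Q4_K_M", 2.7)]

# ASSUMPTION: fraction of device memory available to the model.
# Not stated in the source; picked to match the listed verdicts.
USABLE_FRACTION = 0.7


def largest_quant_that_fits(device_gb: float,
                            usable_fraction: float = USABLE_FRACTION) -> Optional[str]:
    """Return the highest-precision quant whose footprint fits, or None."""
    budget = device_gb * usable_fraction
    for quant, need_gb in RUN_GB:
        if need_gb <= budget:
            return quant
    return None


for device, mem_gb in [("Apple M1 (8GB)", 8),
                       ("iPhone 17 Pro", 12),
                       ("Nvidia GeForce RTX 4090 (24GB)", 24)]:
    print(device, "->", largest_quant_that_fits(mem_gb))
```

On a discrete GPU most of the VRAM is usually available to the model, while phones and unified-memory laptops share it with the OS, so a single fraction is a simplification.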

## How to run
Use LM Studio (Mac/Windows) or Ollama / vLLM (Linux).

## Details
- Parameters: 2B
- Default context: 8k tokens
- License: Sarvam non-commercial (commercial use: no)
- Released: 2024-10
- HuggingFace: 3,838 downloads/mo, 139 likes

## FAQ
### How much VRAM or RAM does Sarvam-1 2B need?
About 2.7 GB at Q4_K_M with a 4k context (1.55 GB of weights plus KV cache and ~0.8 GB runtime overhead). Budget ~3.8 GB for Q8_0 and ~6.1 GB for FP16.
### Can Sarvam-1 2B run on a laptop?
Yes. At Q4_K_M, Sarvam-1 2B needs about 2.7 GB, so it fits on an 8 GB laptop or Mac (the Apple M1 with 8 GB is the smallest tracked device). A 16 GB machine, or a GPU with 12 GB or more, has room for FP16.
### Can I use Sarvam-1 2B commercially?
No. The Sarvam license permits non-commercial use only. Check the current HuggingFace model card for updates.

Sources: https://huggingface.co/sarvamai/sarvam-1, https://huggingface.co/bartowski/sarvam-1-GGUF
More: https://localmodel.run/model/sarvam-1-2b