# Sarvam-30B: RAM and VRAM requirements

> Sarvam-30B is a 30B Sarvam model (Mixture-of-Experts, 2.4B active per token). At Q4_K_M it needs about **21.7 GB** to run and fits **11 of 39** tracked devices. Minimum to run: Nvidia GeForce RTX 4090 (24GB).

Last validated: 2026-06-15. Sources: Ollama, HuggingFace GGUF repos, vendor specs.

## Memory by quantization
| Quant | On disk | To run (4k context) |
| --- | --- | --- |
| Q4_K_M | 19.6 GB | ~21.7 GB |

Memory = weights + KV cache + ~0.8 GB runtime overhead, and varies ±15% with context length.

## Will it run on my device?
- **Apple M1 (8GB)** (8 GB): No, not enough memory
- **Generic Android Phone (8GB RAM)** (8 GB): No, not enough memory
- **iPhone 17 Pro** (12 GB): No, not enough memory
- **Nvidia GeForce RTX 4080 (16GB)** (16 GB): No, not enough memory
- **Google Pixel 10 Pro** (16 GB): No, not enough memory
- **Nvidia GeForce RTX 4090 (24GB)** (24 GB): Yes, but tight
- **Apple M4 Pro (48GB)** (48 GB): Yes, it runs
- **Apple M3 Ultra (256GB)** (256 GB): Yes, it runs — room for FP16

Full table of all 39 devices: https://localmodel.run/model/sarvam-30b

## How to run
Use LM Studio (Mac/Windows) or Ollama / vLLM (Linux).

## Details
- Parameters: 30B (MoE, 2.4B active per token)
- Default context: 64k tokens
- License: Apache-2.0 (commercial use: yes)
- Released: 2026-03
- HuggingFace: 38,811 downloads/mo, 207 likes

## FAQ
### How much VRAM or RAM does Sarvam-30B need?
About 21.7 GB at Q4_K_M (weights 19.6 GB + KV cache + overhead) at a 4k context.
### Can Sarvam-30B run on a laptop?
Sarvam-30B is large; you need a high-memory Mac or a 24 GB+ GPU at Q4_K_M.
### Can I use Sarvam-30B commercially?
Yes, Apache-2.0 permits commercial use.

Sources: https://huggingface.co/sarvamai/sarvam-30b, https://huggingface.co/sarvamai/sarvam-30b-gguf, https://www.sarvam.ai/blogs/sarvam-30b-105b
More: https://localmodel.run/model/sarvam-30b