Social proof
About 1% of 981 scanned PCs run MiMo V2 Flash fully on GPU, and 210 keep at least some work on GPU. These figures are based on anonymous compatibility checks.
General-purpose local model brief
Quantization tip: Benchmark at least two quantizations and validate with a task-specific eval set before production use.
New to local models? Smaller quantization variants are easier to run, while larger ones can improve quality at the cost of more memory.
| Quantization | File Size | Min VRAM | Recommended VRAM | Min RAM | Context |
|---|---|---|---|---|---|
| Q4_K_M | 154.5 GB | 177.7 GB | 200.9 GB | 232 GB | 8K / 8K |
| Q5_K_M | 193.1 GB | 222.1 GB | 251 GB | 290 GB | 8K / 8K |
| Q8_0 | 309 GB | 355.3 GB | 401.7 GB | 464 GB | 8K / 8K |
| FP16 | 618 GB | 710.7 GB | 803.4 GB | 927 GB | 8K / 8K |
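The memory columns in the table appear to scale linearly with file size. The sketch below checks that reading, assuming multipliers of 1.15 (minimum VRAM), 1.30 (recommended VRAM), and 1.50 (minimum RAM). These factors are inferred from the table's numbers, not stated by the source, and the names `estimate` and `max_abs_error` are hypothetical.

```python
# Figures copied from the table above, in GB:
# (file size, min VRAM, recommended VRAM, min RAM)
TABLE = {
    "Q4_K_M": (154.5, 177.7, 200.9, 232.0),
    "Q5_K_M": (193.1, 222.1, 251.0, 290.0),
    "Q8_0": (309.0, 355.3, 401.7, 464.0),
    "FP16": (618.0, 710.7, 803.4, 927.0),
}

# Assumed multipliers, inferred from the table (not documented by the source).
MIN_VRAM_FACTOR = 1.15
REC_VRAM_FACTOR = 1.30
MIN_RAM_FACTOR = 1.50


def estimate(file_size_gb: float) -> tuple[float, float, float]:
    """Estimate (min VRAM, recommended VRAM, min RAM) in GB from file size."""
    return (
        file_size_gb * MIN_VRAM_FACTOR,
        file_size_gb * REC_VRAM_FACTOR,
        file_size_gb * MIN_RAM_FACTOR,
    )


def max_abs_error() -> float:
    """Largest gap in GB between the estimates and the published figures."""
    worst = 0.0
    for size, *published in TABLE.values():
        for est, pub in zip(estimate(size), published):
            worst = max(worst, abs(est - pub))
    return worst


if __name__ == "__main__":
    for name, (size, *_rest) in TABLE.items():
        print(name, [round(v, 1) for v in estimate(size)])
    print("max abs error (GB):", round(max_abs_error(), 2))
```

Under these assumed factors, the estimates stay within about half a gigabyte of every published figure, which is consistent with rounding. Treat the factors as a rule of thumb for this table only; actual memory use also depends on context length and runtime overhead.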
These GPUs meet the recommended 200.9 GB VRAM for the Q4_K_M quantization. Estimated speeds are approximate and assume full GPU offloading.
Budget Pick
Apple M4 Ultra · 256 GB VRAM · ~5.7 tok/s
Lowest cost that meets recommended VRAM
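The same "meets recommended VRAM" test can be applied to any memory budget. The following is a minimal sketch that uses the recommended-VRAM figures published on this page; the function name `largest_fitting_quantization` is hypothetical, and the check ignores speed, context length, and runtime overhead.

```python
from typing import Optional

# Recommended VRAM per quantization, in GB, as published on this page,
# ordered from smallest to largest.
RECOMMENDED_VRAM_GB = [
    ("Q4_K_M", 200.9),
    ("Q5_K_M", 251.0),
    ("Q8_0", 401.7),
    ("FP16", 803.4),
]


def largest_fitting_quantization(vram_gb: float) -> Optional[str]:
    """Return the largest quantization whose recommended VRAM fits, or None."""
    fitting = [name for name, need in RECOMMENDED_VRAM_GB if need <= vram_gb]
    return fitting[-1] if fitting else None


if __name__ == "__main__":
    for budget in (192, 256, 512):
        print(budget, "GB ->", largest_fitting_quantization(budget))
```

By this check alone, a 256 GB budget clears the recommended figure for Q5_K_M as well as Q4_K_M, though only by about 5 GB, so it is worth benchmarking both before committing.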