Q4_K_M
13.5 GBMin VRAM: 15.5 GB
Recommended VRAM: 17.6 GB
Min RAM: 21 GB
Context: 8K / 8K
Loading model details...
Fetching variants, compatibility details, and metadata.
Model Detail
Share Qwen3.5 27B with someone who is deciding what to run locally.
Social proof
33% of 999 scanned PCs run Qwen3.5 27B fully on GPU.
693 keep at least some work on GPU. Based on anonymous compatibility checks.
High-capability serving pick for stronger local APIs
Best for
Consider alternatives if
Quantization tip: Keep context realistic during evaluation so latency stays aligned with production expectations.
New to local models? Smaller quantization variants are easier to run, while larger ones can improve quality at the cost of more memory.
Q4_K_M
13.5 GBMin VRAM: 15.5 GB
Recommended VRAM: 17.6 GB
Min RAM: 21 GB
Context: 8K / 8K
Q5_K_M
16.9 GBMin VRAM: 19.4 GB
Recommended VRAM: 22 GB
Min RAM: 26 GB
Context: 8K / 8K
Q8_0
27 GBMin VRAM: 31.1 GB
Recommended VRAM: 35.1 GB
Min RAM: 41 GB
Context: 8K / 8K
FP16
54 GBMin VRAM: 62.1 GB
Recommended VRAM: 70.2 GB
Min RAM: 81 GB
Context: 8K / 8K
| Quantization | File Size | Min VRAM | Recommended VRAM | Min RAM | Context |
|---|---|---|---|---|---|
| Q4_K_M | 13.5 GB | 15.5 GB | 17.6 GB | 21 GB | 8K / 8K |
| Q5_K_M | 16.9 GB | 19.4 GB | 22 GB | 26 GB | 8K / 8K |
| Q8_0 | 27 GB | 31.1 GB | 35.1 GB | 41 GB | 8K / 8K |
| FP16 | 54 GB | 62.1 GB | 70.2 GB | 81 GB | 8K / 8K |
These GPUs meet the recommended 17.6 GB VRAM for the Q4_K_M quantization. Estimated speeds are approximate and assume full GPU offloading.
Budget Pick
AMD Radeon RX 7900 XT20 GB VRAM · ~47.4 tok/s
Lowest cost that meets recommended VRAM
Check price on AmazonFastest Pick
NVIDIA GeForce RTX 509032 GB VRAM · ~106.2 tok/s
Highest estimated throughput
Check price on AmazonBest Value
NVIDIA GeForce RTX 409024 GB VRAM · ~59.7 tok/s
Best speed per dollar of VRAM
Rent on RunPodNeed a detailed comparison? See all GPU rankings for Qwen3.5 27B.