Skip to main content
Partial GPU

Best variant: Q2_K

Partial GPU offload — 32 GB VRAM is above the 29 GB minimum but below the 36 GB recommendation. Some layers will spill to RAM.

GPU VRAM
32 GB
Min VRAM (best fit)
29 GB
Recommended VRAM
36 GB
Estimated tok/s
~29.1

Share this matchup

Send this page so a friend can see if NVIDIA GeForce RTX 5090 fits Qwen 2.5 72B.

Every Qwen 2.5 72B quantization on NVIDIA GeForce RTX 5090

Each row runs the compatibility engine against your VRAM, RAM, and the model's requirements.

QuantizationFile SizeMin VRAMRec VRAMContextVerdictEstimated tok/s
Q2_KBest fit27 GB29 GB36 GB8K / 128KPartial GPU~29.1
Q3_K_M35 GB37 GB44 GB8K / 128KHybrid CPU+GPU~11
Q4_K_M42 GB44 GB48 GB8K / 128KHybrid CPU+GPU~11
Q5_K_M50 GB52 GB58 GB8K / 128KHybrid CPU+GPU~10

NVIDIA GeForce RTX 5090 is solid pick for Qwen 2.5 72B

Need second card or fresh build? These links help support site at no extra cost.

All hardware for Qwen 2.5 72BBest GPU for Qwen 2.5 72BModels that fit NVIDIA GeForce RTX 5090Full model detailsBrowse all models