Hybrid CPU+GPU

Best variant: Q4_K_M

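CPU + GPU hybrid: the card's 11 GB of VRAM is below the 14 GB minimum, but 64 GB of system RAM is sufficient to run the model with part of it on the CPU. Expect significantly slower inference than on a GPU that meets the VRAM minimum (estimated ~13 tok/s).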

GPU VRAM: 11 GB
Min VRAM (best fit): 14 GB
Recommended VRAM: 16 GB
Estimated tok/s: ~13

Every InternLM 2.5 20B quantization on NVIDIA GeForce RTX 2080 Ti

Each row runs the compatibility engine against your VRAM, RAM, and the model's requirements.

Quantization | File Size | Min VRAM | Rec VRAM | Context | Verdict | Estimated tok/s
Q4_K_M (Best fit) | 12 GB | 14 GB | 16 GB | 8K / 32K | Hybrid CPU+GPU | ~13
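
The page does not publish how the compatibility engine decides a verdict, so the following is only a minimal sketch of how such a check could work, using the numbers from the row above. The rule order, the names QuantRequirements and verdict, and every label other than "Hybrid CPU+GPU" are assumptions made for illustration, as is the idea that hybrid mode requires system RAM at least as large as the model file.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QuantRequirements:
    """Requirements for one quantization, in GB (values copied from the table)."""
    name: str
    file_size_gb: float
    min_vram_gb: float
    rec_vram_gb: float


def verdict(vram_gb: float, ram_gb: float, req: QuantRequirements) -> str:
    """Hypothetical verdict rule; the real engine's logic is not documented here."""
    if vram_gb >= req.rec_vram_gb:
        return "Full GPU"            # assumed label
    if vram_gb >= req.min_vram_gb:
        return "Full GPU (tight)"    # assumed label
    # Assumption: hybrid is possible when system RAM can hold the model file.
    if ram_gb >= req.file_size_gb:
        return "Hybrid CPU+GPU"      # label taken from the page
    return "Does not fit"            # assumed label


q4_k_m = QuantRequirements("Q4_K_M", file_size_gb=12, min_vram_gb=14, rec_vram_gb=16)

# RTX 2080 Ti: 11 GB VRAM with 64 GB system RAM, as listed on this page.
print(verdict(11, 64, q4_k_m))
```

With 11 GB of VRAM and 64 GB of RAM, the sketch falls through both VRAM checks and lands on the hybrid branch, matching the verdict shown in the table.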

Upgrade options that fit InternLM 2.5 20B better

Cheapest fit

NVIDIA GeForce RTX 5080

16 GB VRAM · ~64 tok/s

Best performance

NVIDIA GeForce RTX 5090

32 GB VRAM · ~119.5 tok/s

Rent GPU instead of buying one

If the local fit is weak, a cloud GPU gets you running today without a hardware upgrade.
