Partial GPU

Best variant: Q4_K_M

Partial GPU offload — 11 GB VRAM is above the 9.5 GB minimum but below the 12 GB recommendation. Some layers will spill to RAM.

GPU VRAM: 11 GB
Min VRAM (best fit): 9.5 GB
Recommended VRAM: 12 GB
Estimated tok/s: ~42.1

Every Phi-3 Medium 14B quantization on NVIDIA GeForce RTX 2080 Ti

Each row runs the compatibility engine against your VRAM, RAM, and the model's requirements.

Quantization       File Size  Min VRAM  Rec VRAM  Context  Verdict         Estimated tok/s
Q4_K_M (Best fit)  8.2 GB     9.5 GB    12 GB     4K / 4K  Partial GPU     ~42.1
Q8_0               14.8 GB    16 GB     20 GB     4K / 4K  Hybrid CPU+GPU  ~13
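
The verdicts in the table are consistent with a simple threshold comparison between the card's VRAM and each quantization's minimum and recommended figures. The sketch below is one way such a rule could look. It is an assumption inferred from the two rows shown here, not the site's actual compatibility engine. The names `Quant` and `verdict` and the "Full GPU" label are hypothetical, and the system RAM check the page mentions is left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Quant:
    """VRAM figures for one quantization, taken from the table above."""
    name: str
    min_vram_gb: float
    rec_vram_gb: float


def verdict(gpu_vram_gb: float, quant: Quant) -> str:
    """Classify a GPU/quantization pair by VRAM thresholds (assumed rule)."""
    if gpu_vram_gb >= quant.rec_vram_gb:
        # Hypothetical label: no row on this page reaches this tier.
        return "Full GPU"
    if gpu_vram_gb >= quant.min_vram_gb:
        # Above the minimum but below the recommendation.
        return "Partial GPU"
    # Below the minimum: the model is split between CPU and GPU.
    return "Hybrid CPU+GPU"


# Figures for Phi-3 Medium 14B as listed on this page.
QUANTS = [
    Quant("Q4_K_M", min_vram_gb=9.5, rec_vram_gb=12.0),
    Quant("Q8_0", min_vram_gb=16.0, rec_vram_gb=20.0),
]

RTX_2080_TI_VRAM_GB = 11.0

for q in QUANTS:
    print(f"{q.name}: {verdict(RTX_2080_TI_VRAM_GB, q)}")
```

With 11 GB of VRAM, this rule gives "Partial GPU" for Q4_K_M and "Hybrid CPU+GPU" for Q8_0, matching the table.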

NVIDIA GeForce RTX 2080 Ti is a solid pick for Phi-3 Medium 14B

