The exact KV cache usage of DeepSeek V4

/u/Ok_Warning2146 / April 26, 2026

Figure 1 of the DSV4 paper seems to imply that DSV3.2 uses ~50GB of KV cache at 1M context and DSV4 uses ~5GB (https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf). From my own calculations, the correct FP16 KV cache at 1M context should be: M…
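For anyone who wants to redo this kind of arithmetic, here is a minimal sketch of the generic KV cache size formula (tokens × layers × cached elements per token per layer × bytes per element). The function names and the example numbers are hypothetical placeholders chosen for illustration; they are not the DSV3.2 or DSV4 configuration and are not taken from the paper. It also assumes "1M" means 1,000,000 tokens and that every layer caches the same number of elements.

```python
def gqa_elems_per_token(num_kv_heads: int, head_dim: int) -> int:
    """Cached elements per token per layer for plain MHA/GQA: one K and one V vector per KV head."""
    return 2 * num_kv_heads * head_dim


def kv_cache_bytes(num_tokens: int, num_layers: int,
                   elems_per_token_per_layer: int, bytes_per_elem: int = 2) -> int:
    """Total KV cache size in bytes; bytes_per_elem=2 corresponds to FP16."""
    return num_tokens * num_layers * elems_per_token_per_layer * bytes_per_elem


def to_gb(num_bytes: int) -> float:
    """Decimal gigabytes (1 GB = 10**9 bytes)."""
    return num_bytes / 1e9


if __name__ == "__main__":
    # Hypothetical values, NOT a real DeepSeek config.
    tokens = 1_000_000   # assumed meaning of "1M context"
    layers = 60          # placeholder layer count
    elems = 512          # placeholder compressed-cache width per token per layer
    print(f"{to_gb(kv_cache_bytes(tokens, layers, elems)):.2f} GB")
```

For a compressed-cache architecture, pass the per-token latent width directly as `elems_per_token_per_layer`; for a standard GQA model, compute it with `gqa_elems_per_token` first.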
