LocalLLaMA

llama.cpp’s Preliminary SM120 Native NVFP4 MMQ Is Merged

https://github.com/ggml-org/llama.cpp/pull/22196

And somehow we already got some GGUFs for it!

https://huggingface.co/CISCai/gemma-4-31B-it-NVFP4-turbo-GGUF

https://huggingface.co/stevelikesrhino/gemma-4-31B-it-nvfp4-GGUF (this one is from the PR author)
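For anyone unfamiliar with the format: NVFP4 stores weights as 4-bit FP4 (E2M1) values in small blocks, each block sharing a scale. Below is a minimal Python sketch of that idea, not code from the PR. The 16-element block size and the E2M1 value set match the NVFP4 format, but the scale handling is deliberately simplified (real NVFP4 uses FP8 E4M3 block scales plus a tensor-level scale, and the merged MMQ kernels do all of this on the GPU).

```python
# Sketch of NVFP4-style block quantization (simplified: a plain float
# scale per block instead of NVFP4's FP8 E4M3 block scales).
E2M1 = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]  # FP4 E2M1 magnitudes
VALUES = sorted(E2M1 + [-v for v in E2M1])        # all representable values

BLOCK = 16  # NVFP4 block size

def quantize_block(block):
    """Map one 16-element block to E2M1 codes plus a shared scale."""
    amax = max(abs(x) for x in block) or 1.0
    scale = amax / 6.0  # put the block's max at the largest E2M1 magnitude
    q = [min(VALUES, key=lambda v: abs(x / scale - v)) for x in block]
    return scale, q

def dequantize_block(scale, q):
    return [scale * v for v in q]

block = [0.1 * i for i in range(BLOCK)]
scale, q = quantize_block(block)
recon = dequantize_block(scale, q)
```

The coarse 8-magnitude grid is why the per-block scale matters so much: each block is rescaled so its largest value lands exactly on 6.0, keeping the quantization error proportional to the block's own range.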