/u/Jordanthecomeback

Qwen 27b and Other Dense Models Optimization

/u/Jordanthecomeback / April 5, 2026

Hi all, I hadn't realized that KV cache quantization made such a big difference, so on my 64 GB M2 Max Mac Studio I switched from Qwen 3.5 35B A3B to the dense 27B. I love it, and the difference in quality is huge, but I only get about 3 tokens per second. I have kv …
