I can’t get Qwen3.6 27B to outperform Qwen-Coder-Next and I’m not sure why

In my real-world usage (opencode) and in my synthetic benchmarks, Coder-Next (Q5) demolishes the whole Qwen3.6 family including the 27B Dense model (All Q8).

Everybody else is hailing that 27B is superior and is an amazing model, but I haven't been able to replicate any of that. Coder-Next seems to overperform, and 27B seems to underperform.

I am using the recommended settings on the model cards, and I have tried several 27B models including the MTP one Unsloth released.

I'm using llama.cpp with a 96GB variant Strix Halo machine. I would think it's the speed that is causing it to trip up, but 35BA3B also performs poorly.

Has anybody ran into this? Is 27B just being compared to other GPU sized models, or is something in my setup not optimal?

submitted by /u/Forward_Jackfruit813
[link] [comments]

Leave a Comment