Used over a million tokens in three separate sessions to test Qwen 3.6 35b (new Multi-token Prediction version)
In my opinion, MTP models are 100% game changer for local LLMs. In terms of speed, I was getting around 1.5x the tok/sec of previous tests. The project was a test – building a full iterative step-by-step pygame; a small mystery dungeon-style game. At f…