Has anyone deployed Kimi K2.6 on their local hardware?

What hardware should I expect to add to the cart if I want to run Kimi K2.6? I need the full 265k context window and the unquantized model, not a quantized variant. I'm looking for a realistic hardware estimate that gets at least 25-30 tok/s. I'm open to looking into TurboQuant for KV cache compression, though.
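
For anyone who wants to sanity-check numbers before replying, here is a rough back-of-the-envelope sketch of the arithmetic involved. It is only an approximation: it assumes a plain K/V cache (a model with a compressed or latent attention cache would need less), it assumes decoding is limited purely by memory bandwidth, and every function name and parameter below is a hypothetical placeholder to be filled in from the model card and the hardware spec sheet, not a published Kimi K2.6 figure.

```python
# Rough sizing helpers. All inputs are placeholders supplied by the caller;
# none of them are published Kimi K2.6 specifications.

def weight_bytes(total_params: float, bytes_per_param: float) -> float:
    """Memory needed just to hold the weights (e.g. 2 bytes/param for BF16)."""
    return total_params * bytes_per_param


def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   context_len: int, bytes_per_elem: float,
                   batch: int = 1) -> float:
    """KV cache size assuming one K and one V vector per layer, per KV head,
    per token. Compressed-cache architectures will come in below this."""
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem * batch


def decode_tok_per_s(active_params: float, bytes_per_param: float,
                     mem_bandwidth_bytes_per_s: float) -> float:
    """Upper bound on single-stream decode speed if every generated token
    requires reading all active weights once from memory. Ignores KV cache
    reads, compute limits, and interconnect overhead."""
    return mem_bandwidth_bytes_per_s / (active_params * bytes_per_param)


def total_memory_gb(weights: float, kv_cache: float) -> float:
    """Combined footprint in GB (10**9 bytes), before runtime overhead."""
    return (weights + kv_cache) / 1e9
```

To use it, plug in the parameter counts, layer and head dimensions, and precision from the model's config, plus the aggregate memory bandwidth of the hardware being considered. The result is a ceiling, so real-world throughput should be expected to land below it.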

submitted by /u/Oxydised