Agentic coding Qwen 3.6, Q6_K 125k context vs Q5_K_XL 200k context

What would you choose if you were in my shoes? How viable is 125k for agentic coding really? is "compact" really good enough, or would you go with Q6_K 125k?

I am getting around 165-170 tok/sec with either config with my 5090.

submitted by /u/ComfyUser48
[link] [comments]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top