LocalLLaMA

Qwen 3.6 35B A3B Q4_K_M quant evaluation

About the model: 35B total parameters, 3B active (A3B), mixture-of-experts architecture. Evaluation approach: we took the Q4_K_M quantized GGUF from Unsloth, ran it on CPU via llama-cpp-python, and tested it on three standard benchmarks: – HumanEva…