LocalLLaMA

Can we talk about the reasoning token format chaos?

Qwen/DeepSeek: <think>…</think>

Gemma: <|channel>…<channel|> Ok weird but sure.

Gemma again, sometimes: just bare thought\n with no delimiters at all

vLLM has --reasoning-parser flags per model which helps but that's b…
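For anyone post-processing raw completions by hand, here is a rough sketch of a client-side splitter for the three cases listed above. The delimiter strings are copied literally from this post and have not been checked against any tokenizer or chat template. `split_reasoning` and `Split` are made-up names for illustration, not part of vLLM or any other library.

```python
import re
from typing import NamedTuple, Optional


class Split(NamedTuple):
    fmt: str                  # "think", "channel", "bare", or "none"
    reasoning: Optional[str]  # None when no boundary could be found
    answer: str


# Delimiter pairs taken as written in the post (assumption, not verified).
_PAIRS = {
    "think": ("<think>", "</think>"),
    "channel": ("<|channel>", "<channel|>"),
}


def split_reasoning(text: str) -> Split:
    """Separate a reasoning block from the visible answer, if one is delimited."""
    for name, (open_tag, close_tag) in _PAIRS.items():
        pattern = re.escape(open_tag) + r"(.*?)" + re.escape(close_tag)
        m = re.search(pattern, text, flags=re.DOTALL)
        if m:
            # Everything outside the delimited block counts as the answer.
            answer = (text[:m.start()] + text[m.end():]).strip()
            return Split(name, m.group(1).strip(), answer)
    if text.startswith("thought\n"):
        # No closing delimiter, so the end of the reasoning is unknowable.
        # Flag it and pass the text through untouched instead of guessing.
        return Split("bare", None, text)
    return Split("none", None, text)
```

The bare `thought\n` case is the annoying one: with no closing marker, all the sketch can do is label the output so the caller decides what to do with it.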