Llama.cpp, opencode / pi / basically all agents, context compaction & cache validation: how do you manage it?
OK, so I will try to explain myself as thoroughly as possible, because I really cannot find much about this online. Let's start with my settings for running Qwen 3.6 35B: Qwen 3.6: cmd: '/X --port ${PORT} --chat-template-kwargs '{"preserve_thi…