LocalLLaMA

The definitive Qwen 3.5 Jinja template

I’ve been doing a pretty thorough deep dive into the Qwen 3.5 templating logic to properly fix the lingering tool-calling bugs. People here have done some really brilliant groundwork; templates from folks like @pneuny and @ellary were absolute lifesave…

Dual A100X local workflow

Came across these A100Xs at work and decided to keep them for internal use. We were not sure what to use them for, but I came up with a workflow that uses RAG to let a local model access our inventory database and have users interact with …

Don’t buy Mac Studio now.

I've been totally obsessed with local models lately, and with some cybersecurity projects that need to run locally, I'm gearing up to grab a Mac Studio, staring at this page every day. And I just found out!!! Last month, after Apple quietl…

Run Qwen3.5-397B-A13B with vLLM and 8xR9700

Special thanks to u/Sea-Speaker1700 for making it possible to run mxfp4 on the R9700 GPU; the first guide, for running 122B models, is here. Well, the 397B model works amazingly well and is super fast. Use this Dockerfile to build the image (original image provided by u/Sea-Speaker1700): FROM…
