LocalLLaMA

Stanford: Self-improving Meta-Harness

We had prompt engineering, then context engineering, then agents and harnesses. Now we have Meta-Harness, a harness that auto-corrects its agentic mistakes, improves performance, and uses less context: https://arxiv.org/abs/2603.28052 "The p…

More Gemma4 fixes in the past 24 hours

Reasoning budget fix (merged): https://github.com/ggml-org/llama.cpp/pull/21697
New chat templates from Google to fix tool calling:
31B: https://huggingface.co/google/gemma-4-31B-it/blob/main/chat_template.jinja
27B: https://huggingface.co/google/gemma…

Creating Pi Extension with Pi and Qwen3.5 27B

Following my latest post about setting up Claude Code for use with local models, I received a recommendation in the comments to try **Pi**. The suggestion was based on its customizability and superior harness for local models. Unlike Claude Code, whi…
