What is your actual local LLM stack right now?
I keep trying new models, but the bigger difference usually comes from the setup around them, not the model itself: backend, frontend, RAG or no RAG, quant choice, GPU offload, context settings, prompt format, whatever janky glue holds it together. A lot of lo…
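For concreteness, here is one way to write down the knobs I mean as a single config. This is just an illustrative sketch, not my actual stack: the names are hypothetical, loosely modeled on common llama.cpp-style parameters, and the values are placeholders.

```python
# Illustrative config for the "setup around the model" knobs.
# All names and values are hypothetical examples, not a recommendation.
stack = {
    "backend": "llama.cpp",    # inference engine serving the model
    "frontend": "open-webui",  # chat UI talking to the backend
    "rag": False,              # retrieval layer enabled or not
    "quant": "Q4_K_M",         # quantization baked into the model file
    "n_gpu_layers": 32,        # GPU offload: layers pushed to VRAM
    "n_ctx": 8192,             # context window setting
    "chat_format": "chatml",   # prompt template the model expects
}

# Swapping any one of these can change output quality more than
# swapping the model checkpoint itself.
print(sorted(stack))
```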