LocalLLaMA

I made a 35% REAP of 397B with potentially usable quality in 96GB GPU

/u/Goldkoron / April 5, 2026

submitted by /u/Goldkoron [link] [comments]

Gemma 4 vs Qwen3.5 on SVG style

/u/iChrist / April 5, 2026

Some quick test using Gemma4-31B and Qwen3.5-27B, both Q4 quants from unsloth. I was already expecting Gemma 4 to be excellent at creative writing and better at translations for more obscure languages, but I didn’t expected to be that good at fun…

LocalLLaMA

Best LLM for Mac Mini M4 Pro (64GB RAM) – Focus on Agents, RAG, and Automation?

/u/farmatex / April 5, 2026

Hi everyone! I just got my hands on a Mac Mini M4 Pro with 64GB. My goal is to replace ChatGPT on my phone and desktop with a local setup. I’m specifically looking for models that excel at: Web Search & RAG: High context window and accuracy for re…

LocalLLaMA

Gemma 4 26B A4B Single Page ASCII Chatbot Design

/u/Reaper_9382 / April 5, 2026

Built a single chatbot HTML page using Gemma 4 26B A4B running locally sharded between my 7900 XT and 3060 Ti with 32K context window at 50-65 t/s. Connects to LM Studio's API with full streaming, Markdown rendering, model selector, 6 parame…

LocalLLaMA

Are ocr engines like tesseract still valid or do people just use image recognition models now.

/u/optipuss / April 5, 2026

had this thought when someone just used qwen3.5 to read the content of a pdf file very accurately even the signature. so this question arose in my mind. submitted by /u/optipuss [link] [comments]

LocalLLaMA

state of r/locallama after Gemma4 release.

/u/GreenGreasyGreasels / April 5, 2026

submitted by /u/GreenGreasyGreasels [link] [comments]

LocalLLaMA

One year ago DeepSeek R1 was 25 times bigger than Gemma 4

/u/rinaldo23 / April 5, 2026

I'm mind blown by the fact that about a year ago DeepSeek R1 came out with a MoE architecture at 671B parameters and today Gemma 4 MoE is only 26B and is genuinely impressive. It's 25 times smaller, but is it 25 times worse? I'm exited abou…

LocalLLaMA

Gemini 3.1 Pro Level Performance With Gemma4-31B Multi-Agent Swarm

/u/Ryoiki-Tokuiten / April 4, 2026

submitted by /u/Ryoiki-Tokuiten [link] [comments]

LocalLLaMA

Looking for smallest VLM for NSFW image detector (atleast 5 it/s on CPU)

/u/nihalxx3 / April 4, 2026

Hello everyone, I am looking for a very small VLM or Transformer based ViT, which will inference over images (each size less than 10MB, any ratio/resolution possible). The model should return 1 or 0 that the img is NSFW or not, thats it. I want the mod…

LocalLLaMA

Feeling a bit handicapped by my 7900 XT. Is Apple the move?

/u/vick2djax / April 4, 2026

I’ve been using ChatGPT, Gemini and Claude for a long time. My work is being a Salesforce developer/admin/holyshiteverything. I’ve got an Unraid machine with an Intel i9-12900K, 64 GB of RAM, an unholy amount of storage that serves a lot of dockers lik…