I made a 35% REAP of 397B with potentially usable quality in 96GB GPU
submitted by /u/Goldkoron [link] [comments]
submitted by /u/Goldkoron [link] [comments]
Some quick test using Gemma4-31B and Qwen3.5-27B, both Q4 quants from unsloth. I was already expecting Gemma 4 to be excellent at creative writing and better at translations for more obscure languages, but I didn’t expected to be that good at fun…
Hi everyone! I just got my hands on a Mac Mini M4 Pro with 64GB. My goal is to replace ChatGPT on my phone and desktop with a local setup. I’m specifically looking for models that excel at: Web Search & RAG: High context window and accuracy for re…
Built a single chatbot HTML page using Gemma 4 26B A4B running locally sharded between my 7900 XT and 3060 Ti with 32K context window at 50-65 t/s. Connects to LM Studio's API with full streaming, Markdown rendering, model selector, 6 parame…
had this thought when someone just used qwen3.5 to read the content of a pdf file very accurately even the signature. so this question arose in my mind. submitted by /u/optipuss [link] [comments]
submitted by /u/GreenGreasyGreasels [link] [comments]
I'm mind blown by the fact that about a year ago DeepSeek R1 came out with a MoE architecture at 671B parameters and today Gemma 4 MoE is only 26B and is genuinely impressive. It's 25 times smaller, but is it 25 times worse? I'm exited abou…
submitted by /u/Ryoiki-Tokuiten [link] [comments]
Hello everyone, I am looking for a very small VLM or Transformer based ViT, which will inference over images (each size less than 10MB, any ratio/resolution possible). The model should return 1 or 0 that the img is NSFW or not, thats it. I want the mod…
I’ve been using ChatGPT, Gemini and Claude for a long time. My work is being a Salesforce developer/admin/holyshiteverything. I’ve got an Unraid machine with an Intel i9-12900K, 64 GB of RAM, an unholy amount of storage that serves a lot of dockers lik…