Author name: /u/redblood252

Higher precision or higher parameter count

/u/redblood252 / April 25, 2026

I’m wondering: if we take models of the same family (e.g. Qwen3.5 MoEs) and compare GGUFs with different parameter counts and different quantizations but similar sizes, which model would be better for tasks? If it varies, I’m mostly interested in codi…

LocalLLaMA

Which model to summarize RSS news articles

/u/redblood252 / April 19, 2026

I don’t know what to test or how to test the quality of news article summaries, but I know I don’t need a very large model. Preferably, I’m looking for something that uses little VRAM or runs on CPU only but is still sufficient for this use case. I won’t need something…