Author name: /u/redblood252

LocalLLaMA

Higher precision or higher parameter count

I’m wondering if we take models of the same family (e.g qwen3.5 moes). And we compared ggufs that are of different core counts different quantizations but similar sizes. Which model would be better for tasks? If it varies I’m mostly interested in codi…

LocalLLaMA

Which model to summarize rss news articles

I don’t know what nor how to test the quality of summaries of news articles. But I know I don’t need very large models. I’m looking preferably for something that uses low vram or cpu only but that is sufficient for this use case. I won’t need something…

Scroll to Top