| RedHatAI/gemma-4-31B-it-FP8-block vs Sehyo/Qwen3.5-122B-A10B-NVFP4 It's different quant but both are using about 90GB vram. I prefer gemma4 for financial summary. The output is concise. It also properly explaining 'resort facility' while qwen just say 'a facility'. Qwen also missed 'higher-than-expected recoveries...'. Tht's material missed. I cited example for just one instance, but in general I am very impressed with gemma4 summary compared to other models. But qwen3.5 is better at agentic coding. Gemma4 sometimes stop at mid task. Would love to hear feedback if anyone has similar experience or any model suggestion. [link] [comments] |