LocalLLaMA

LocalLLaMA

Gemma 4 MoE hitting 120 TPS on Dual 3090s!

Thought I'd share some benchmark numbers from my local setup. Hardware: Dual NVIDIA RTX 3090s Model: Gemma 4 (MoE architecture) Performance: ~120 Tokens Per Second The efficiency of this MoE implementation is unreal. Even with a heavy load, the thr…

LocalLLaMA

Visual Guide to Gemma 4

source: https://x.com/osanseviero/status/2040105484061954349 https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-gemma-4 submitted by /u/jacek2023 [link] [comments]

Scroll to Top