/u/C_Coffie - Provide.ai

Ran the same models across Strix Halo, RTX 3090, and RTX 5070 because I wanted my own numbers

/u/C_Coffie / May 16, 2026

I kept seeing inference-speed claims for these models and wanting an apples-to-apples comparison on the hardware I actually have. So I built a harness and a public page that dumps every run as YAML. The dataset: 55 runs, three rigs, five backends (rocm…

Author name: /u/C_Coffie

Ran the same models across Strix Halo, RTX 3090, and RTX 5070 because I wanted my own numbers