Ran the same models across Strix Halo, RTX 3090, and RTX 5070 because I wanted my own numbers
I kept seeing inference-speed claims for these models and wanting an apples-to-apples comparison on the hardware I actually have. So I built a harness and a public page that dumps every run as YAML. The dataset: 55 runs, three rigs, five backends (rocm…