/u/Financial_Buy_2287

INT3 compression+fused metal kernels [R]

/u/Financial_Buy_2287 / April 22, 2026

Hey guys, I am a researcher and solo founder. I compress models with INT3 at +0.14 nats and built a 2-bit KV cache for long-horizon tasks. I shipped both (INT3 model + INT2 KV) with custom fused Metal kernels for Mac (M-series). Currently Qwen 7B is av…

Author name: /u/Financial_Buy_2287

INT3 compression+fused metal kernels [R]