
[Project] Kuat: A Rust-based, Zero-Copy Dataloader for PyTorch (4.6x training speedup on T4/H100)

Hi everyone,

We built a drop-in replacement for torch.utils.data.DataLoader entirely in Rust.

The Problem: Python's multiprocessing isolates workers, so every batch incurs IPC and pickling overhead on its way back to the main process. Even on a T4, the CPU often bottlenecks whil…
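To make the pickling point concrete, here is a minimal standard-library sketch of the difference between copying a batch through serialization and handing out a view of the same buffer. This is not Kuat's code or API: `roundtrip_via_pickle` and `zero_copy_view` are hypothetical helper names, and a `bytearray` stands in for a real tensor batch.

```python
import pickle


def roundtrip_via_pickle(batch: bytearray) -> bytearray:
    """Roughly what crossing a process boundary costs: serialize, then
    deserialize, which yields a full independent copy of the data."""
    payload = pickle.dumps(batch, protocol=pickle.HIGHEST_PROTOCOL)
    return pickle.loads(payload)


def zero_copy_view(batch: bytearray) -> memoryview:
    """A view over the same underlying buffer; no bytes are copied."""
    return memoryview(batch)


# Stand-in for a batch produced by a worker (assumption: 1 KiB of bytes).
batch = bytearray(b"\x01" * 1024)

copied = roundtrip_via_pickle(batch)
view = zero_copy_view(batch)

# Mutate the original buffer to show which result shares memory with it.
batch[0] = 0xFF

print("pickled copy sees the change:", copied[0] == 0xFF)
print("zero-copy view sees the change:", view[0] == 0xFF)
```

The pickled result is a separate object, so the copy cost grows with batch size, while the view refers to the original memory. How Kuat actually shares buffers between Rust and PyTorch is not shown here.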