DPU or GPU for Accelerating Neural Networks Inference — Why not both? Split CNN Inference
arXiv:2605.00174v1 Announce Type: cross
Abstract: Video and image streaming on edge devices requires low latency. To address this, Neural Networks (NNs) are widely used, and prior work mainly focuses on accelerating them with single hardware units suc…