DVM: Real-Time Kernel Generation for Dynamic AI Models
arXiv:2603.24239v1 Announce Type: cross
Abstract: Dynamism is common in AI computation, e.g., the dynamic tensor shapes and the dynamic control flows in models. Due to the long compilation time, existing runtime compilation damages the model efficienc…