FrameDiT: Diffusion Transformer with Matrix Attention for Efficient Video Generation
arXiv:2603.09721v2 Announce Type: replace
Abstract: High-fidelity video generation remains challenging for diffusion models due to the difficulty of modeling complex spatio-temporal dynamics efficiently. Recent video diffusion methods typically repres…