Long-Horizon Streaming Video Generation via Hybrid Attention with Decoupled Distillation
arXiv:2604.10103v1 Announce Type: new
Abstract: Streaming video generation (SVG) distills a pretrained bidirectional video diffusion model into an autoregressive model equipped with sliding window attention (SWA). However, SWA inevitably loses distant…