cs.CV

Long-Horizon Streaming Video Generation via Hybrid Attention with Decoupled Distillation

arXiv:2604.10103v1 Announce Type: new
Abstract: Streaming video generation (SVG) distills a pretrained bidirectional video diffusion model into an autoregressive model equipped with sliding window attention (SWA). However, SWA inevitably loses distant…