cs.AI, cs.CV

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

arXiv:2603.25730v1 Announce Type: new
Abstract: Autoregressive video diffusion models have demonstrated remarkable progress, yet they remain bottlenecked by intractable linear KV-cache growth, temporal repetition, and compounding errors during long-vi…