CRePE: Curved Ray Expectation Positional Encoding for Unified-Camera-Controlled Video Generation
arXiv:2605.12938v1 Announce Type: cross
Abstract: Camera-conditioned video generation requires positional encoding that remains reliable under changes in camera motion, lens configuration, and scene structure. However, existing attention-level camera …