Representations Before Pixels: Semantics-Guided Hierarchical Video Prediction
arXiv:2604.11707v1 Announce Type: new
Abstract: Accurate future video prediction requires both high visual fidelity and consistent scene semantics, particularly in complex dynamic environments such as autonomous driving. We present Re2Pix, a hierarchi…