cs.CV

WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation

arXiv:2603.15132v2 Announce Type: replace
Abstract: While recent Flow Matching models avoid the reconstruction bottlenecks of latent autoencoders by operating directly in pixel space, the lack of semantic continuity in the pixel manifold severely inte…