cs.CV

TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking

arXiv:2605.12587v1 Announce Type: new
Abstract: Dense 3D tracking from monocular video is fundamental to dynamic scene understanding. While recent 3D foundation models provide reliable per-frame geometry, recovering object motion in this geometry rema…