Audio-Visual Camera Pose Estimation with Passive Scene Sounds and In-the-Wild Video
arXiv:2512.12165v3 Announce Type: replace
Abstract: Understanding camera motion is a fundamental problem in embodied perception and 3D scene understanding. While visual methods have advanced rapidly, they often struggle under visually degraded conditi…