Speak, Segment, Track, Navigate: An Interactive System for Video-Guided Skull-Base Surgery
arXiv:2603.16024v2 Announce Type: replace
Abstract: We introduce a speech-guided embodied agent framework for video-guided skull base surgery that dynamically executes perception and image-guidance tasks in response to surgeon queries. The proposed sy…