EggHand: A Multimodal Foundation Model for Egocentric Hand Pose Forecasting
arXiv:2605.07642v1 Announce Type: new
Abstract: Forecasting future 3D hand pose sequences from egocentric video is essential for understanding human intention and enabling embodied applications such as AR/VR assistance and human-robot interaction. How…