Motion-Guided Semantic Alignment with Negative Prompts for Zero-Shot Video Action Recognition
arXiv:2604.17062v1 Announce Type: new
Abstract: Zero-shot action recognition is challenging due to the semantic gap between seen and unseen classes. We present a novel framework that enhances CLIP with disentangled embeddings and semantic-guided inter…