cs.CV

Adversarial Video Promotion Against Text-to-Video Retrieval

arXiv:2508.06964v3 Announce Type: replace
Abstract: Thanks to the development of cross-modal models, text-to-video retrieval (T2VR) is advancing rapidly, but its robustness remains largely unexamined. Existing attacks against T2VR are designed to push…