cs.CV

ScriptHOI: Learning Scripted State Transitions for Open-Vocabulary Human-Object Interaction Detection

arXiv:2605.05057v1 Announce Type: new
Abstract: Open-vocabulary human-object interaction (HOI) detection requires recognizing interaction phrases that may not appear as annotated categories during training. Recent vision-language HOI detectors improve…