Pixels or Positions? Benchmarking Modalities in Group Activity Recognition
arXiv:2511.12606v2 Announce Type: replace
Abstract: Group Activity Recognition (GAR) is well studied on the video modality for surveillance and indoor team sports (e.g., volleyball, basketball). Yet, other modalities such as agent positions and trajec…