cs.CL

AUDITA: A New Dataset to Audit Humans vs. AI Skill at Audio QA

arXiv:2604.21766v1 Announce Type: new
Abstract: Existing audio question answering benchmarks largely emphasize sound event classification or caption-grounded queries, often enabling models to succeed through shortcut strategies, short-duration cues, l…

Scroll to Top