StarDrinks: An English and Korean Test Set for SLU Evaluation in a Drink Ordering Scenario
arXiv:2604.26500v1 Announce Type: new
Abstract: LLMs and speech assistants are increasingly used for task-oriented interactions, yet their evaluation often relies on controlled scenarios that fail to capture the variability and complexity of real user…