Commander-GPT: Dividing and Routing for Multimodal Sarcasm Detection
arXiv:2506.19420v2 Announce Type: replace
Abstract: Multimodal sarcasm understanding is a high-order cognitive task. Although large language models (LLMs) have shown impressive performance on many downstream NLP tasks, growing evidence suggests that t…