cs.CV

Video-Only ToM: Enhancing Theory of Mind in Multimodal Large Language Models

arXiv:2603.24484v1 Announce Type: new
Abstract: As large language models (LLMs) continue to advance, there is increasing interest in their ability to infer human mental states and demonstrate a human-like Theory of Mind (ToM). Most existing ToM evalua…