cs.AI, cs.CV

Why Do Vision Language Models Struggle To Recognize Human Emotions?

arXiv:2604.15280v1 Announce Type: new
Abstract: Understanding emotions is a fundamental ability for intelligent systems to be able to interact with humans. Vision-language models (VLMs) have made tremendous progress in the last few years for many visu…