cs.CL

Humans vs Vision-Language Models: A Unified Measure of Narrative Coherence

arXiv:2603.25537v1 Announce Type: new
Abstract: We study narrative coherence in visually grounded stories by comparing human-written narratives with those generated by vision-language models (VLMs) on the Visual Writing Prompts corpus. Using a set of …