cs.AI, cs.CV

Video Panels for Long Video Understanding

arXiv:2509.23724v2 Announce Type: replace
Abstract: Recent Video-Language Models (VLMs) achieve promising results on long-video understanding, but their performance still lags behind that achieved on tasks involving images or short videos. This has le…