cs.CV

PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance

arXiv:2411.02327v4 Announce Type: replace
Abstract: In the past year, video-based large language models (Video LLMs) have achieved impressive progress, particularly in their ability to process long videos through extremely extended context lengths. Ho…