cs.CV

Omni-Persona: Systematic Benchmarking and Improving Omnimodal Personalization

arXiv:2605.09996v1 Announce Type: new
Abstract: While multimodal large language models have advanced across text, image, and audio, personalization research has remained primarily vision-language, with unified omnimodal benchmarking that jointly cover…