MINOS: A Multimodal Evaluation Model for Bidirectional Generation Between Image and Text
arXiv:2506.02494v2 Announce Type: replace
Abstract: Evaluation is important for multimodal generation tasks, while traditional multimodal evaluation metrics suffer from several limitations. With the rapid progress of MLLMs, there is growing interest i…