Dynamic Cross-Modal Prompt Generation for Multimodal Continual Instruction Tuning
arXiv:2605.10765v1 Announce Type: cross
Abstract: Multimodal Large Language Models (MLLMs) achieve strong performance through instruction tuning, yet real-world deployment often requires continual capability expansion across sequential tasks. In such …