AIM: Asymmetric Information Masking for Visual Question Answering Continual Learning
arXiv:2604.14779v1 Announce Type: new
Abstract: In continual visual question answering (VQA), existing Continual Learning (CL) methods are mostly built for symmetric, unimodal architectures. However, modern Vision-Language Models (VLMs) violate this a…