Scaling Continual Learning to 300+ Tasks with Bi-Level Routing Mixture-of-Experts
arXiv:2602.03473v2 Announce Type: replace-cross
Abstract: Continual learning, especially class-incremental learning (CIL), built on a pre-trained model (PTM) has attracted substantial research interest in recent years. However, how to effectivel…