Efficient Bilevel Optimization with KFAC-Based Hypergradients
arXiv:2603.29108v1 Announce Type: new
Abstract: Bilevel optimization (BO) is widely applicable to many machine learning problems. Scaling BO, however, requires repeatedly computing hypergradients, which involves solving inverse Hessian-vector products…