AdaFRUGAL: Adaptive Memory-Efficient Training with Dynamic Control
arXiv:2601.11568v2 Announce Type: replace-cross
Abstract: Training Large Language Models (LLMs) is highly memory-intensive due to optimizer state overhead. The FRUGAL framework mitigates this with gradient splitting, but its static hyperparameters — …
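The gradient-splitting idea can be illustrated with a minimal sketch: optimizer state (e.g. Adam moments) is kept only for a "state-full" block of coordinates, while the remaining "state-free" coordinates are updated with a stateless rule such as signSGD, cutting optimizer memory roughly in proportion to the split. The fixed coordinate split, the toy quadratic loss, and all names below are illustrative assumptions, not FRUGAL's actual implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 8
params = rng.normal(size=dim)
init_norm = np.linalg.norm(params)

# Hypothetical static split: Adam state is kept only for the first half
# of the coordinates (the "state-full" block); the second half uses
# stateless signSGD, so it needs no optimizer memory at all.
k = dim // 2
m = np.zeros(k)  # first moment, state-full block only
v = np.zeros(k)  # second moment, state-full block only

def step(params, grad, m, v, t, lr=1e-2, b1=0.9, b2=0.999, eps=1e-8):
    # Adam update with bias correction on the state-full block.
    m[:] = b1 * m + (1 - b1) * grad[:k]
    v[:] = b2 * v + (1 - b2) * grad[:k] ** 2
    m_hat = m / (1 - b1 ** t)
    v_hat = v / (1 - b2 ** t)
    params[:k] -= lr * m_hat / (np.sqrt(v_hat) + eps)
    # Stateless signSGD on the remaining coordinates.
    params[k:] -= lr * np.sign(grad[k:])
    return params

for t in range(1, 6):
    grad = params.copy()  # gradient of the toy loss 0.5 * ||params||^2
    params = step(params, grad, m, v, t)
```

Note that the optimizer state here occupies `2 * k` floats instead of the `2 * dim` a full Adam run would need; the static split ratio `k / dim` is exactly the kind of hyperparameter the abstract describes as fixed in FRUGAL.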