Fit More and Train Faster With ZeRO via DeepSpeed and FairScaleBy Hugging Face - Blog / January 19, 2021