How to train a new language model from scratch using Transformers and TokenizersBy Hugging Face - Blog / February 14, 2020