Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 AcceleratorBy Hugging Face - Blog / March 28, 2023