Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation InferenceBy Hugging Face - Blog / January 16, 2025