Shipping LLMs (Part 4/6): How to Evaluate a RAG Pipeline

Previously: Shipping LLMs (Part 3/6): Speculative Decoding vs Quantization. I argued you should run both. This piece is about whether the…
