Evaluating LLMs in Production: Two Walls We Hit and How We Got Through

Evaluating LLM output in production is two problems stacked on top of each other. First, you have to see what the model actually did —…
