MachineLearning

What benchmark would you build for “reply quality” in SDR generation? [D]

Working on evaluating some AI-generated outbound (SDR-style emails along with follow-ups), and I’m running into a weird problem. Everyone talks about better personalisation or higher reply rates, but when you actually try to benchmark quality it gets m…