LocalLLaMA

An Open Benchmark for Testing RAG on Realistic Company-Internal Data

We built a corpus of 500,000 documents simulating a real company, and then let RAG systems compete to find out which one is the best. Introducing EnterpriseRAG-Bench, a benchmark for testing how well RAG systems work on messy, enterprise-scale in…