Huy Hoang Ha, Benoit Favre, Francois Portet

MedMeta: A Benchmark for LLMs in Synthesizing Meta-Analysis Conclusion from Medical Studies

Huy Hoang Ha, Benoit Favre, Francois Portet / May 12, 2026

arXiv:2605.09661v1 Announce Type: cross
Abstract: Large language models (LLMs) have saturated standard medical benchmarks that test factual recall, yet their ability to perform higher-order reasoning, such as synthesizing evidence from multiple source…

Author name: Huy Hoang Ha, Benoit Favre, Francois Portet

MedMeta: A Benchmark for LLMs in Synthesizing Meta-Analysis Conclusion from Medical Studies