Tianpeng Zheng, Zhehan Jiang, Jiayi Liu, Shicong Feng

Leveraging Computerized Adaptive Testing for Cost-effective Evaluation of Large Language Models in Medical Benchmarking

Tianpeng Zheng, Zhehan Jiang, Jiayi Liu, Shicong Feng / March 26, 2026

arXiv:2603.23506v1 Announce Type: cross
Abstract: The rapid proliferation of large language models (LLMs) in healthcare creates an urgent need for scalable and psychometrically sound evaluation methods. Conventional static benchmarks are costly to adm…

Author name: Tianpeng Zheng, Zhehan Jiang, Jiayi Liu, Shicong Feng

Leveraging Computerized Adaptive Testing for Cost-effective Evaluation of Large Language Models in Medical Benchmarking