Confidence Estimation in Automatic Short Answer Grading with LLMs
arXiv:2605.00200v1 Announce Type: new
Abstract: Automatic Short Answer Grading (ASAG) with generative large language models (LLMs) has recently demonstrated strong performance without task-specific fine-tuning, while also enabling the generation of sy…