Evaluating Answer Leakage Robustness of LLM Tutors against Adversarial Student Attacks
arXiv:2604.18660v1 Announce Type: cross
Abstract: Large Language Models (LLMs) are increasingly used in education, yet their default helpfulness often conflicts with pedagogical principles. Prior work evaluates pedagogical quality via answer leakage-t…