MHSafeEval: Role-Aware Interaction-Level Evaluation of Mental Health Safety in Large Language Models
arXiv:2604.17730v1 Announce Type: new
Abstract: Large language models (LLMs) are increasingly explored as scalable tools for mental health counseling, yet evaluating their safety remains challenging due to the interactional and context-dependent natur…