cs.AI, cs.CL, cs.LG, cs.SE

LLMORPH: Automated Metamorphic Testing of Large Language Models

arXiv:2603.23611v1 Announce Type: cross
Abstract: Automated testing is essential for evaluating and improving the reliability of Large Language Models (LLMs), yet the lack of automated oracles for verifying output correctness remains a key challenge. …