LLMORPH: Automated Metamorphic Testing of Large Language Models
arXiv:2603.23611v1 Announce Type: cross
Abstract: Automated testing is essential for evaluating and improving the reliability of Large Language Models (LLMs), yet the lack of automated oracles for verifying output correctness remains a key challenge. …