Beyond BLEU: A Semantic Evaluation Method for Code Translation
arXiv:2605.05282v1 Announce Type: cross
Abstract: Code translation is one of the core capabilities of LLMs. However, evaluating the correctness of translations remains difficult, as commonly used metrics such as BLEU measure only syntactic similarity,…