cs.AI, cs.CL, cs.LG

Stress Testing Factual Consistency Metrics for Long-Document Summarization

arXiv:2511.07689v2 Announce Type: replace
Abstract: Evaluating the factual consistency of abstractive text summarization remains a significant challenge, particularly for long documents, where conventional metrics struggle with input length limitation…