Stress Testing Factual Consistency Metrics for Long-Document Summarization
arXiv:2511.07689v2 Announce Type: replace
Abstract: Evaluating the factual consistency of abstractive text summarization remains a significant challenge, particularly for long documents, where conventional metrics struggle with input length limitation…