Beyond Chunking: Discourse-Aware Hierarchical Retrieval for Long Document Question Answering
arXiv:2506.06313v5 Announce Type: replace-cross
Abstract: Existing long-document question answering systems typically process texts as flat sequences or use heuristic chunking, which overlook the discourse structures that naturally guide human compreh…