What makes an entity salient in discourse?
arXiv:2508.16464v2 Announce Type: replace
Abstract: Entities in discourse vary in salience: main participants, objects and locations stay prominent, while others are quickly forgotten, raising questions about how humans signal and infer discourse-level salience. Using a graded operationalization of discourse-level salience based on summary-worthiness in multiple summaries, this paper investigates whether predictors of utterance-level prominence extend to the discourse level, and how they interact across 24 spoken and written genres of English. We examine features including grammatical function, definiteness, entity type, linear order, discourse relations and hierarchy, and referential structure, as well as the impact of genre. Our results show that utterance-level predictors significantly correlate with discourse-level salience, but interact with and are modulated by entity-level factors such as frequency and dispersion across the document. Multifactorial models reveal that no single factor determines salience; rather, discourse-structural and semantic features prove more robust than morphosyntactic ones, with substantial variation by genre and communicative intent.