Summarization: ERSS


ERSS is a system that evolved from 2003-2007 and participated in each NIST-sponsored DUC competition. Working under the assumption that the most important entities of a newspaper article are referred to most frequently, ERSS employs a coreference-based summarization strategy. ERSS approximates noun phrase (NP) coreference resolution with a shallow processing environment to identify the most important NPs to be included in the summary. While for single documents the longest coreference chain is a good indicator for importance, for multi-document summaries we developed a more intricate cluster graph algorithm. This algorithm is robust and adapts naturally to different subtasks, in fact it underlies all of our contributions to DUC competitions. In particular, the cluster graph algorithm adapts well to introducing a focus, or bias into the summary.

Last updated June 2007
by Sabine Bergler.