Evaluation Methodology for Text Summarization

Professor Kathleen McKeown
Columbia University
New York, USA

Tuesday 6th December 2005 at 11am

Video of Seminar (in Windows Media format)

Abstract

Evaluation of text summarization systems is a thorny issue. Should the quality of system output be quantified through comparison with human summaries of the same texts? It is well-known that there are many good human summaries of the same document(s). How can we make the comparison if there is not a single gold standard? Should quality instead be measured by how well end users can carry out a task using summarization? Task-based evaluation can be time-consuming and it can be difficult to control experimental design so that the summary and not, for example, the interface or the difficulty of the document, impacts how well the end user does on the task.

In this talk, I will describe three different types of summarization evaluation we carried out at Columbia. Two are task-based, but involve different methods of measuring results. These evaluations were performed on a medical journal article summarization system and a news summarization system. The third evaluation used a new methodology called The Pyramid Method, developed at Columbia, to allow comparison of system summaries against multiple human gold standards. This method was used in NIST's DUC05, a forum for cross-site evaluation of summarization systems. In the talk, I will briefly describe, for each evaluation, the systems being evaluated as well as the evaluation methodology and results. There will be time for discussion of the many open-ended issues that remain for summarization evaluation.

Short resume

Kathleen R. McKeown is a Professor of Computer Science at Columbia University. Her research interests include text summarization, natural language generation, multi-media explanation, digital libraries, concept to speech generation and natural language interfaces.

McKeown received the Ph.D. in Computer Science from the University of Pennsylvania in 1982 and has been at Columbia since then. In 1985 she received a National Science Foundation Presidential Young Investigator Award, in 1991 she received a National Science Foundation Faculty Award for Women, and in 1994 was selected as a AAAI Fellow. McKeown is also quite active nationally. She serves as a board member of the Computing Research Association and serves as secretary of the board.

She served as President of the Association of Computational Linguistics in 1992, Vice President in 1991, and Secretary Treasurer for 1995-1997. She has served on the Executive Council of the Association for Artificial Intelligence and was co-program chair of their annual conference in 1991.

Back to HAIL Home Page