Computer Science

Generating coherent summaries from documents

- Document summarization by discourse tree trimming -

Abstract

Previous studies on extractive document summarization regard a document as a set of sentences and they select a subset of the sentences as a summary. However, summaries generated by these approaches may lack coherence. Incoherent summaries mislead readers. To generate a coherent summary, (1) we regard a document as a discourse tree whose nodes represent sentences and the edges represent rhetorical relationship between them, and (2) extract a rooted subtree as a summary. We formulate the procedure as a Tree Knapsack Problem and develop an efficient algorithm to solve the problem. In future, we will extend our method by integrating it with sentence compression to generate short and coherent summaries.

Photos

Poster


Please click the thumbnail image to open the full-size PDF file.

Map

Presentor

Tsutomu Hirao
Tsutomu Hirao
Innovative Communication Laboratory
Masaaki Nishino
Masaaki Nishino
Innovative Communication Laboratory
Yasuhisa Yoshida
Yasuhisa Yoshida
Innovative Communication Laboratory
Jun Suzuki
Jun Suzuki
Innovative Communication Laboratory
Masaaki Nagata
Masaaki Nagata
Innovative Communication Laboratory