NUS at DUC 2006: Document Concept Lattice for Summarization

Concepts composed of open-class terms after semantic equivalence discovery can be considered as simplified basic elements. We utilize frequent concept sets to construct a Document Concept Lattice, which contains hierarchical summary information of a document cluster. Based on this lattice, we further extract a set of sentences with maximal representative power and minimal redundancy for summarization. The implementation of our summarization approach via concept lattice obtains competitive performance in DUC 2006.

[1]  Gerd Stumme,et al.  Conceptual Knowledge Discovery in Databases Using Formal Concept Analysis Methods , 1998, PKDD.

[2]  Dragomir R. Radev,et al.  The University of Michigan at DUC 2004 , 2004 .

[3]  Chin-Yew Lin,et al.  Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics , 2004, ACL.

[4]  David A. Bell,et al.  A Lattice Machine Approach to Automated Casebase Design: Marrying Lazy and Eager Learning , 1999, IJCAI.

[5]  Slava M. Katz Distribution of content words and phrases in text and language modelling , 1996, Natural Language Engineering.

[6]  Kathleen R. McKeown,et al.  Applying the Pyramid Method in DUC 2005 , 2005 .

[7]  Mary S. Neff,et al.  Multi-document Summarization by Visualizing Topical Content , 2000 .

[8]  Tat-Seng Chua,et al.  NUS at DUC 2005: Understanding Documents via Concept Links , 2005 .

[9]  Eduard Hovy,et al.  Evaluating DUC 2005 using Basic Elements , 2005 .

[10]  R RadevDragomir,et al.  Centroid-based summarization of multiple documents , 2004 .

[11]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[12]  Yuji Matsumoto,et al.  A new approach to unsupervised text summarization , 2001, SIGIR '01.

[13]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[14]  Zunaid Kazi,et al.  Who's who? Identifying concepts and entities across multiple documents , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[15]  Hoa Trang Dang,et al.  Overview of DUC 2005 , 2005 .

[16]  Elizabeth D. Liddy,et al.  Advances in Automatic Text Summarization , 2001, Information Retrieval.