The CIST Summarization System at TAC 2010

This is the first time we participate in TAC. In this report, we present our extractive summarization system on both initial and update summarization tracks of TAC 2010. We introduce an integrated method to generate all summaries. The TAC evaluation of results show that our summarization method is feasible but it has to be improved in future.

[1]  Shao Hai-min,et al.  Automatic multi-document summarization based on the latent Dirichlet topic allocation model , 2010 .

[2]  Jaime G. Carbonell,et al.  Machine learning research , 1981, SGAR.

[3]  Jun-ichi Fukumoto,et al.  Automated Summarization Evaluation with Basic Elements. , 2006, LREC.

[4]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[5]  Balaraman Ravindran,et al.  Latent dirichlet allocation based multi-document summarization , 2008, AND '08.

[6]  Eduard H. Hovy,et al.  From Single to Multi-document Summarization , 2002, ACL.

[7]  I JordanMichael,et al.  The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies , 2010 .

[8]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[9]  Zhang Jie-hui Automatic summarization evaluation method based on similarity of text , 2007 .

[10]  Chin-Yew Lin Improving Summarization Performance by Sentence Compression — A Pilot Study , 2003 .

[11]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[12]  Thomas L. Griffiths,et al.  The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies , 2007, JACM.

[13]  Mark T. Maybury,et al.  Automatic Summarization , 2002, Computational Linguistics.

[14]  Dianne P. O'Leary,et al.  Text summarization via hidden Markov models , 2001, SIGIR '01.

[15]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[16]  Dilek Z. Hakkani-Tür,et al.  A Hybrid Hierarchical Model for Multi-Document Summarization , 2010, ACL.

[17]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[18]  Balaraman Ravindran,et al.  Latent Dirichlet Allocation and Singular Value Decomposition Based Multi-document Summarization , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[19]  Thomas L. Griffiths,et al.  Hierarchical Topic Models and the Nested Chinese Restaurant Process , 2003, NIPS.

[20]  Sanda M. Harabagiu,et al.  Topic themes for multi-document summarization , 2005, SIGIR '05.

[21]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[22]  Robert L. Donaway,et al.  A Comparison of Rankings Produced by Summarization Evaluation Measures , 2000 .

[23]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[24]  Kathleen McKeown,et al.  Cut and Paste Based Text Summarization , 2000, ANLP.

[25]  Thomas L. Griffiths,et al.  A probabilistic approach to semantic representation , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[26]  Zong Chengqing,et al.  An Approach to Automatic Summarization by Integrating Latent Dirichlet Allocation in Conditional Random Field , 2009 .

[27]  Rada Mihalcea,et al.  A Language Independent Algorithm for Single and Multiple Document Summarization , 2005, IJCNLP.

[28]  Wai Lam,et al.  Meta-evaluation of Summaries in a Cross-lingual Environment using Content-based Metrics , 2002, COLING.

[29]  Sun Park,et al.  Multi-document Summarization Based on Cluster Using Non-negative Matrix Factorization , 2007, SOFSEM.

[30]  Daniel Marcu,et al.  Summarization beyond sentence extraction: A probabilistic approach to sentence compression , 2002, Artif. Intell..

[31]  Kathleen R. McKeown,et al.  Applying the Pyramid Method in DUC 2005 , 2005 .