Summary of FAQs from a topical forum based on the native composition structure

This research improves the presentation of an automatic multiple-document summarization system, which produces the frequently asked questions (FAQs) of a topical forum. The design and findings of the proposed presentation structure based on a four-part pattern of traditional Chinese articles are presented. An experiment was designed to conduct an objective experimental analysis based on criteria consisting of compression rate, recall rate, and precision rate, as well as a subjective experimental analysis based on user acceptance in terms of the indication, readability, appropriate number of sentences, and structure of the summary. The experimental results show that the proposed summary presentation structure with both domain-terminology corpus methods produced a significantly improved summary presentation compared with the original system. Nevertheless, the FAQ summary presentation system could have used complete sentences instead of partial sentences from the original articles for improved readability.

[1]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[2]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[3]  Abderrafih Lehmam,et al.  Text Structuration Leading to an Automatic Summary System: RAFI , 1999, Inf. Process. Manag..

[4]  H. P. Edmundson,et al.  Automatic abstracting and indexing—survey and recommendations , 1961, CACM.

[5]  Meng Wang,et al.  A study of Chinese text summarization using adaptive clustering of paragraphs , 2004, The Fourth International Conference onComputer and Information Technology, 2004. CIT '04..

[6]  Robert G. Farrell,et al.  Summarization of discussion groups , 2001, CIKM '01.

[7]  Hongyuan Zha,et al.  Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering , 2002, SIGIR '02.

[8]  Keh-Yih Su,et al.  Statistical Models for Word Segmentation And Unknown Word Resolution , 1992, ROCLING.

[9]  Sung-Hyon Myaeng,et al.  Text Summarization Based on Sentence Clustering with Rhetorical Structure Information , 2005, Int. J. Comput. Process. Orient. Lang..

[10]  Richard Sproat,et al.  A statistical method for finding word boundaries in Chinese text , 1990 .

[11]  Chin-Yew Lin,et al.  Automated Text Summarization , 2005, IJCNLP.

[12]  R. Kaplan CULTURAL THOUGHT PATTERNS IN INTER‐CULTURAL EDUCATION , 1966 .

[13]  Hans van Halteren New Feature Sets for Summarization by Sentence Extraction , 2003, IEEE Intell. Syst..

[14]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[15]  Karen Spärck Jones Automatic summarising: The state of the art , 2007, Inf. Process. Manag..

[16]  Inderjeet Mani,et al.  The Challenges of Automatic Summarization , 2000, Computer.

[17]  Yu-Hui Tao,et al.  Comparing Genetic Algorithm and Statistical Segmentation in Terminology Extraction for Topical Forum , 2012 .

[18]  Jian-Yun Nie,et al.  On Chinese text retrieval , 1996, SIGIR '96.

[19]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[20]  Yoshihiro Ueda,et al.  Toward the "At-a-glance" Summary: Phrase-representation Summarization Method , 2000, COLING.

[21]  Chang-Shing Lee,et al.  Ontology-based fuzzy event extraction agent for Chinese e-news summarization , 2003, Expert Syst. Appl..

[22]  Gerard Salton,et al.  Automatic Text Structuring and Summarization , 1997, Inf. Process. Manag..

[23]  Keh-Jiann Chen,et al.  Word Identification for Mandarin Chinese Sentences , 1992, COLING.

[24]  Yuji Matsumoto,et al.  The diversity-based approach to open-domain text summarization , 2003, Inf. Process. Manag..

[25]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[26]  Regina Barzilay,et al.  Inferring Strategies for Sentence Ordering in Multidocument News Summarization , 2002, J. Artif. Intell. Res..

[27]  Hsinchun Chen,et al.  Document clustering for electronic meetings: an experimental comparison of two techniques , 1999, Decis. Support Syst..

[28]  Evangelos E. Milios,et al.  World Wide Web site summarization , 2004, Web Intell. Agent Syst..