Evaluating the Impact of Integrating Filtering and Compression Steps into the Automatic Summarization Process

In this article, we evaluate the impact of integrating compression and filtering steps into the automatic summarization pipeline. The evaluation is based on a series of experiments that we carried out on sub-corpora distributed at the DUC-TAC conferences. To conduct these experiments, we adopted an extraction method that treats summarization as an optimization problem: the task is to determine the best partition satisfying a set of predetermined criteria. The results obtained show the importance of integrating the filtering and compression steps.
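To make the shape of such a pipeline concrete, the following is a minimal sketch, not the system evaluated in the article. The filtering criterion (a minimum sentence length), the compression rule (dropping parenthetical asides), the objective (number of distinct words covered) and all function names are assumptions chosen for illustration. The selection step is an exhaustive search over candidate extracts under a word budget, which is only practical for a handful of sentences.

```python
import re
from itertools import combinations


def tokens(sentence):
    """Lower-cased set of word tokens in a sentence."""
    return set(re.findall(r"\w+", sentence.lower()))


def filter_sentences(sentences, min_words=4):
    """Filtering step (assumed criterion): drop very short sentences."""
    return [s for s in sentences if len(s.split()) >= min_words]


def compress(sentence):
    """Compression step (assumed rule): remove parenthetical asides."""
    without_asides = re.sub(r"\s*\([^)]*\)", "", sentence)
    return re.sub(r"\s+", " ", without_asides).strip()


def score(extract):
    """Assumed objective: number of distinct words covered by the extract."""
    covered = set()
    for sentence in extract:
        covered |= tokens(sentence)
    return len(covered)


def best_extract(sentences, max_words):
    """Exhaustively search for the highest-scoring extract within the budget."""
    best, best_score = (), -1
    for k in range(1, len(sentences) + 1):
        for candidate in combinations(sentences, k):
            if sum(len(s.split()) for s in candidate) > max_words:
                continue  # candidate exceeds the length budget
            candidate_score = score(candidate)
            if candidate_score > best_score:
                best, best_score = candidate, candidate_score
    return list(best)


def summarize(sentences, max_words, use_filter=True, use_compression=True):
    """Run the optional filtering and compression steps, then select an extract."""
    if use_filter:
        sentences = filter_sentences(sentences)
    if use_compression:
        sentences = [compress(s) for s in sentences]
    return best_extract(sentences, max_words)


if __name__ == "__main__":
    docs = [
        "Short one.",
        "The storm (the worst in a decade) flooded the coastal town.",
        "Rescue teams evacuated residents from the coastal town.",
    ]
    print(summarize(docs, max_words=14))
```

Toggling `use_filter` and `use_compression` gives a simple way to compare extracts with and without each step, which mirrors the kind of comparison the evaluation describes.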
