Long-answer question answering and rhetorical-semantic relations

Long-Answer Question Answering and Rhetorical-Semantic Relations Sasha J. Blair-Goldensohn Over the past decade, Question Answering (QA) has generated considerable interest and participation in the fields of Natural Language Processing and Information Retrieval. Conferences such as TREC, CLEF and DUC have examined various aspects of the QA task in the academic community. In the commercial world, major search engines from Google, Microsoft and Yahoo have integrated basic QA capabilities into their core web search. These efforts have focused largely on so-called “factoid” questions seeking a single fact, such as the birthdate of an individual or the capital city of a country. Yet in the past few years, there has been growing recognition of a broad class of “long-answer” questions which cannot be satisfactorily answered in this framework, such as those seeking a definition, explanation, or other descriptive information in response. In this thesis, we consider the problem of answering such questions, with particular focus on the contribution to be made by integrating rhetorical and semantic models. We present DefScriber, a system for answering definitional (“What is X?”), biographical (“Who is X?”) and other long-answer questions using a hybrid of goaland data-driven methods. Our goal-driven, or top-down, approach is motivated by a set of definitional predicates which capture information types commonly useful in definitions; our data-driven, or bottom-up, approach uses dynamic analysis of input data to guide answer content. In several evaluations, we demonstrate that DefScriber outperforms competitive summarization techniques, and ranks among the top long-answer QA systems being developed by others. Motivated by our experience with definitional predicates in DefScriber, we pursue a set of experiments which automatically acquire broad-coverage lexical models of rhetoricalsemantic relations (RSRs) such as Cause and Contrast. Building on the framework of Marcu and Echihabi (Marcu and Echihabi, 2002), we implement techniques to improve the quality of these models using syntactic filtering and topic segmentation, and present evaluation results showing that these methods can improve the accuracy of relation classification. Lastly, we implement two approaches for applying the knowledge in our RSR models to enhance the performance and scope of DefScriber. First, we integrate RSR models into the answer-building process in DefScriber, finding incremental improvements with respect to the content and ordering of responses. Second, we use our RSR models to help identify relevant answer material for an exploratory class of “relation-focused” questions which seek explanatory or comparative responses. We demonstrate that in the case of explanation questions, using RSRs can lead to significantly more relevant responses.

[1]  Hoa Trang Dang,et al.  Overview of DUC 2005 , 2005 .

[2]  Mark T. Maybury,et al.  Enhancing Explanation Coherence With Rhetorical Strategies , 1989, EACL.

[3]  Charles L. Wayne Multilingual Topic Detection and Tracking: Successful Research Enabled by Corpora and Evaluation , 2000, LREC.

[4]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[5]  Roger C. Schank,et al.  SCRIPTS, PLANS, GOALS, AND UNDERSTANDING , 1988 .

[6]  Syin Chan,et al.  Extracting Causal Knowledge from a Medical Database Using Graphical Patterns , 2000, ACL.

[7]  Thomas Mussweiler,et al.  'Everything is relative': Comparison processes in social judgment: The 2002 Jaspars Lecture , 2003 .

[8]  Brian Roark,et al.  Query-focused summarization by supervised sentence ranking and skewed word distributions , 2006 .

[9]  Lillian Lee,et al.  Measures of Distributional Similarity , 1999, ACL.

[10]  Vasudeva Varma,et al.  A Relevance-Based Language Modeling approach to DUC 2005 , 2005 .

[11]  Jennifer Chu-Carroll,et al.  Question Answering Using Constraint Satisfaction: QA-By-Dossier-With-Contraints , 2004, ACL.

[12]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[13]  Eric Fosler-Lussier,et al.  Discourse Segmentation of Multi-Party Conversation , 2003, ACL.

[14]  Jeannett Martin,et al.  English Text: System and structure , 1992 .

[15]  James C. Lester,et al.  Developing and Empirically Evaluating Robust Explanation Generators: The KNIGHT Experiments , 1997, Comput. Linguistics.

[16]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.

[17]  Koby Crammer,et al.  Pranking with Ranking , 2001, NIPS.

[18]  Roxana Gîrju,et al.  Automatic Detection of Causal Relations for Question Answering , 2003, ACL 2003.

[19]  Eduard H. Hovy,et al.  Automated Discourse Generation Using Discourse Structure Relations , 1993, Artif. Intell..

[20]  Barbara Di Eugenio,et al.  On the Usage of Kappa to Evaluate Agreement on Coding Tasks , 2000, LREC.

[21]  George Lakoff,et al.  Women, Fire, and Dangerous Things , 1987 .

[22]  Michelle X. Zhou,et al.  An optimization-based approach to dynamic data content selection in intelligent multimedia interfaces , 2004, UIST '04.

[23]  Dan I. Moldovan,et al.  Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations , 2003, NAACL.

[24]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[25]  Johanna D. Moore,et al.  Discourse in Computational Linguistics and Artificial Intelligence , 2003 .

[26]  Daniel Marcu,et al.  Evaluating Multiple Aspects of Coherence in Student Essays , 2004, NAACL.

[27]  Luis Gravano,et al.  Snowball: extracting relations from large plain-text collections , 2000, DL '00.

[28]  David Evans,et al.  Columbia University at DUC 2004 , 2004 .

[29]  Candace L. Sidner,et al.  Attention, Intentions, and the Structure of Discourse , 1986, CL.

[30]  Thorsten Brants,et al.  Multiple Similarity Measures and Source-Pair Information in Story Link Detection , 2004, HLT-NAACL.

[31]  Jörg Tiedemann Integrating Linguistic Knowledge in Passage Retrieval for Question Answering , 2005, HLT/EMNLP.

[32]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.

[33]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[34]  Daniel Marcu,et al.  The rhetorical parsing, summarization, and generation of natural language texts , 1998 .

[35]  Julia Hirschberg,et al.  Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization , 2005, INTERSPEECH.

[36]  Regina Barzilay,et al.  Columbia’s Newsblaster: New Features and Future Directions , 2003, NAACL.

[37]  Kathleen McKeown,et al.  Lexicalized Markov Grammars for Sentence Compression , 2007, NAACL.

[38]  Kathleen R. McKeown,et al.  Generating natural language summaries from multiple on-line sources , 1998 .

[39]  Tat-Seng Chua,et al.  Unsupervised learning of soft patterns for generating definitions from online news , 2004, WWW '04.

[40]  Vibhu O. Mittal,et al.  Generating explanations in context: The system perspective , 1995 .

[41]  Advaith Siddharthan,et al.  Syntactic Simplification and Text Cohesion , 2006 .

[42]  Dan Roth,et al.  Learning Question Classifiers , 2002, COLING.

[43]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[44]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[45]  Dekang Lin,et al.  Dependency-Based Evaluation of Minipar , 2003 .

[46]  Eduard Hovy,et al.  Automated multi-document summarization in NeATS , 2002 .

[47]  Kathleen McKeown,et al.  Text generation: using discourse strategies and focus constraints to generate natural language text , 1985 .

[48]  Sasha Blair-Goldensohn From Definitions to Complex Topics: Columbia University at DUC 2005 , 2005 .

[49]  Lou Boves,et al.  Data for question answering: The case of why , 2006, LREC.

[50]  Elena Filatova,et al.  Tell Me What You Do and I'll Tell You What You Are: Learning Occupation-Related Activities for Biographies , 2005, HLT/EMNLP.

[51]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[52]  G. Meade Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory , 2001 .

[53]  Lucy Vanderwende,et al.  MindNet: Acquiring and Structuring Semantic Information from Text , 1998, COLING-ACL.

[54]  Kathleen F. McCoy,et al.  Efficient text summarization using lexical chains , 2000, IUI '00.

[55]  Jerry R. Hobbs Literature And Cognition , 1990 .

[56]  Hiroaki Sato,et al.  The FrameNet Database and Software Tools , 2002, LREC.

[57]  Daniel Marcu,et al.  Bayesian Multi-Document Summarization at MSE , 2005 .

[58]  Eleazar Eskin,et al.  Detecting Text Similarity over Short Passages: Exploring Linguistic Feature Combinations via Machine Learning , 1999, EMNLP.

[59]  Sanda M. Harabagiu,et al.  Impact of Question Decomposition on the Quality of Answer Summaries , 2006, LREC.

[60]  James H. Martin,et al.  Contextual Spelling Correction Using Latent Semantic Analysis , 1997, ANLP.

[61]  Daniel Jurafsky,et al.  Learning Syntactic Patterns for Automatic Hypernym Discovery , 2004, NIPS.

[62]  Michael Oakes,et al.  Statistics for Corpus Linguistics , 1998 .

[63]  Daniel Marcu,et al.  Towards Automatic Classification of Discourse Elements in Essays , 2001, ACL.

[64]  Sanda M. Harabagiu,et al.  Question Answering Based on Semantic Structures , 2004, COLING.

[65]  Daniel Jurafsky,et al.  Semantic Taxonomy Induction from Heterogenous Evidence , 2006, ACL.

[66]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[67]  Regina Barzilay,et al.  Information Fusion in the Context of Multi-Document Summarization , 1999, ACL.

[68]  Eduard H. Hovy,et al.  The Use of External Knowledge of Factoid QA , 2001, TREC.

[69]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[70]  Alexey Radul,et al.  Nuggeteer: Automatic Nugget-Based Evaluation using Descriptions and Judgements , 2006, NAACL.

[71]  Beth Sundheim,et al.  Overview of the Fourth Message Understanding Evaluation and Conference , 1992, MUC.

[72]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[73]  Leo G. M. Noordman,et al.  Toward a taxonomy of coherence relations , 1992 .

[74]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[75]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[76]  David Reitter,et al.  The Embra System at DUC 2005: Query-oriented Multi-document Summarization with a Very Large Latent Semantic Space , 2005 .

[77]  Paul Over,et al.  TRECVID 2005 - An Overview , 2005, TRECVID.

[78]  Regina Barzilay,et al.  Catching the Drift: Probabilistic Content Models, with Applications to Generation and Summarization , 2004, NAACL.

[79]  Mirella Lapata,et al.  Modeling Local Coherence: An Entity-Based Approach , 2005, ACL.

[80]  W. Bruce Croft,et al.  Combining the language model and inference network approaches to retrieval , 2004, Inf. Process. Manag..

[81]  Susan T. Dumais Combining evidence for effective information filtering , 1996 .

[82]  Susan T. Dumais,et al.  The latent semantic analysis theory of knowledge , 1997 .

[83]  Sandra Carberry,et al.  A New Strategy for Providing Definitions In Task-Oriented Dialogues , 1988, COLING.

[84]  Ani Nenkova,et al.  Syntactic Simplification for Improving Content Selection in Multi-Document Summarization , 2004, COLING.

[85]  David M. Pennock,et al.  Inferring hierarchical descriptions , 2002, CIKM '02.

[86]  Johanna D. Moore,et al.  Planning Text for Advisory Dialogues: Capturing Intentional and Rhetorical Information , 1993, CL.

[87]  B. Webber,et al.  A Short Introduction to the Penn Discourse TreeBank , 2005 .

[88]  Daniel Marcu,et al.  Sentence Level Discourse Parsing using Syntactic and Lexical Information , 2003, NAACL.

[89]  Jerry R. Hobbs Coherence and Coreference , 1979, Cogn. Sci..

[90]  Smaranda Muresan,et al.  DEFINDER: Rule-based Methods for the Extraction of Medical Terminology and their Associated Definitions from On-line Text , 2000, AMIA.

[91]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[92]  Eduard Hovy,et al.  Parsimonious or Profligate: How Many and Which Discourse Structure Relations? , 1992 .

[93]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[94]  Eugene Charniak,et al.  Finding Parts in Very Large Corpora , 1999, ACL.

[95]  Daniel Marcu,et al.  The rhetorical parsing of unrestricted texts: a surface-based approach , 2000, CL.

[96]  Jimmy J. Lin,et al.  Automatically Evaluating Answers to Definition Questions , 2005, HLT.

[97]  Mirella Lapata,et al.  Inferring Sentence-internal Temporal Relations , 2004, NAACL.

[98]  Owen Rambow,et al.  Use of Deep Linguistic Features for the Recognition and Labeling of Semantic Arguments , 2003, EMNLP.

[99]  Livio Robaldo,et al.  The Penn Discourse Treebank 2.0 Annotation Manual , 2007 .

[100]  Carla Umbach Contrast and Contrastive Topic , 2001 .

[101]  Owen Rambow Domain Communication Knowledge , 1990, INLG.

[102]  Daniel Jurafsky,et al.  Semantic Role Labeling by Tagging Syntactic Chunks , 2004, CoNLL.

[103]  W. Eric L. Grimson,et al.  Answering Questions about Moving Objects in Surveillance Videos , 2003, New Directions in Question Answering.

[104]  Suzan Verberne,et al.  Developing an Approach for Why-Question Answering , 2006, EACL.

[105]  Ani Nenkova,et al.  References to Named Entities: a Corpus Study , 2003, HLT-NAACL.

[106]  Daniel Jurafsky,et al.  Automatic Labeling of Semantic Roles , 2002, CL.

[107]  James Allan,et al.  Topic-based novelty detection: 1999 summer workshop at clsp , 1999 .

[108]  Byron Dom,et al.  An Information-Theoretic External Cluster-Validity Measure , 2002, UAI.

[109]  Mirella Lapata,et al.  Constructing Semantic Space Models from Parsed Corpora , 2003, ACL.

[110]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[111]  Ani Nenkova,et al.  Evaluating Content Selection in Summarization: The Pyramid Method , 2004, NAACL.

[112]  Ralph Grishman,et al.  Information Extraction: Techniques and Challenges , 1997, SCIE.

[113]  Daniela Garcia,et al.  COATIS, an NLP System to Locate Expressions of Actions Connected by Causality Links , 1997, EKAW.

[114]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.

[115]  Steven J. Maiorano Finding Answers in Large Collections of Texts: Paragraph Indexing W Abductive Inference , 1999 .

[116]  Jimmy J. Lin,et al.  Quantitative evaluation of passage retrieval algorithms for question answering , 2003, SIGIR.

[117]  James Pustejovsky,et al.  The Generative Lexicon , 1995, CL.

[118]  Johanna D. Moore,et al.  A Problem for RST: The Need for Multi-Level Discourse Analysis , 1992, CL.

[119]  Dragomir R. Radev,et al.  Question-answering by predictive annotation , 2000, SIGIR '00.

[120]  Dominic Widdows,et al.  Unsupervised methods for developing taxonomies by combining syntactic and statistical information , 2003, NAACL.

[121]  Jinxi Xu,et al.  Evaluation of an extraction-based approach to answering definitional questions , 2004, SIGIR '04.

[122]  Michael Elhadad,et al.  Generating Connectives , 1990, COLING.

[123]  Marti A. Hearst Automated Discovery of WordNet Relations , 2004 .

[124]  Regina Barzilay,et al.  Inferring Strategies for Sentence Ordering in Multidocument News Summarization , 2002, J. Artif. Intell. Res..

[125]  Tristan Miller,et al.  Latent semantic analysis and the construction of coherent extracts , 2003, RANLP.

[126]  Jinxi Xu,et al.  A Hybrid Approach to Answering Biographical Questions , 2004, New Directions in Question Answering.

[127]  Mirella Lapata,et al.  Probabilistic Text Structuring: Experiments with Sentence Ordering , 2003, ACL.

[128]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[129]  Alex Lascarides,et al.  Temporal interpretation, discourse relations and commonsense entailment , 1993, The Language of Time - A Reader.

[130]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[131]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[132]  Kathleen R. McKeown,et al.  Understanding the process of multi-document summarization: content selection, rewriting and evaluation , 2006 .

[133]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[134]  Inderjeet Mani,et al.  Multi-Document Summarization by Graph Search and Matching , 1997, AAAI/IAAI.

[135]  Lou Boves,et al.  Discourse-based answering of why-questions , 2006, Trait. Autom. des Langues.

[136]  Lucy Vanderwende,et al.  at DUC 2006 : Task-Focused Summarization with Sentence Simplification and Lexical Expansion , 2006 .

[137]  Kathleen Dahlgren,et al.  Naive semantics for natural language understanding , 1988 .

[138]  Graeme Hirst,et al.  Non-Classical Lexical Semantic Relations , 2004, Proceedings of the HLT-NAACL Workshop on Computational Lexical Semantics - CLS '04.

[139]  Peter W. Foltz,et al.  The Measurement of Textual Coherence with Latent Semantic Analysis. , 1998 .

[140]  T. Sanders,et al.  The classification of coherence relations and their linguistic markers: An exploration of two languages , 1998 .

[141]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[142]  Johanna D. Moore,et al.  Toward a Synthesis of Two Accounts of Discourse Structure , 1996, CL.

[143]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.