Structural relationships for large-scale learning of answer re-ranking

Supervised learning applied to answer re-ranking can highly improve on the overall accuracy of question answering (QA) systems. The key aspect is that the relationships and properties of the question/answer pair composed of a question and the supporting passage of an answer candidate, can be efficiently compared with those captured by the learnt model. In this paper, we define novel supervised approaches that exploit structural relationships between a question and their candidate answer passages to learn a re-ranking model. We model structural representations of both questions and answers and their mutual relationships by just using an off-the-shelf shallow syntactic parser. We encode structures in Support Vector Machines (SVMs) by means of sequence and tree kernels, which can implicitly represent question and answer pairs in huge feature spaces. Such models together with the latest approach to fast kernel-based learning enabled the training of our rerankers on hundreds of thousands of instances, which previously rendered intractable for kernelized SVMs. The results on two different QA datasets, e.g., Answerbag and Jeopardy! data, show that our models deliver large improvement on passage re-ranking tasks, reducing the error in Recall of BM25 baseline by about 18%. One of the key findings of this work is that, despite its simplicity, shallow syntactic trees allow for learning complex relational structures, which exhibits a steep learning curve with the increase in the training size.

[1]  Ming Zhou,et al.  Reranking Answers for Definitional QA Using Language Modeling , 2006, ACL.

[2]  Jean-Michel Renders,et al.  Word-Sequence Kernels , 2003, J. Mach. Learn. Res..

[3]  Mihai Surdeanu,et al.  Learning to Rank Answers on Large Online QA Collections , 2008, ACL.

[4]  Sasha Blair-Goldensohn,et al.  Answering Definitional Questions: A Hybrid Approach , 2004, New Directions in Question Answering.

[5]  Alessandro Moschitti,et al.  Kernel methods, syntax and semantics for relational text categorization , 2008, CIKM '08.

[6]  Yannick Versley,et al.  Coreference Systems Based on Kernels Methods , 2008, COLING.

[7]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[8]  Mirella Lapata,et al.  Using Semantic Roles to Improve Question Answering , 2007, EMNLP.

[9]  Eric H. Nyberg,et al.  Improving Text Retrieval Precision and Answer Accuracy in Question Answering Systems , 2008, COLING 2008.

[10]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[11]  Kathleen R. McKeown,et al.  A Hybrid Approach for Answering Definitional Questions , 2003 .

[12]  Ellen M. Voorhees,et al.  Overview of the TREC 2004 Novelty Track. , 2005 .

[13]  Yasemin Altun,et al.  Broad-Coverage Sense Disambiguation and Information Extraction with a Supersense Sequence Tagger , 2006, EMNLP.

[14]  W. Bruce Croft,et al.  Finding similar questions in large question and answer archives , 2005, CIKM '05.

[15]  Alessandro Moschitti,et al.  Syntactic/Semantic Structures for Textual Entailment Recognition , 2010, HLT-NAACL.

[16]  Nina Wacholder,et al.  HITIQA: Towards Analytical Question Answering , 2004, COLING.

[17]  Aravind K. Joshi,et al.  Using LTAG Based Features in Parse Reranking , 2003, EMNLP.

[18]  Alan F. Smeaton,et al.  Using NLP or NLP Resources for Information Retrieval Tasks , 1999 .

[19]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[20]  Yuji Matsumoto,et al.  Fast Methods for Kernel-Based Text Analysis , 2003, ACL.

[21]  Michael Collins,et al.  New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron , 2002, ACL.

[22]  James P. Callan,et al.  Structured retrieval for question answering , 2007, SIGIR.

[23]  Alessandro Moschitti,et al.  Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees , 2006, ECML.

[24]  Daniel Marcu,et al.  A Noisy-Channel Approach to Question Answering , 2003, ACL.

[25]  Tat-Seng Chua,et al.  Generic soft pattern models for definitional question answering , 2005, SIGIR '05.

[26]  Dmitry Zelenko,et al.  Kernel Methods for Relation Extraction , 2002, J. Mach. Learn. Res..

[27]  Alessandro Moschitti,et al.  A Semantic Kernel for Predicate Argument Classification , 2004, CoNLL.

[28]  Roberto Basili,et al.  Exploiting Syntactic and Shallow Semantic Kernels for Question Answer Classification , 2007, ACL.

[29]  Tomek Strzalkowski,et al.  Natural Language Information Retrieval: TREC-8 Report , 1994, TREC.

[30]  Thorsten Joachims,et al.  Training structural svms with kernels using sampled cuts , 2008, KDD.

[31]  Alessandro Moschitti,et al.  Joint Distant and Direct Supervision for Relation Extraction , 2011, IJCNLP.

[32]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[33]  Alessandro Moschitti,et al.  Large-Scale Support Vector Learning with Structural Kernels , 2010, ECML/PKDD.

[34]  Alessandro Moschitti,et al.  Semantic Role Labeling via FrameNet, VerbNet and PropBank , 2006, ACL.

[35]  Yutaka Sasaki Question Answering as Question-Biased Term Extraction: A New Approach toward Multilingual QA , 2005, ACL.

[36]  Knowledge Discovering using FrameNet, VerbNet and PropBank , 2004 .

[37]  Jaime G. Carbonell,et al.  Rank learning for factoid question answering with linguistic and semantic constraints , 2010, CIKM.

[38]  Filip Radlinski,et al.  Query chains: learning to rank from implicit feedback , 2005, KDD '05.

[39]  Alessandro Moschitti,et al.  A Study on Convolution Kernels for Shallow Statistic Parsing , 2004, ACL.

[40]  Andrew Hickl,et al.  Question Answering with LCC's CHAUCER-2 at TREC 2007 , 2006, TREC.

[41]  Jun Suzuki,et al.  SVM Answer Selection for Open-Domain Question Answering , 2002, COLING.

[42]  David A. Ferrucci Build watson: An overview of DeepQA for the Jeopardy! Challenge , 2010, 2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT).