Building structures from classifiers for passage reranking

This paper shows that learning to rank models can be applied to automatically learn complex patterns, such as relational semantic structures occurring in questions and their answer passages. This is achieved by providing the learning algorithm with a tree representation derived from the syntactic trees of questions and passages connected by relational tags, where the latter are again provided by the means of automatic classifiers, i.e., question and focus classifiers and Named Entity Recognizers. This way effective structural relational patterns are implicitly encoded in the representation and can be automatically utilized by powerful machine learning models such as kernel methods. We conduct an extensive experimental evaluation of our models on well-known benchmarks from the question answer (QA) track of TREC challenges. The comparison with state-of-the-art systems and BM25 show a relative improvement in MAP of more than 14% and 45%, respectively. Further comparison on the task restricted to the answer sentence reranking shows an improvement in MAP of more than 8% over the state of the art.

[1]  Mirella Lapata,et al.  Using Semantic Roles to Improve Question Answering , 2007, EMNLP.

[2]  Trevor I. Dix,et al.  A Bit-String Longest-Common-Subsequence Algorithm , 1986, Inf. Process. Lett..

[3]  Alessandro Moschitti,et al.  Kernel methods, syntax and semantics for relational text categorization , 2008, CIKM '08.

[4]  Hamish Cunningham,et al.  Identification of the Question Focus: Combining Syntactic Analysis and Ontology-based Lookup through the User Interaction , 2010, LREC.

[5]  Alessandro Moschitti,et al.  Efficient Graph Kernels for Textual Entailment Recognition , 2011, Fundam. Informaticae.

[6]  James Allan,et al.  Passage Reranking for Question Answering Using Syntactic Structures and Answer Types , 2011, ECIR.

[7]  Nina Wacholder,et al.  HITIQA: Towards Analytical Question Answering , 2004, COLING.

[8]  Ellen M. Voorhees,et al.  Overview of the TREC 2002 Question Answering Track , 2003, TREC.

[9]  Dan Roth,et al.  Learning Question Classifiers , 2002, COLING.

[10]  Eric Brill,et al.  Automatic question answering using the web: Beyond the Factoid , 2006, Information Retrieval.

[11]  Gokhan Tur,et al.  LDA Based Similarity Modeling for Question Answering , 2010, HLT-NAACL 2010.

[12]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[13]  W. Bruce Croft,et al.  Finding similar questions in large question and answer archives , 2005, CIKM '05.

[14]  Alessandro Moschitti,et al.  Kernels on Linguistic Structures for Answer Extraction , 2008, ACL.

[15]  Silvia Quarteroni,et al.  Evaluating Multi-focus Natural Language Queries over Data Services , 2012, LREC.

[16]  Ben He,et al.  Question-answer topic model for question retrieval in community question answering , 2012, CIKM.

[17]  James Fan,et al.  Learning to rank for robust question answering , 2012, CIKM.

[18]  Yi Liu,et al.  Statistical Machine Translation for Query Expansion in Answer Retrieval , 2007, ACL.

[19]  Jennifer Chu-Carroll,et al.  Building Watson: An Overview of the DeepQA Project , 2010, AI Mag..

[20]  Andrew Hickl,et al.  Question Answering with LCC's CHAUCER-2 at TREC 2007 , 2006, TREC.

[21]  Alessandro Moschitti,et al.  Learning Adaptable Patterns for Passage Reranking , 2013, CoNLL.

[22]  Antonio Cisternino,et al.  PiQASso: Pisa Question Answering System , 2001, TREC.

[23]  Iryna Gurevych,et al.  UKP: Computing Semantic Textual Similarity by Combining Multiple Content Similarity Measures , 2012, *SEMEVAL.

[24]  Alessandro Moschitti,et al.  Structural relationships for large-scale learning of answer re-ranking , 2012, SIGIR '12.

[25]  John Tait,et al.  Word sense disambiguation in information retrieval revisited , 2003, SIGIR.

[26]  Yasemin Altun,et al.  Broad-Coverage Sense Disambiguation and Information Extraction with a Supersense Sequence Tagger , 2006, EMNLP.

[27]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[28]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[29]  Alessandro Moschitti,et al.  Linguistic kernels for answer re-ranking in question answering systems , 2011, Inf. Process. Manag..

[30]  Alon Lavie,et al.  Meteor 1.3: Automatic Metric for Reliable Optimization and Evaluation of Machine Translation Systems , 2011, WMT@EMNLP.

[31]  Eneko Agirre,et al.  SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity , 2012, *SEMEVAL.

[32]  Filip Radlinski,et al.  Query chains: learning to rank from implicit feedback , 2005, KDD '05.

[33]  Noah A. Smith,et al.  What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA , 2007, EMNLP.

[34]  Alessandro Moschitti,et al.  Syntactic and Semantic Kernels for Short Text Pair Categorization , 2009, EACL.

[35]  Roberto Basili,et al.  Exploiting Syntactic and Shallow Semantic Kernels for Question Answer Classification , 2007, ACL.

[36]  Alessandro Moschitti,et al.  Fast and effective kernels for relational learning from texts , 2007, ICML '07.

[37]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[38]  Ellen M. Voorhees,et al.  Overview of TREC 2003. , 2003 .

[39]  Alessandro Moschitti,et al.  A machine learning approach to textual entailment recognition , 2009, Natural Language Engineering.

[40]  Alessandro Moschitti,et al.  Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees , 2006, ECML.

[41]  Mihai Surdeanu,et al.  Learning to Rank Answers on Large Online QA Collections , 2008, ACL.

[42]  Carlo Strapparava,et al.  Corpus-based and Knowledge-based Measures of Text Semantic Similarity , 2006, AAAI.

[43]  W. Bruce Croft,et al.  Retrieval models for question and answer archives , 2008, SIGIR '08.

[44]  Michael Collins,et al.  New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron , 2002, ACL.

[45]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[46]  Noah A. Smith,et al.  Tree Edit Models for Recognizing Textual Entailments, Paraphrases, and Answers to Questions , 2010, NAACL.

[47]  Mark Andrew Greenwood,et al.  Open-domain question answering , 2005 .

[48]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[49]  Alessandro Moschitti,et al.  Automatic Learning of Textual Entailments with Cross-Pair Similarities , 2006, ACL.

[50]  Christian Biemann Creating a system for lexical substitutions from scratch using crowdsourcing , 2013, Lang. Resour. Evaluation.

[51]  Chris Callison-Burch,et al.  Answer Extraction as Sequence Tagging with Tree Edit Distance , 2013, NAACL.

[52]  Dekang Lin,et al.  A Probabilistic Answer Type Model , 2006, EACL.

[53]  Michael J. Wise,et al.  YAP3: improved detection of similarities in computer program and other texts , 1996, SIGCSE '96.

[54]  Christopher D. Manning,et al.  Probabilistic Tree-Edit Models with Structured Latent Variables for Textual Entailment and Question Answering , 2010, COLING.