More Accurate Entity Ranking Using Knowledge Graph and Web Corpus

Recent years have witnessed some convergence in the architecture of entity search systems driven by a knowledge graph (KG) and a corpus with annotated entity mentions. However, each specific system has some limitations. We present AQQUCN, an entity search system that combines the best design principles into a public reference implementation. AQQUCN does not depend on well-formed question syntax, but works equally well with syntax-poor keyword queries. It uses several convolutional networks (convnets) to extract subtle, overlapping roles of query words. Instead of ranking structured query interpretations, which are then executed on the KG to return unranked sets, AQQUCN directly ranks response entities, by closely integrating coarse-grained predicates from the KG with fine-grained scoring from the corpus, into a single ranking model. Over and above competitive F1 score, AQQUCN gets the best entity ranking accuracy on two syntax-rich and two syntaxpoor public query workloads amounting to over 8,000 queries, with 16– 18% absolute improvement in mean average precision (MAP), compared to recent systems.

[1]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[2]  Daniele Bonadiman,et al.  Convolutional Neural Networks vs. Convolution Kernels: Feature Engineering for Answer Sentence Reranking , 2016, NAACL.

[3]  Ming-Wei Chang,et al.  Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base , 2015, ACL.

[4]  Ming Zhou,et al.  Question Answering over Freebase with Multi-Column Convolutional Neural Networks , 2015, ACL.

[5]  Eugene Agichtein,et al.  When a Knowledge Base Is Not Enough: Question Answering over Knowledge Bases with External Text Data , 2016, SIGIR.

[6]  Michael Gamon,et al.  Active objects: actions for entity-centric search , 2012, WWW.

[7]  Eunsol Choi,et al.  Coarse-to-Fine Question Answering for Long Documents , 2016, ACL.

[8]  Ya-Xiang Yuan,et al.  Alternate minimization gradient method , 2003 .

[9]  Luke S. Zettlemoyer,et al.  Broad-coverage CCG Semantic Parsing with AMR , 2015, EMNLP.

[10]  Dongyan Zhao,et al.  Question Answering on Freebase via Relation Extraction and Textual Evidence , 2016, ACL.

[11]  Xuchen Yao,et al.  Information Extraction over Structured Data: Question Answering with Freebase , 2014, ACL.

[12]  Dongyan Zhao,et al.  Hybrid Question Answering over Knowledge Base and Free Text , 2016, COLING.

[13]  Meghdad Farahmand,et al.  Learning Semantic Composition to Detect Non-compositionality of Multiword Expressions , 2015, EMNLP.

[14]  M. de Rijke,et al.  A language modeling framework for expert finding , 2009, Inf. Process. Manag..

[15]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[16]  Richard Socher,et al.  A Neural Network for Factoid Question Answering over Paragraphs , 2014, EMNLP.

[17]  Andrew Y. Ng,et al.  Improving Word Representations via Global Context and Multiple Word Prototypes , 2012, ACL.

[18]  Jayant Krishnamurthy,et al.  Semantic Parsing to Probabilistic Programs for Situated Question Answering , 2016, EMNLP.

[19]  Jason Weston,et al.  Question Answering with Subgraph Embeddings , 2014, EMNLP.

[20]  Hae-Chang Rim,et al.  Joint Relational Embeddings for Knowledge-based Question Answering , 2014, EMNLP.

[21]  ChengXiang Zhai,et al.  Statistical Language Models for Information Retrieval , 2008, NAACL.

[22]  Oren Etzioni,et al.  Open Information Extraction: The Second Generation , 2011, IJCAI.

[23]  Jonathan Berant,et al.  Semantic Parsing via Paraphrasing , 2014, ACL.

[24]  Hannah Bast,et al.  More Accurate Question Answering on Freebase , 2015, CIKM.

[25]  Soumen Chakrabarti,et al.  Learning joint query interpretation and response ranking , 2013, WWW '13.

[26]  Paolo Ferragina,et al.  TAGME: on-the-fly annotation of short text fragments (by wikipedia entities) , 2010, CIKM.

[27]  Jonathan Berant,et al.  Imitation Learning of Agenda-based Semantic Parsers , 2015, TACL.

[28]  Alessandro Moschitti,et al.  Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks , 2015, SIGIR.

[29]  Craig MacDonald,et al.  Voting for candidates: adapting data fusion techniques for an expert search task , 2006, CIKM '06.

[30]  Mandar Joshi,et al.  Knowledge Graph and Corpus Driven Segmentation and Answer Inference for Telegraphic Entity-seeking Queries , 2014, EMNLP.

[31]  Xuchen Yao,et al.  Lean Question Answering over Freebase from Scratch , 2015, NAACL.