Neural variational entity set expansion for automatically populated knowledge graphs

We propose neural variational set expansion to extract actionable information from a noisy knowledge graph (KG), together with a general approach for increasing the interpretability of recommendation systems. We demonstrate the usefulness of applying a variational autoencoder to the entity set expansion task on a realistic, automatically generated KG.
