Entity Synonym Discovery via Multiple Attentions

Entity synonym discovery is an important task that benefits many downstream applications, such as web search, question answering, and knowledge graph construction. Two types of approaches are widely used to discover synonyms from a raw text corpus: distributional approaches and pattern-based approaches. However, each suffers from either low precision or low recall. In this paper, we propose SynMine, a novel framework for extracting synonyms from massive raw text corpora. The framework integrates corpus-level statistics and local contexts in a unified way via a multi-attention mechanism. Extensive experiments on a real-world dataset demonstrate the effectiveness of our approach.
