Entity Linking on Chinese Microblogs via Deep Neural Network

Entity linking is the task of mapping mentions in text to the corresponding entities in a target knowledge base, and it is crucial to knowledge-base-related tasks such as knowledge fusion and knowledge base construction. Although English-oriented entity linking has advanced steadily, entity linking systems for the Chinese language still lag behind. State-of-the-art Chinese entity linking systems rely on multiple handcrafted features to measure the similarity between mention and entity, but they fail to mine the semantic relations beneath the surface forms. In this paper, we propose to take advantage of latent text features and generate representations of mention and entity via a double-attention-based long short-term memory network; these representations are then used to calculate mention-entity similarity. Furthermore, joint training of word and entity embeddings and well-designed candidate entity generation strategies are put forward to facilitate the implementation of the neural network. Experimental results validate the superiority of our method, Celan. Our proposal not only offers an improved deep neural network for generating mention and entity representations, but also enhances the performance of entity linking on Chinese microblogs.
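
To make the idea of scoring candidates by mention-entity similarity concrete, the following is a minimal sketch and not the model described in this paper. It assumes the per-token hidden states have already been produced by some encoder (an LSTM in the paper's setting), replaces the double-attention mechanism with a single dot-product attention pooling step, and uses cosine similarity as the scoring function. All function names, the query vector, and the toy vectors are hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def attention_pool(hidden_states, query):
    """Weight each hidden state by its dot-product score against the query
    and return the weighted sum (assumed stand-in for an attention layer)."""
    weights = softmax([dot(h, query) for h in hidden_states])
    dim = len(hidden_states[0])
    return [sum(w * h[i] for w, h in zip(weights, hidden_states))
            for i in range(dim)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    norm = math.sqrt(dot(u, u)) * math.sqrt(dot(v, v))
    return dot(u, v) / norm if norm else 0.0


def rank_candidates(mention_states, mention_query, candidates):
    """Rank candidate entities (name -> vector) by similarity to the
    attention-pooled mention representation, best first."""
    mention_vec = attention_pool(mention_states, mention_query)
    scored = [(name, cosine(mention_vec, vec))
              for name, vec in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Toy 2-dimensional hidden states for a two-token mention context.
    states = [[1.0, 0.0], [0.0, 1.0]]
    query = [10.0, 0.0]  # hypothetical query that favors the first token
    candidates = {"entity_A": [1.0, 0.0], "entity_B": [0.0, 1.0]}
    for name, score in rank_candidates(states, query, candidates):
        print(name, round(score, 4))
```

In a full system, the candidate vectors would come from the entity side of the network and the candidate set from a separate generation step; both are simply given as inputs here.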
