Evaluating the word-expert approach for Named-Entity Disambiguation

Named Entity Disambiguation (NED) is the task of linking a named-entity mention to an instance in a knowledge-base, typically Wikipedia. This task is closely related to word-sense disambiguation (WSD), where the supervised word-expert approach has prevailed. In this work we present the results of the word-expert approach to NED, where one classifier is built for each target entity mention string. The resources necessary to build the system, a dictionary and a set of training instances, have been automatically derived from Wikipedia. We provide empirical evidence of the value of this approach, as well as a study of the differences between WSD and NED, including ambiguity and synonymy statistics.

[1]  Eneko Agirre,et al.  UBC-ALM: Combining k-NN with SVD for WSD , 2007, SemEval@ACL.

[2]  Dan Klein,et al.  Optimization, Maxent Models, and Conditional Estimation without Magic , 2003, NAACL.

[3]  Valentin I. Spitkovsky,et al.  Stanford-UBC Entity Linking at TAC-KBP , 2010, TAC.

[4]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[5]  Heng Ji,et al.  Knowledge Base Population: Successful Approaches and Challenges , 2011, ACL.

[6]  Ying Shi,et al.  LCC Approaches to Knowledge Base Population at TAC 2010 , 2010, TAC.

[7]  Douglas W. Oard,et al.  Resolving Personal Names in Email Using Context Expansion , 2008, ACL.

[8]  Danuta Ploch Exploring Entity Relations for Named Entity Disambiguation , 2011, ACL.

[9]  Valentin I. Spitkovsky,et al.  Stanford-UBC Entity Linking at TAC-KBP, Again , 2011, TAC.

[10]  Heng Ji,et al.  Overview of the TAC 2010 Knowledge Base Population Track , 2010 .

[11]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[12]  Julio Gonzalo,et al.  WePS 2 Evaluation Campaign: Overview of the Web People Search Clustering Task , 2009 .

[13]  German Rigau,et al.  Supervised Corpus-Based Methods for WSD , 2007 .

[14]  Breck Baldwin,et al.  Entity-Based Cross-Document Coreferencing Using the Vector Space Model , 1998, COLING.

[15]  Martha Palmer,et al.  SemEval-2007 Task-17: English Lexical Sample, SRL and All Words , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[16]  Joel Nothman,et al.  Evaluating Entity Linking with Wikipedia , 2013, Artif. Intell..

[17]  Gerhard Weikum,et al.  KORE: keyphrase overlap relatedness for entity disambiguation , 2012, CIKM.

[18]  Hwee Tou Ng,et al.  It Makes Sense: A Wide-Coverage Word Sense Disambiguation System for Free Text , 2010, ACL.

[19]  Paul McNamee HLTCOE Efforts in Entity Linking at TAC KBP 2010 , 2010, TAC.

[20]  S. Soderland,et al.  - based Named Entity Disambiguation to Arbitrary Web Text , 2009 .

[21]  Silviu Cucerzan,et al.  Large-Scale Named Entity Disambiguation Based on Wikipedia Data , 2007, EMNLP.

[22]  Valentin I. Spitkovsky,et al.  A Cross-Lingual Dictionary for English Wikipedia Concepts , 2012, LREC.

[23]  Jian Su,et al.  Entity Linking Leveraging Automatically Generated Annotation , 2010, COLING.

[24]  Xianpei Han,et al.  An Entity-Topic Model for Entity Linking , 2012, EMNLP.

[25]  G. Prasad LEARNING TO LINK ENTITIES WITH KNOWLEDGE BASE , 2016 .

[26]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[27]  Razvan C. Bunescu,et al.  Using Encyclopedic Knowledge for Named entity Disambiguation , 2006, EACL.

[28]  Martha Palmer,et al.  The English all-words task , 2004, SENSEVAL@ACL.

[29]  Gerhard Weikum,et al.  Robust Disambiguation of Named Entities in Text , 2011, EMNLP.

[30]  Wanxiang Che,et al.  A Graph-based Method for Entity Linking , 2011, IJCNLP.

[31]  Erik F. Tjong Kim Sang,et al.  Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition , 2003, CoNLL.

[32]  Ganesh Ramakrishnan,et al.  Collective annotation of Wikipedia entities in web text , 2009, KDD.

[33]  Dan Klein,et al.  A Joint Model for Entity Analysis: Coreference, Typing, and Linking , 2014, TACL.

[34]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[35]  Doug Downey,et al.  Local and Global Algorithms for Disambiguation to Wikipedia , 2011, ACL.

[36]  Rada Mihalcea,et al.  Linking Documents to Encyclopedic Knowledge , 2008, IEEE Intelligent Systems.

[37]  Jing Jiang,et al.  Linking Entities to a Knowledge Base with Query Expansion , 2011, EMNLP.

[38]  James Allan,et al.  Cross-Document Coreference on a Large Scale Corpus , 2004, NAACL.

[39]  Elaine Marsh,et al.  MUC-7 Evaluation of IE Technology: Overview of Results , 1998, MUC.

[40]  Eneko Agirre,et al.  Word Sense Disambiguation: Algorithms and Applications , 2007 .

[41]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[42]  Xianpei Han,et al.  A Generative Entity-Mention Model for Linking Entities with Knowledge Base , 2011, ACL.

[43]  Gerhard Weikum,et al.  From information to knowledge: harvesting entities and relationships from web sources , 2010, PODS '10.

[44]  Lise Getoor,et al.  Collective entity resolution in relational data , 2007, TKDD.

[45]  Rada Mihalcea,et al.  Wikify!: linking documents to encyclopedic knowledge , 2007, CIKM '07.

[46]  Julio Gonzalo,et al.  Web people search: results of the first evaluation and the plan for the second , 2008, WWW.

[47]  Mark Dredze,et al.  Entity Disambiguation for Knowledge Base Population , 2010, COLING.

[48]  Adam Kilgarriff,et al.  The Senseval-3 English lexical sample task , 2004, SENSEVAL@ACL.

[49]  Lan Nie,et al.  Resolving Surface Forms to Wikipedia Topics , 2010, COLING.

[50]  Heng Ji,et al.  Collaborative Ranking: A Case Study on Entity Linking , 2011, EMNLP.

[51]  David Yarowsky,et al.  Unsupervised Personal Name Disambiguation , 2003, CoNLL.

[52]  Vasudeva Varma,et al.  IIIT Hyderabad at TAC 2009 , 2008, TAC.

[53]  Laura Schweitzer,et al.  Advances In Kernel Methods Support Vector Learning , 2016 .

[54]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[55]  Valentin I. Spitkovsky,et al.  Stanford-UBC at TAC-KBP , 2009, TAC.