Linkify: Enhancing Text Reading Experience by Detecting and Linking Helpful Entities to Users

We frequently encounter unfamiliar entity names (e.g., a person's name or a geographic location) while reading texts such as newspapers, magazines, and web pages. When this occurs, we typically perform a sequence of tedious actions: select the entity name, submit it to a search engine, and obtain detailed information from websites. In this paper, we present Linkify, a tool that enhances text reading by automatically converting entity names into links and displaying a widget that contains links to several relevant websites. We also propose a novel method for evaluating the helpfulness of entities to users using supervised machine learning with a set of carefully designed features. Experimental results show that our method significantly outperforms existing state-of-the-art methods.

[1]  Ian H. Witten,et al.  An effective, low-cost measure of semantic relatedness obtained from Wikipedia links , 2008 .

[2]  Giuseppe Ottaviano,et al.  Fast and Space-Efficient Entity Linking for Queries , 2015, WSDM.

[3]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[4]  Dan Roth,et al.  Relational Inference for Wikification , 2013, EMNLP.

[5]  M. de Rijke,et al.  Adding semantics to microblog posts , 2012, WSDM '12.

[6]  Reiner Kraft,et al.  Leveraging context in user-centric entity detection systems , 2007, CIKM '07.

[7]  Ganesh Ramakrishnan,et al.  Collective annotation of Wikipedia entities in web text , 2009, KDD.

[8]  Fernando Pereira,et al.  Collective Entity Resolution with Multi-Focal Attention , 2016, ACL.

[9]  Hiroyuki Shindo,et al.  Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation , 2016, CoNLL.

[10]  Ming-Wei Chang,et al.  To Link or Not to Link? A Study on End-to-End Tweet Entity Linking , 2013, NAACL.

[11]  Daniel S. Weld,et al.  Design Challenges for Entity Linking , 2015, TACL.

[12]  Jianfeng Gao,et al.  Modeling Interestingness with Deep Neural Networks , 2014, EMNLP.

[13]  Doug Downey,et al.  Local and Global Algorithms for Disambiguation to Wikipedia , 2011, ACL.

[14]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[15]  Paolo Ferragina,et al.  Fast and Accurate Annotation of Short Texts with Wikipedia Pages , 2010, IEEE Software.