Automatic Method for Author Name Disambiguation Using Social Networks

A name is a key feature for distinguishing people, but we often fail to distinguish people because an author may have multiple names or multiple authors may share the same name. Such name ambiguity problems affect the performance of the document retrieval, web search and database integration. Especially, in bibliographic information, a number of errors may be included since there are different authors with the same name or an author name may be misspelled or represented with an abbreviation. For solving these problems, it is necessary to disambiguate the names inputted into the database. In this paper, we propose a method to solve the name ambiguity by using social networks constructed based on the relations among authors. We evaluated the effectiveness of the proposed system based on the DBLP data that offer computer science bibliographic information.

[1]  Ying Chen,et al.  Towards Robust Unsupervised Personal Name Disambiguation , 2007, EMNLP-CoNLL.

[2]  Ondrej Lhoták,et al.  Estimating precision by random sampling (poster abstract) , 1999, SIGIR '99.

[3]  Vandana Sundaram,et al.  The American Journal of Public Health. , 1945, American journal of public health and the nation's health.

[4]  Witold Pedrycz,et al.  Data Mining: A Knowledge Discovery Approach , 2007 .

[5]  Henry A. Kautz,et al.  Hardening soft information sources , 2000, KDD '00.

[6]  David J. DeWitt,et al.  Duplicate record elimination in large data files , 1983, TODS.

[7]  Jaeyoung Yang,et al.  Detecting Collaborative Fields Using Social Networks , 2008, 2008 Fourth International Conference on Networked Computing and Advanced Information Management.

[8]  Enrico Motta,et al.  Solving Semantic Ambiguity to Improve Semantic Web based Ontology Matching , 2007, OM.

[9]  Donald B. Johnson,et al.  Finding All the Elementary Circuits of a Directed Graph , 1975, SIAM J. Comput..

[10]  Stasha Ann Bown Larsen,et al.  Record Linkage , 2018, Encyclopedia of Database Systems.

[11]  Randy Goebel,et al.  DBconnect: mining research community on DBLP data , 2007, WebKDD/SNA-KDD '07.

[12]  Karl Branting A comparative evaluation of name-matching algorithms , 2003, ICAIL.

[13]  Salvatore J. Stolfo,et al.  The merge/purge problem for large databases , 1995, SIGMOD '95.

[14]  Cheng Li,et al.  Two supervised learning approaches for name disambiguation in author citations , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[15]  Alan W. Biermann,et al.  Coreference, cross-document coreference, and information extraction methodologies , 1998 .