Analyzing Email Patterns with Timelines on Researcher Data

This paper proposes a procedure that easily extracts a feature that helps differentiate between similar researcher names in articles. We examined email patterns and their timelines to identify researchers. Our statistical analysis results show multiple email address usage patterns are found in the case of approximately 43% researchers, and 5% of the patterns are overlapped. Base on the statistics, we conclude that the identification of researchers is still required to enhance performance of the researcher-centric analytics systems and applications.

[1]  Qinghua Zheng,et al.  Combining machine learning and human judgment in author disambiguation , 2011, CIKM '11.

[2]  E. Vardell,et al.  SciVal Experts: A Collaborative Tool , 2011, Medical reference services quarterly.

[3]  Michael Ley,et al.  The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives , 2002, SPIRE.

[4]  Divesh Srivastava,et al.  Linking temporal records , 2011, Frontiers of Computer Science.

[5]  William W. Cohen,et al.  Contextual search and name disambiguation in email using graphs , 2006, SIGIR.

[6]  Madian Khabsa,et al.  Digital commons , 2020, Internet Policy Rev..

[7]  Philip S. Yu,et al.  ADANA: Active Name Disambiguation , 2011, 2011 IEEE 11th International Conference on Data Mining.

[8]  Wojciech Rytter,et al.  Extracting Powers and Periods in a String from Its Runs Structure , 2010, SPIRE.

[9]  Won-Kyung Sung,et al.  Prescriptive Analytics System for Improving Research Power , 2013, 2013 IEEE 16th International Conference on Computational Science and Engineering.