Feature-Based Researcher Identification Framework Using Timeline Data

Studies are actively ongoing for better understanding and strengthening the capabilities of researchers. To do so requires an accurate diagnosis and analysis of such researchers. Therefore, data of each researcher must be collected and be identified in a big-data environment. Consequently, researcher-name identification has emerged as an important issue. This paper proposes a framework for collecting, refining, identifying, and publicly offering researcher data. For identifying authors’ name, the proposed framework extracts timeline based patterns that make help to identify the same name authors with their representative attributes such as emails and affiliations. The results of the proposed framework based on timeline patterns, show a 69.5 % average author-identification rate given a group of otherwise unidentified authors.

[1]  Min-Hee Cho,et al.  Prescriptive Analytics System for Scholar Research Performance Enhancement , 2014, HCI.

[2]  Hanmin Jung,et al.  Linked Open Data System for Scientific Data Sets , 2014, IPaMin@KONVENS.

[3]  Christian Bizer,et al.  D2R Server - Publishing Relational Databases on the Semantic Web , 2004 .

[4]  Jianzhong Li,et al.  EIF: A Framework of Effective Entity Identification , 2010, WAIM.

[5]  K Ham,et al.  OpenRefine (version 2.5). . Free, open-source tool for cleaning and transforming data. , 2013 .

[6]  G. Geethakumari,et al.  Detecting misinformation in online social networks using cognitive psychology , 2014, Human-centric Computing and Information Sciences.

[7]  Neil R. Smalheiser,et al.  A probabilistic similarity metric for Medline records: A model for author name disambiguation , 2005, J. Assoc. Inf. Sci. Technol..

[8]  Michael Ley,et al.  DBLP - Some Lessons Learned , 2009, Proc. VLDB Endow..

[9]  William W. Cohen,et al.  Contextual search and name disambiguation in email using graphs , 2006, SIGIR.

[10]  Philip S. Yu,et al.  ADANA: Active Name Disambiguation , 2011, 2011 IEEE 11th International Conference on Data Mining.

[11]  Vassilios Peristeras,et al.  Re-using Cool URIs: Entity Reconciliation Against LOD Hubs , 2011, LDOW.

[12]  Neil R. Smalheiser,et al.  Author name disambiguation , 2009, Annu. Rev. Inf. Sci. Technol..

[13]  Won-Kyung Sung,et al.  On co-authorship for author disambiguation , 2009, Inf. Process. Manag..

[14]  Won-Kyung Sung,et al.  Prescriptive Analytics System for Improving Research Power , 2013, 2013 IEEE 16th International Conference on Computational Science and Engineering.

[15]  Juan Carlos Augusto,et al.  Flexible context aware interface for ambient assisted living , 2014, Human-centric Computing and Information Sciences.

[16]  Masashi Katsumata Task context-aware e-mail platform for collaborative tasks , 2014, Human-centric Computing and Information Sciences.

[17]  Peter Suber Open Access Overview , 2012 .

[18]  Dave E. Marcial,et al.  Developing a Web-Based Knowledge Product Outsourcing System at a University , 2013, J. Inf. Process. Syst..

[19]  Michael Ley,et al.  The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives , 2002, SPIRE.

[20]  Hanmin Jung,et al.  Analyzing Email Patterns with Timelines on Researcher Data , 2014, JIST.

[21]  Laura Paglione,et al.  ORCID: a system to uniquely identify researchers , 2012, Learn. Publ..