Text mining: An analysis of research published under the subject category ‘Information Science Library Science’ in Web of Science Database during 1999-2013

Purpose – The purpose of this paper was to analyse text mining (TM) literature indexed in the Web of Science (WoS) under the “Information Science Library Science” subcategory. More specifically, it analyses the chronological growth of TM literature, and the major countries, institutions, departments and individuals contributing to TM literature. Collaboration in TM research is also analysed. Design/methodology/approach – Bibliographic and citation data required for this research were retrieved from the WoS database. TM being a multidisciplinary field, the search was restricted to “Information Science Library Science” subcategory in the WoS. A comprehensive query statement covering all synonyms of “text mining” was prepared using the Boolean operator “OR”. Microsoft Excel and HistCite software were used for data analysis. Pajek and VoSviewer were used for data visualization. Findings – It was found that USA is the major producer of TM research literature, and the highest number of papers were published in the Journal of The American Medical Informatics. Columbia University ranked first both in number of articles and citations received in the top ten institutes publishing TM literature. It was also observed that six of the top ten subdivisions of institutions are either from medicine or medical informatics or biomedical information. H.C. Chen and C. Friedman were seen to be the most prolific authors. Research limitations/implications – The paper analyses articles on TM published during 1999-2013 in WoS under the subcategory Information Science Library Science’. Originality/value – The paper is based on empirical data exclusively gathered for this research.

[1]  Aditya K. Gupta,et al.  Psychological well-being and burden in caregivers of patients with schizophrenia - , 2015 .

[2]  Can Huang,et al.  Nanoscience and technology publications and patents: a review of social science studies and search strategies , 2011 .

[3]  Sheikh Mohammed Shahabuddin Mapping neuroscience research in India - a bibliometric approach , 2013 .

[4]  Yuntao Pan,et al.  Scientific progress regarding neural regeneration in the Web of Science: A 10-year bibliometric analysis , 2013, Neural regeneration research.

[5]  T. Massoud,et al.  Trends in performance indicators of neuroimaging anatomy research publications: A bibliometric study of major neuroradiology journal output over four decades based on web of science database , 2015, Clinical anatomy.

[6]  Christopher W. Belter,et al.  Measuring the Value of Research Data: A Citation Analysis of Oceanographic Data Sets , 2014, PloS one.

[7]  Yuh-Shan Ho,et al.  Top-cited articles in environmental sciences: merits and demerits of citation analysis. , 2012, The Science of the total environment.

[8]  Thangavel Rajagopal,et al.  Research output in pheromone biology: a case study of India , 2012, Scientometrics.

[9]  S. Carroll,et al.  Microsurgery: The Top 50 Classic Papers in Plastic Surgery: A Citation Analysis , 2014, Archives of plastic surgery.

[10]  Taemin Kim Park,et al.  Asian and Pacific Region Authorship Characteristics in Leading Library and Information Science Journals , 2008 .

[11]  Karin M. Verspoor,et al.  Biomedical Text Mining: State-of-the-Art, Open Problems and Future Challenges , 2014, Interactive Knowledge Discovery and Data Mining in Biomedical Informatics.

[12]  Weimin Ni,et al.  The top cited articles on glioma stem cells in Web of Science , 2013, Neural regeneration research.

[13]  Fernando R. Mazarrón,et al.  Bibliometric analysis of research activity in the “Agronomy” category from the Web of Science, 1997–2011 , 2013 .

[14]  Nathan Efron,et al.  Citation Analysis of the Contact Lens Field , 2012, Optometry and vision science : official publication of the American Academy of Optometry.

[15]  William R. Hersh,et al.  A Survey of Current Work in Biomedical Text Mining , 2005 .

[16]  Dursun Delen,et al.  Seeding the survey and analysis of research literature with text mining , 2008, Expert Syst. Appl..

[17]  Sada Bihari Sahu,et al.  Impact and Influence of Two Premier Physics Journals: A Comparative Bibliometric Study , 2014 .

[18]  Subbiah Arunachalam,et al.  Mapping of cholera research in India using HistCite , 2010 .

[19]  Padmini Srinivasan,et al.  Text mining: Generating hypotheses from MEDLINE , 2004, J. Assoc. Inf. Sci. Technol..

[20]  Maria Cláudia Cabrini Grácio,et al.  Studies of Author Cocitation Analysis: A Bibliometric Approach for Domain Analysis , 2014 .

[21]  Ulrich Schmoch,et al.  Impact of bibliometric studies on the publication behaviour of authors , 2013, Scientometrics.

[22]  T. Park AUTHORSHIP FROM THE ASIA AND PACIFIC REGION IN TOP LIBRARY AND INFORMATION SCIENCE JOURNALS , 2006 .

[23]  Chandrakanta Swain,et al.  Bibliometric analysis of Library Review from 2007 to 2011 , 2013 .

[24]  Loet Leydesdorff,et al.  An evaluation of impacts in “Nanoscience & nanotechnology”: steps towards standards for citation analysis , 2011, Scientometrics.

[25]  Martin Rajman,et al.  From Text to Knowledge: Document Processing and Visualization: a Text Mining Approach , 2004 .