Topic analysis of academic disciplines based on prolific and authoritative researchers

Purpose: The paper explores whether topic analysis (identification of the core contents, trends and topic distribution in a target field) can be performed with a lower-cost, more easily applicable method that relies on a small dataset, and how this small dataset can be obtained from the features of the publications.

Design/methodology/approach: The paper proposes a topic analysis method based on prolific and authoritative researchers (PARs). First, the authors identify PARs in a specific discipline by considering each author's number of publications and citations. Based on the research publications of the PARs (the small dataset), the authors then construct a keyword co-occurrence network and perform a topic analysis. Finally, the authors compare the method with the traditional method.

Findings: The authors found that using a small dataset (only 6.47% of the complete dataset in the authors' experiment) for topic analysis yields relatively high-quality and reliable results. The comparison reveals that the results of the proposed method are quite similar to those of traditional large-dataset analysis in terms of publication time distribution, research areas, core keywords and keyword network density.

Research limitations/implications: Expert opinions are needed to determine the parameters of the PAR identification algorithm. The proposed method may neglect the publications of junior researchers, and its biases should be discussed.

Practical implications: This paper offers a practical way to implement disciplinary analysis based on a small dataset, and to identify this dataset, by proposing a PARs-based topic analysis method. The proposed method presents a useful view of the data based on PARs that can produce results comparable to those of the traditional method, and thus can improve the effectiveness and reduce the cost of interdisciplinary topic analysis.

Originality/value: This paper proposes a PARs-based topic analysis method and verifies that topic analysis can be performed using a small dataset.
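
The abstract describes the pipeline only at a high level, so the following is a minimal sketch of the idea rather than the authors' implementation. It assumes that a PAR is any author who meets both a publication-count threshold and a citation-count threshold; the record fields (`authors`, `citations`, `keywords`), the function names and the threshold values are all hypothetical, and the abstract notes that the real parameters require expert judgment.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy records; the field names are illustrative assumptions.
papers = [
    {"authors": ["A", "B"], "citations": 40,
     "keywords": ["topic model", "bibliometrics"]},
    {"authors": ["A"], "citations": 25,
     "keywords": ["bibliometrics", "co-word analysis"]},
    {"authors": ["C"], "citations": 2,
     "keywords": ["altmetrics"]},
    {"authors": ["A", "C"], "citations": 10,
     "keywords": ["topic model", "co-word analysis", "bibliometrics"]},
]


def identify_pars(papers, min_pubs, min_citations):
    """Return authors meeting both thresholds (an assumed selection rule)."""
    pubs, cites = Counter(), Counter()
    for paper in papers:
        for author in paper["authors"]:
            pubs[author] += 1
            cites[author] += paper["citations"]
    return {a for a in pubs
            if pubs[a] >= min_pubs and cites[a] >= min_citations}


def cooccurrence_network(papers):
    """Count how often each unordered keyword pair shares a paper."""
    edges = Counter()
    for paper in papers:
        # Sort so that each pair is always stored in the same order.
        for u, v in combinations(sorted(set(paper["keywords"])), 2):
            edges[(u, v)] += 1
    return edges


def density(edges):
    """Density of an undirected simple graph: 2E / (N(N - 1)).

    Only keywords that appear in at least one pair count as nodes here.
    """
    nodes = {kw for pair in edges for kw in pair}
    n = len(nodes)
    return 0.0 if n < 2 else 2 * len(edges) / (n * (n - 1))


# Thresholds below are placeholders, not values from the paper.
pars = identify_pars(papers, min_pubs=2, min_citations=50)
small = [p for p in papers if pars.intersection(p["authors"])]
edges = cooccurrence_network(small)

print(sorted(pars))
print(edges.most_common(3))
print(density(edges))
```

Running the same `cooccurrence_network` and `density` functions on the full `papers` list gives the large-dataset baseline, so core keywords and network density can be compared between the two datasets in the way the abstract describes.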
