Potential index: Revealing the future impact of research topics based on current knowledge networks

Abstract: As the volume of scientific publications grows ever faster, identifying prominent research trends is important for scientists and institutions. While many researchers have attempted to map the current state of scientific research, more effort is needed to reveal research topics that are likely to become influential. In this study, we investigate the relationship between the scientific impact of a research topic and the structure of its knowledge network. A novel indicator, the potential index, is proposed to model topic impact from this structural information. It is an immediate indicator with two components, knowledge novelty and knowledge diversity, which are operationalized using the concepts of betweenness centrality and network entropy. The empirical results show that the potential index is a good predictor of future topic impact, with a high R² and a positive correlation. Its advantage persists when it is used as an input feature of regression models. Moreover, the gap between the proposed index and other features widens as model complexity increases. Quantitative and qualitative analyses of the topic evolution process are also conducted to explain changes in the proposed indicator. This study contributes to research on scientific impact modeling by establishing an explicit relationship between the impact of topics and their knowledge structure, and is thus helpful in predicting the potential impact of research topics.
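
To make the two structural quantities named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It computes betweenness centrality (via Brandes' algorithm) and a Shannon entropy over node degree shares on a toy keyword co-occurrence network. The keywords, the edge list, the helper names, and the choice of degree shares as the entropy distribution are all assumptions made for illustration; the abstract does not state how the two components are defined in detail or combined into the potential index, so no combined score is computed here.

```python
import math
from collections import deque


def build_adjacency(edges):
    """Build an undirected adjacency map from (u, v) pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def betweenness(adj):
    """Unnormalized betweenness centrality (Brandes) for an
    undirected, unweighted graph."""
    score = {v: 0.0 for v in adj}
    for s in adj:
        order = []                        # nodes in BFS visiting order
        preds = {v: [] for v in adj}      # shortest-path predecessors
        sigma = {v: 0 for v in adj}       # number of shortest paths from s
        dist = {v: -1 for v in adj}
        sigma[s], dist[s] = 1, 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:           # first time w is reached
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in adj}     # accumulated dependencies
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: c / 2 for v, c in score.items()}


def degree_entropy(adj):
    """Shannon entropy (bits) of each node's share of total degree.
    Assumption: this is one simple notion of network entropy."""
    total = sum(len(nbrs) for nbrs in adj.values())
    if total == 0:
        return 0.0
    shares = [len(nbrs) / total for nbrs in adj.values() if nbrs]
    return -sum(p * math.log2(p) for p in shares)


# Hypothetical topic network: three keywords linked in a chain.
edges = [("citation", "network"), ("network", "entropy")]
adj = build_adjacency(edges)
bc = betweenness(adj)
h = degree_entropy(adj)
print(bc["network"], h)
```

In this toy chain, the middle keyword lies on the only shortest path between the other two, so it is the only node with non-zero betweenness. On a real topic network the same two functions would be applied to the keyword graph extracted for each topic and time window.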
