Measuring topic network centrality for identifying technology and technological development in online communities

Abstract Online communities are a rapidly growing knowledge repository that provides scholarly research, technical discussion, and social interactivity. This abundance of online information increases the difficulty of keeping up with new developments difficult for researchers and practitioners. Thus, we introduced a novel method that analyses both knowledge and social sentiment within the online community to discover the topical coverage of emerging technology and trace technological trends. The method utilizes the Weibull distribution and Shannon entropy to measure and link social sentiment with technological topics. Based on question-and-answer and social sentiment data from Zhihu, which is an online question and answer (Q&A) community with high-profile entrepreneurs and public intellectuals, we built an undirected weighting network and measured the centrality of nodes for technology identification. An empirical study on artificial intelligence technology trends supported by expert knowledge-based evaluation and cognition provides sufficient evidence of the method's ability to identify technology. We found that the social sentiment of hot technological topics presents a long-tailed distribution statistical pattern. High similarity between the topic popularity and emerging technology development trends appears in the online community. Finally, we discuss the findings in various professional fields that are widely applied to discover and track hot technological topics.

[1]  Yongtae Park,et al.  Proactive development of emerging technology in a socially responsible manner: Data-driven problem solving process using latent semantic analysis , 2018, Journal of Engineering and Technology Management.

[2]  L. Fenton The Sum of Log-Normal Probability Distributions in Scatter Transmission Systems , 1960 .

[3]  S. N. Singh,et al.  Mapping the intellectual structure of scientometrics: a co-word analysis of the journal Scientometrics (2005–2010) , 2014, Scientometrics.

[4]  Yuan Zhou,et al.  Monitoring and forecasting the development trends of nanogenerator technology using citation analysis and text mining , 2020 .

[5]  W. Weibull A Statistical Distribution Function of Wide Applicability , 1951 .

[6]  Xin Li,et al.  Identifying and monitoring the development trends of emerging technologies using patent analysis and Twitter data mining: The case of perovskite solar cell technology , 2019, Technological Forecasting and Social Change.

[7]  Martin G. Moehrle,et al.  Technological speciation as a source for emerging technologies. Using semantic patent analysis for the case of camera technology , 2019, Technological Forecasting and Social Change.

[8]  Fefie Dotsika,et al.  Identifying potentially disruptive trends by means of keyword network analysis , 2017 .

[9]  Antonio Messeni Petruzzelli,et al.  Unveiling the technological trends of augmented reality: A patent analysis , 2020, Comput. Ind..

[10]  Yu-Wei Chang,et al.  Potential Value of Patents With Provisional Applications: An Assessment of Bibliometric Approach , 2019, IEEE Transactions on Engineering Management.

[11]  Ingoo Han,et al.  Knowledge-based data mining of news information on the Internet using cognitive maps and neural networks , 2002, Expert Syst. Appl..

[12]  Marta Ortiz-de-Urbina-Criado,et al.  Knowledge areas, themes and future research on open data: A co-word analysis , 2019, Gov. Inf. Q..

[13]  Xin Li,et al.  Identifying the Development Trends of Emerging Technologies Using Patent Analysis and Web News Data Mining: The Case of Perovskite Solar Cell Technology , 2020, IEEE Transactions on Engineering Management.

[14]  Werner Kristjanpoller,et al.  A hybrid volatility forecasting framework integrating GARCH, artificial neural network, technical analysis and principal components analysis , 2018, Expert Syst. Appl..

[15]  Stefanie Bröring,et al.  Identifying first signals of emerging dominance in a technological innovation system: A novel approach based on patents , 2019, Technological Forecasting and Social Change.

[16]  Junghye Lee,et al.  Word2vec-based latent semantic analysis (W2V-LSA) for topic modeling: A study on blockchain technology trend analysis , 2020, Expert Syst. Appl..

[17]  Haoran Xie,et al.  Detecting latent topics and trends in educational technologies over four decades using structural topic modeling: A retrospective of all volumes of Computers & Education , 2020, Comput. Educ..

[18]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[19]  Anthony Breitzman,et al.  The Emerging Clusters Model: A tool for identifying emerging technologies across multiple patent systems , 2015 .

[20]  Daniele Rotolo,et al.  Emerging Technology , 2001 .

[21]  Shuo Xu,et al.  Emerging research topics detection with multiple machine learning models , 2019, J. Informetrics.

[22]  D. Merino Assessing technological forecasts for the fiber optic communications market , 1990 .

[23]  Fang Dong,et al.  Unfolding the convergence process of scientific knowledge for the early identification of emerging technologies , 2019, Technological Forecasting and Social Change.

[24]  B. Yoon,et al.  Identifying emerging Research and Business Development (R&BD) areas based on topic modeling and visualization with intellectual property right data , 2019, Technological Forecasting and Social Change.

[25]  Alberto Moro,et al.  Emerging technologies in the renewable energy sector: A comparison of expert review with a text mining software , 2020, Futures.

[26]  Mahmood H. Shubbak Advances in solar photovoltaics: Technology review and patent trends , 2019, Renewable and Sustainable Energy Reviews.

[27]  Yuya Kajikawa,et al.  Emerging topics in energy storage based on a large-scale analysis of academic articles and patents , 2020 .

[28]  Oh-Jin Kwon,et al.  Early identification of emerging technologies: A machine learning approach using multiple patent indicators , 2018 .

[29]  Choi Jae-woo,et al.  Themes and Trends in Korean Educational Technology Research: A Social Network Analysis of Keywords★ , 2014 .

[30]  So Young Sohn,et al.  Machine-learning-based deep semantic analysis approach for forecasting new technology convergence , 2020 .

[31]  Chen-Yuan Liu,et al.  Forecasting the development of the biped robot walking technique in Japan through S-curve model analysis , 2009, Scientometrics.

[32]  Alan L. Porter,et al.  Identification of technology development trends based on subject–action–object analysis: The case of dye-sensitized solar cells , 2015 .

[33]  Zheng Wang,et al.  Technology Forecasting Based on Semantic and Citation Analysis of Patents: A Case of Robotics Domain , 2022, IEEE Transactions on Engineering Management.

[34]  Tugrul U. Daim,et al.  Technology forecasting by analogy-based on social network analysis: The case of autonomous vehicles , 2019, Technological Forecasting and Social Change.

[35]  Guangquan Zhang,et al.  Topic-based technological forecasting based on patent data: A case study of Australian patents from 2000 to 2014 , 2017 .

[36]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[37]  Ozcan Saritas,et al.  A methodology for technology trend monitoring: the case of semantic technologies , 2016, Scientometrics.

[38]  Changyong Lee,et al.  Navigating a product landscape for technology opportunity analysis: A word2vec approach using an integrated patent-product database , 2020 .

[39]  Jian Du,et al.  Patent citations to scientific papers as early signs for predicting delayed recognition of scientific discoveries: a comparative study with instant recognition , 2019, ISSI.

[40]  Yongtae Park,et al.  Development of New Technology Forecasting Algorithm: Hybrid Approach for Morphology Analysis and Conjoint Analysis of Patent Information , 2007, IEEE Transactions on Engineering Management.

[41]  Mike Thelwall,et al.  Which health and biomedical topics generate the most Facebook interest and the strongest citation relationships? , 2020, Inf. Process. Manag..

[42]  Jeremy J. Michalek,et al.  Consistency and robustness of forecasting for emerging technologies: the case of Li-ion batteries for electric vehicles , 2017 .

[43]  Saad J. Almalki,et al.  A new modified Weibull distribution , 2013, Reliab. Eng. Syst. Saf..

[44]  R.M. Rodrfguez-Dagnino Some remarks regarding asymptotic packet loss in the Pareto/M/1/K queueing system , 2005, IEEE Communications Letters.

[45]  Sungjoon Lee,et al.  Forecasting Forward Patent Citations: Comparison of Citation-Lag Distribution, Tobit Regression, and Deep Learning Approaches , 2022, IEEE Transactions on Engineering Management.

[46]  D. Pachamanova,et al.  Topic modeling and technology forecasting for assessing the commercial viability of healthcare innovations , 2020 .

[47]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[48]  Ge Cheng,et al.  Forecasting emerging technologies: A supervised learning approach through patent analysis , 2017 .

[49]  Matúš Medo,et al.  Early identification of important patents: Design and validation of citation network metrics , 2019 .