A topic modeling based bibliometric exploration of hydropower research

Scientific research articles can provide rich insights into practitioners׳ viewpoints around contentious policy making. Although much attention has been paid to hydropower development in the literature, few of them gathered systematic data and performed a large-scale review of scientific articles. In this study, we employed a topic modeling based bibliometric analysis to quantitatively evaluate global scientific literature of hydropower, with a time frame from 1994 to 2013. We analyzed 1726 scholarly articles highly related to hydropower, to discover the research development, current trends and intellectual structure of hydropower literature. Common bibliometric indicators show that hydropower research publications sustain a rapid growth rate, English is the dominant language, and the hotspots of hydropower research can be concluded as “fish”, “species”, “climate”, “emission”, “lake”, “sediment”, “Turkey”, etc. We established a 29-topic model to describe the intellectual structure of the 1726 articles, and employed cluster analysis and trend analysis to process the derived topics. We find that post construction issues of hydropower are more attractive for scholars than construction technology itself, and an interdisciplinary trend of hydropower research is emerging. The methodology reported in this study is expected to gain traction as a methodological strategy for energy research reviews and subsequently to promote energy policy making.

[1]  Kevin W. Boyack,et al.  Mapping the backbone of science , 2004, Scientometrics.

[2]  Yuh-Shan Ho,et al.  Bibliometric analysis of Patent Ductus Arteriosus treatments , 2004, Scientometrics.

[3]  Aie World Energy Outlook 2011 , 2001 .

[4]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[5]  Jian Zuo,et al.  Sustainability in hydropower development—A case study , 2013 .

[6]  Zheng Yan,et al.  Present situation and future prospect of hydropower in China , 2009 .

[7]  Fabio Franch (Wisdom of the Crowds)2: 2010 UK Election Prediction with Social Media , 2013 .

[8]  J. K. Kaldellis,et al.  Critical evaluation of the hydropower applications in Greece , 2008 .

[9]  Desiree Tullos,et al.  Assessing the influence of Environmental Impact Assessments on science and policy: an analysis of the Three Gorges Project. , 2009, Journal of environmental management.

[10]  Y. Ho Bibliometric analysis of biosorption technology in water treatment research from 1991 to 2004 , 2008 .

[11]  J. Milliman,et al.  50,000 dams later: Erosion of the Yangtze River and its delta , 2011 .

[12]  Sai Liang,et al.  An improved input–output model for energy analysis: A case study of Suzhou , 2010 .

[13]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[14]  M. Thring World Energy Outlook , 1977 .

[15]  Yoshiyuki Takeda,et al.  Nanobiotechnology as an emerging research domain from nanotechnology: A bibliometric approach , 2009, Scientometrics.

[16]  A. Pritchard,et al.  Statistical bibliography or bibliometrics , 1969 .

[17]  Francisco G. Montoya,et al.  The research on energy in spain: A scientometric approach , 2014 .

[18]  Haijun Wang,et al.  A historical review and bibliometric analysis of GPS research from 1991–2010 , 2012, Scientometrics.

[19]  William R. Lowry,et al.  Potential Focusing Projects and Policy Change , 2006 .

[20]  C. Bail The cultural environment: measuring culture with big data , 2014, Theory and Society.

[21]  C. Coglianese E-Rulemaking: Information Technology and the Regulatory Process , 2004 .

[22]  Kurt Hornik,et al.  topicmodels : An R Package for Fitting Topic Models , 2016 .

[23]  Yuh-Shan Ho,et al.  A bibliometric analysis of world volatile organic compounds research trends , 2010, Scientometrics.

[24]  Xu Yaoyang,et al.  Mapping biofuel field: A bibliometric evaluation of research output , 2013 .

[25]  Alan L. Porter,et al.  Clustering scientific documents with topic modeling , 2014, Scientometrics.

[26]  Kurt Hornik,et al.  Text Mining Infrastructure in R , 2008 .

[27]  Ibrahim Yuksel,et al.  Hydropower for sustainable water and energy development , 2010 .

[28]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Yuh-Shan Ho,et al.  Bibliometric analysis on global Parkinson's disease research trends during 1991–2006 , 2008, Neuroscience Letters.

[30]  Stuart W. Shulman,et al.  Electronic rulemaking: a public participation research agenda for the social sciences , 2003 .

[31]  Stephen E. Robertson,et al.  Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.

[32]  Yuh-Shan Ho,et al.  Bibliometric analysis of tsunami research , 2007, Scientometrics.

[33]  M. Ozturk,et al.  Hydropower–water and renewable energy in Turkey: Sources and policy , 2009 .

[34]  Jianhui Huang,et al.  Three-Gorges Dam--Experiment in Habitat Fragmentation? , 2003, Science.

[35]  Y. Lü,et al.  Three Gorges Project: Efforts and challenges for the environment , 2010 .

[36]  K. B. Johnson,et al.  Quantifying the Literature of Computer‐aided Instruction in Medical Education , 2000, Academic medicine : journal of the Association of American Medical Colleges.

[37]  John D. Lafferty,et al.  A correlated topic model of Science , 2007, 0708.3601.

[38]  Xixi Lu,et al.  Ten years of the Three Gorges Dam: a call for policy overhaul , 2013 .

[39]  Mukrimin Sevket Guney,et al.  Evaluation and measures to increase performance coefficient of hydrokinetic turbines , 2011 .

[40]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[41]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[42]  D. Popp,et al.  Renewable Energy Policies and Technological Innovation: Evidence Based on Patent Counts , 2008 .

[43]  Ibrahim Yuksel,et al.  Development of Hydropower: A Case Study in Developing Countries , 2007 .

[44]  Peng-hui Lyu,et al.  Scientometric trends and knowledge maps of global health systems research , 2014, Health Research Policy and Systems.

[45]  Tasawar Hayat,et al.  Bibliometric indicators for sustainable hydropower development , 2014 .

[46]  Kun Lu,et al.  Measuring author research relatedness: A comparison of word-based, topic-based, and author cocitation approaches , 2012, J. Assoc. Inf. Sci. Technol..

[47]  Michael Franklin,et al.  Driving Regulation , 2014 .

[48]  César A.C. Sequeira,et al.  Sodium borohydride as a fuel for the future , 2011 .

[49]  Leah G. Nichols A topic model approach to measuring interdisciplinarity at the National Science Foundation , 2014, Scientometrics.

[50]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[51]  A. Packer,et al.  Is there science beyond English? , 2007, EMBO reports.

[52]  Shailesh S. Kulkarni,et al.  The Use of Latent Semantic Analysis in Operations Management Research , 2014, Decis. Sci..

[53]  Y. Ho,et al.  Highly cited articles in biomass research: A bibliometric analysis , 2015 .

[54]  Aie,et al.  World Energy Outlook 2013 , 2013 .

[55]  H. B. Mann Nonparametric Tests Against Trend , 1945 .

[56]  M. Balat Hydropower Systems and Hydropower Potential in the European Union Countries , 2006 .

[57]  Elizabeth A. Corley,et al.  35 years and 160,000 articles: A bibliometric exploration of the evolution of ecology , 2009, Scientometrics.

[58]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[59]  Ayhan Demirbas,et al.  Global Renewable Energy Resources , 2006 .

[60]  Qian-Jin Zong,et al.  Doctoral dissertations of Library and Information Science in China: A co-word analysis , 2012, Scientometrics.