Scientometrics: How to perform a Big Data Trend Analysis with ScienceMiner

This paper describes the results of the implementation of an application that was designed under the design science principles. The purpose of this application is to identify trends in science. First, the status quo of similar applications as well as the knowledge base about data mining in the field of scientometrics is analyzed. Afterwards, the implementation as well as the evaluation of our application is described. Our web-based application allows to search for contributions (literature and internet, e.g., twitter, news), executes several data mining methods and visualizes the results in seven different ways. Each visualization has some filters and further control elements. It is the first application to provide the complete process from data acquisition to data visualization in an automated way.

[1]  Gobinda G. Chowdhury,et al.  Bibliometric cartography of information retrieval research by using co-word analysis , 2001, Inf. Process. Manag..

[2]  Gregory Piatetsky-Shapiro,et al.  Advances in Knowledge Discovery and Data Mining , 2004, Lecture Notes in Computer Science.

[3]  Ang Li,et al.  Research on the semantic-based co-word analysis , 2011, Scientometrics.

[4]  A. Kinney National scientific facilities and their science impact on nonbiomedical research , 2007, Proceedings of the National Academy of Sciences.

[5]  Lilian Weng,et al.  Information diffusion on online social networks , 2014 .

[6]  Cassidy R. Sugimoto,et al.  Mapping world scientific collaboration: Authors, institutions, and countries , 2012, J. Assoc. Inf. Sci. Technol..

[7]  Lucy Amez,et al.  Citation measures at the micro level: Influence of publication age, field, and uncitedness , 2012, J. Assoc. Inf. Sci. Technol..

[8]  Ilkka Pölönen,et al.  Research literature clustering using diffusion maps , 2013, J. Informetrics.

[9]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[10]  Masaki Eto,et al.  Evaluations of context-based co-citation searching , 2012, Scientometrics.

[11]  Frans Coenen,et al.  Finding "interesting" trends in social networks using frequent pattern mining and self organizing maps , 2012, Knowl. Based Syst..

[12]  M. D. Myers,et al.  Qualitative Research in Business & Management , 2008 .

[13]  Yuen-Hsien Tseng,et al.  A comparison of methods for detecting hot topics , 2009, Scientometrics.

[14]  Gohar Feroz Khan,et al.  Network of the core: mapping and visualizing the core of scientific domains , 2011, Scientometrics.

[15]  Mike Thelwall Journal impact evaluation: a webometric perspective , 2012, Scientometrics.

[16]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[17]  Lei Cui,et al.  Integration of three visualization methods based on co-word analysis , 2011, Scientometrics.

[18]  Cécile Favre,et al.  Information diffusion in online social networks: a survey , 2013, SGMD.

[19]  Richard T. Watson,et al.  Analyzing the Past to Prepare for the Future: Writing a Literature Review , 2002, MIS Q..

[20]  L. Egghe,et al.  Theory and practise of the g-index , 2006, Scientometrics.

[21]  Junping Qiu,et al.  Research on the cross-citation relationship of core authors in scientometrics , 2012, Scientometrics.

[22]  Bing He,et al.  Mining enriched contextual information of scientific collaboration: A meso perspective , 2011, J. Assoc. Inf. Sci. Technol..

[23]  Woo Hyoung Lee,et al.  How to identify emerging research fields using scientometrics: An example in the field of Information Security , 2008, Scientometrics.

[24]  Massimo Franceschet,et al.  A cluster analysis of scholar and journal bibliometric indicators , 2009, J. Assoc. Inf. Sci. Technol..

[25]  Alan R. Hevner,et al.  Design Science in Information Systems Research , 2004, MIS Q..

[26]  William R. Hersh,et al.  Reducing workload in systematic review preparation using automated citation classification. , 2006, Journal of the American Medical Informatics Association : JAMIA.

[27]  Massih-Reza Amini,et al.  A co-classification approach to learning from multilingual corpora , 2010, Machine Learning.

[28]  Salvatore T. March,et al.  Design and natural science research on information technology , 1995, Decis. Support Syst..

[29]  Jerome K. Vanclay,et al.  Impact factor: outdated artefact or stepping-stone to journal certification? , 2011, Scientometrics.

[30]  Stan Matwin,et al.  A new algorithm for reducing the workload of experts in performing systematic reviews , 2010, J. Am. Medical Informatics Assoc..

[31]  Ying Ding,et al.  Topic-based PageRank on author cocitation networks , 2011, J. Assoc. Inf. Sci. Technol..

[32]  Katy Börner,et al.  Mixed-indicators model for identifying emerging research areas , 2011, Scientometrics.

[33]  Claudio Castellano,et al.  Analysis of bibliometric indicators for individual scholars in a large data set , 2013, Scientometrics.