Analysis of Computational Science Papers from ICCS 2001-2016 using Topic Modeling and Graph Theory

Abstract This paper presents results of topic modeling and network models of topics using the ICCS corpus, which contains domain-specific (computational science) papers over sixteen years (a total of 5695 papers). We discuss topical structures of ICCS, how these topics evolve over time in response to the topicality of various problems, technologies and methods, and how all these topics relate to one another. This analysis illustrates multidisciplinary research and collaborations among scientific communities, by constructing static and dynamic networks from the topic modeling results and the authors’ keywords. The results of this study give insights about the past and future trends of core discussion topics in computational science. We used the Non-negative Matrix Factorization(NMF) topic modeling algorithm to discover topics and labeled and grouped results hierarchically. We used Gephi to study static networks of topics, and an R library called DyA to analyze the dynamic networks of topics.

[1]  Peter M. A. Sloot,et al.  Big Data Meets Computational Science, Preface for ICCS 2014 , 2014, ICCS.

[2]  Peter M. A. Sloot,et al.  Computational Science at the Gates of Nature, Preface for ICCS 2015 , 2015, ICCS.

[3]  Klavdiya Bochenina ICCS keyword clouds , 2017 .

[4]  Dinesh Manocha,et al.  LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[5]  Peter M. A. Sloot,et al.  Data through the Computational Lens, Preface for ICCS 2016 , 2016, ICCS.

[6]  Justin Grimmer,et al.  A Bayesian Hierarchical Topic Model for Political Texts: Measuring Expressed Agendas in Senate Press Releases , 2010, Political Analysis.

[7]  Derek Greene,et al.  How Many Topics? Stability Analysis for Topic Models , 2014, ECML/PKDD.

[8]  Chong Wang,et al.  Continuous Time Dynamic Topic Models , 2008, UAI.

[9]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[10]  Derek Greene,et al.  Exploring the Political Agenda of the European Parliament Using a Dynamic Topic Modeling Approach , 2016, Political Analysis.

[11]  Andrew McCallum,et al.  Topics over time: a non-Markov continuous-time model of topical trends , 2006, KDD '06.

[12]  Carl T. Bergstrom,et al.  Mapping Change in Large Networks , 2008, PloS one.

[13]  David J. Newman,et al.  Probabilistic topic decomposition of an eighteenth-century American newspaper , 2006, J. Assoc. Inf. Sci. Technol..

[14]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[15]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[16]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[17]  Derek Greene,et al.  An analysis of the coherence of descriptors in topic modeling , 2015, Expert Syst. Appl..