Optimal timescale for community detection in growing networks

Time-stamped data are increasingly available for many social, economic, and information systems that can be represented as networks growing with time. The World Wide Web, social contact networks, and citation networks of scientific papers and online news articles, for example, are of this kind. Static methods can be inadequate for the analysis of growing networks as they miss essential information on the system's dynamics. At the same time, time-aware methods require the choice of an observation timescale, yet we lack principled ways to determine it. We focus on the popular community detection problem which aims to partition a network's nodes into meaningful groups. We use a multi-layer quality function to show, on both synthetic and real datasets, that the observation timescale that leads to optimal communities is tightly related to the system's intrinsic aging timescale that can be inferred from the time-stamped network data. The use of temporal information leads to drastically different conclusions on the community structure of real information networks, which challenges the current understanding of the large-scale organization of growing networks. Our findings indicate that before attempting to assess structural patterns of evolving networks, it is vital to uncover the timescales of the dynamical processes that generated them.

[1]  Mason A. Porter,et al.  Community Detection in Temporal Multilayer Networks, with an Application to Correlation Networks , 2014, Multiscale Model. Simul..

[2]  Michalis Vazirgiannis,et al.  Clustering and Community Detection in Directed Networks: A Survey , 2013, ArXiv.

[3]  Jari Saramäki,et al.  Detection of timescales in evolving complex systems , 2016, Scientific Reports.

[4]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Petter Holme,et al.  Temporal network structures controlling disease spreading. , 2016, Physical review. E.

[6]  W. Edmunds,et al.  Dynamic social networks and the implications for the spread of infectious disease , 2008, Journal of The Royal Society Interface.

[7]  Cristopher Moore,et al.  Detectability thresholds and optimal algorithms for community structure in dynamic networks , 2015, ArXiv.

[8]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[9]  Andreas Spitz,et al.  Breaking the news: Extracting the sparse citation network backbone of online news articles , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[10]  W. Marsden I and J , 2012 .

[11]  Kristina Lerman,et al.  Centrality metric for dynamic networks , 2010, MLG '10.

[12]  Leto Peel,et al.  The ground truth about metadata and community detection in networks , 2016, Science Advances.

[13]  Y.-Y. Liu,et al.  The fundamental advantages of temporal networks , 2016, Science.

[14]  C. Elton,et al.  The Journal of Animal Ecology. , 1936 .

[15]  Santo Fortunato,et al.  A benchmark model to assess community structure in evolving networks , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[17]  D J PRICE,et al.  NETWORKS OF SCIENTIFIC PAPERS. , 1965, Science.

[18]  Charu C. Aggarwal,et al.  Graph Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.

[19]  Lutz Bornmann,et al.  Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references , 2014, J. Assoc. Inf. Sci. Technol..

[20]  Emilio Ferrara,et al.  Bots increase exposure to negative and inflammatory content in online social systems , 2018, Proceedings of the National Academy of Sciences.

[21]  Arkadiusz Stopczynski,et al.  Fundamental structures of dynamic social networks , 2015, Proceedings of the National Academy of Sciences.

[22]  Mason A. Porter,et al.  Relating modularity maximization and stochastic block models in multilayer networks , 2018, SIAM J. Math. Data Sci..

[23]  Sidney Redner,et al.  Community structure of the physical review citation network , 2009, J. Informetrics.

[24]  E. David,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World , 2010 .

[25]  L. Christophorou Science , 2018, Emerging Dynamics: Science, Energy, Society and Values.

[26]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[27]  Sergio Gómez,et al.  Size reduction of complex networks preserving modularity , 2007, ArXiv.

[28]  Danielle S. Bassett,et al.  Functional Network Dynamics of the Language System , 2016, Cerebral cortex.

[29]  Martin Rosvall,et al.  Memory in network flows and its effects on spreading dynamics and community detection , 2013, Nature Communications.

[30]  Ravi Kumar,et al.  Structure and evolution of online social networks , 2006, KDD '06.

[31]  M. Mézard,et al.  Journal of Statistical Mechanics: Theory and Experiment , 2011 .

[32]  E. Todeva Networks , 2007 .

[33]  Christopher W. Lynn,et al.  The physics of brain network structure, function and control , 2018, Nature Reviews Physics.

[34]  Chris Arney,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World (Easley, D. and Kleinberg, J.; 2010) [Book Review] , 2013, IEEE Technology and Society Magazine.

[35]  M. Newman Community detection in networks: Modularity optimization and maximum likelihood are equivalent , 2016, Physical review. E.

[36]  Michael Golosovsky,et al.  Growing complex network of citations of scientific papers: Modeling and measurements. , 2016, Physical review. E.

[37]  J. Herskowitz,et al.  Proceedings of the National Academy of Sciences, USA , 1996, Current Biology.

[38]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[39]  Hao Liao,et al.  Ranking in evolving complex networks , 2017, ArXiv.

[40]  Jukka-Pekka Onnela,et al.  Community Structure in Time-Dependent, Multiscale, and Multiplex Networks , 2009, Science.

[41]  Ingo Scholtes,et al.  Causality-driven slow-down and speed-up of diffusion in non-Markovian temporal networks , 2013, Nature Communications.

[42]  Nitesh V. Chawla,et al.  Representing higher-order dependencies in networks , 2015, Science Advances.

[43]  Tanya Y. Berger-Wolf,et al.  A framework for community identification in dynamic social networks , 2007, KDD '07.

[44]  Santo Fortunato,et al.  Attention Decay in Science , 2015, J. Informetrics.

[45]  Jari Saramäki,et al.  Temporal Networks , 2011, Encyclopedia of Social Network Analysis and Mining.

[46]  Santo Fortunato,et al.  Community detection in networks: A user guide , 2016, ArXiv.

[47]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[48]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[49]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Matús Medo,et al.  Statistical validation of high-dimensional models of growing networks , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[51]  Arif Mahmood,et al.  Using Geodesic Space Density Gradients for Network Community Detection , 2017, IEEE Transactions on Knowledge and Data Engineering.

[52]  Lada A. Adamic,et al.  Internet: Growth dynamics of the World-Wide Web , 1999, Nature.

[53]  Matúš Medo,et al.  Randomizing growing networks with a time-respecting null model. , 2017, Physical review. E.

[54]  E. A. Leicht,et al.  Large-scale structure of time evolving citation networks , 2007, 0706.0015.

[55]  Petter Holme,et al.  Modern temporal network theory: a colloquium , 2015, The European Physical Journal B.

[56]  Yi-Cheng Zhang,et al.  Solving the apparent diversity-accuracy dilemma of recommender systems , 2008, Proceedings of the National Academy of Sciences.

[57]  Giulio Cimini,et al.  Temporal effects in the growth of networks , 2011, Physical review letters.

[58]  Jari Saramäki,et al.  Effects of time window size and placement on the structure of an aggregated communication network , 2012, EPJ Data Science.

[59]  O. Bagasra,et al.  Proceedings of the National Academy of Sciences , 1914, Science.

[60]  S. N. Dorogovtsev,et al.  Evolution of networks with aging of sites , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[61]  Zhuo-Ming Ren,et al.  Nestedness in complex networks: Observation, emergence, and implications , 2019, Physics Reports.

[62]  Santo Fortunato,et al.  Consensus clustering in complex networks , 2012, Scientific Reports.

[63]  Mark Newman,et al.  Networks: An Introduction , 2010 .