Edinburgh Research Explorer Sparse matrix computations for dynamic network centrality

Time sliced networks describing human-human digital interactions are typically large and sparse. This is the case, for example, with pairwise connectivity describing social media, voice call or physical proximity, when measured over seconds, minutes or hours. However, if we wish to quantify and compare the overall time-dependent centrality of the network nodes, then we should account for the global flow of information through time. Because the time-dependent edge structure typically allows information to diffuse widely around the network, a natural summary of sparse but dynamic pairwise interactions will generally take the form of a large dense matrix. For this reason, computing nodal centralities for a time-dependent network can be extremely expensive in terms of both computation and storage; much more so than for a single, static network. In this work, we focus on the case of dynamic communicability, which leads to broadcast and receive centrality measures. We derive a new algorithm for computing time-dependent centrality that works with a sparsified version of the dynamic communicability matrix. In this way, the computation and storage requirements are reduced to those of a sparse, static network at each time point. The new algorithm is justified from first principles and then tested on a large scale data set. We find that even with very stringent sparsity requirements (retaining no more than ten times the number of nonzeros in the individual time slices), the algorithm accurately reproduces the list of highly central nodes given by the underlying full system. This allows us to capture centrality over time with a minimal level of storage and with a cost that scales only linearly with the number of time points. We also describe and test three variants of the proposed algorithm that require fewer parameters and achieve a further reduction in the computational cost.

[1]  Michele Benzi,et al.  Dynamic communicability and epidemic spread: a case study on an empirical dynamic contact network , 2016, J. Complex Networks.

[2]  Mason A. Porter,et al.  Eigenvector-Based Centrality Measures for Temporal Networks , 2015, Multiscale Model. Simul..

[3]  Desmond J. Higham,et al.  Preserving Sparsity in Dynamic Network Computations , 2016, COMPLEX NETWORKS.

[4]  Alexander V. Mantzaris,et al.  Asymmetry through time dependency , 2016 .

[5]  Jari Saramäki,et al.  Temporal Networks , 2011, Encyclopedia of Social Network Analysis and Mining.

[6]  Edo Liberty,et al.  Near-Optimal Entrywise Sampling for Data Matrices , 2013, NIPS.

[7]  Igor M. Sokolov,et al.  Unfolding accessibility provides a macroscopic approach to temporal networks , 2012, Physical review letters.

[8]  Alexander V. Mantzaris,et al.  Dynamic Communicability Predicts Infectiousness , 2013 .

[9]  Mark C. Parsons,et al.  Communicability across evolving networks. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Cecilia Mascolo,et al.  Characterising temporal distance and reachability in mobile and online social networks , 2010, CCRV.

[11]  V Latora,et al.  Small-world behavior in time-varying graphs. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Tamara G. Kolda,et al.  Link Prediction on Evolving Data Using Matrix and Tensor Factorizations , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[13]  Cecilia Mascolo,et al.  Temporal distance metrics for social network analysis , 2009, WOSN '09.

[14]  Tore Opsahl,et al.  Clustering in weighted networks , 2009, Soc. Networks.

[15]  Sanjeev Arora,et al.  A Fast Random Sampling Algorithm for Sparsifying Matrices , 2006, APPROX-RANDOM.

[16]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[17]  Gábor Szabó,et al.  Structure of complex networks , 2005 .

[18]  Ronald Fagin,et al.  Comparing top k lists , 2003, SODA '03.

[19]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[20]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .