Tensor Decompositions for Identifying Directed Graph Topologies and Tracking Dynamic Networks

Directed networks are pervasive both in nature and engineered systems, often underlying the complex behavior observed in biological systems, microblogs and social interactions over the web, as well as global financial markets. Since their structures are often unobservable, in order to facilitate network analytics, one generally resorts to approaches capitalizing on measurable nodal processes to infer the unknown topology. Structural equation models (SEMs) are capable of incorporating exogenous inputs to resolve inherent directional ambiguities. However, conventional SEMs assume full knowledge of exogenous inputs, which may not be readily available in some practical settings. This paper advocates a novel SEM-based topology inference approach that entails factorization of a three-way tensor, constructed from the observed nodal data, using the well-known parallel factor (PARAFAC) decomposition. It turns out that second-order piecewise stationary statistics of exogenous variables suffice to identify the hidden topology. Capitalizing on the uniqueness properties inherent to high-order tensor factorizations, it is shown that topology identification is possible under reasonably mild conditions. In addition, to facilitate real-time operation and inference of time-varying networks, an adaptive (PARAFAC) tensor decomposition scheme that tracks the topology-revealing tensor factors is developed. Extensive tests on simulated and real stock quote data demonstrate the merits of the novel tensor-based approach.

[1]  Hamid R. Rabiee,et al.  Inferring Dynamic Diffusion Networks in Online Media , 2016, ACM Trans. Knowl. Discov. Data.

[2]  Nikos D. Sidiropoulos,et al.  Tensor Decomposition for Signal Processing and Machine Learning , 2016, IEEE Transactions on Signal Processing.

[3]  D. Kaplan Structural Equation Modeling: Foundations and Extensions , 2000 .

[4]  Georgios B. Giannakis,et al.  Kernel-Based Structural Equation Models for Topology Identification of Directed Networks , 2016, IEEE Transactions on Signal Processing.

[5]  A. Goldberger STRUCTURAL EQUATION METHODS IN THE SOCIAL SCIENCES , 1972 .

[6]  Bernhard Schölkopf,et al.  Uncovering the Temporal Dynamics of Diffusion Networks , 2011, ICML.

[7]  Gonzalo Mateos,et al.  Proximal-Gradient Algorithms for Tracking Cascades Over Social Networks , 2014, IEEE Journal of Selected Topics in Signal Processing.

[8]  Georgios B. Giannakis,et al.  Inference of Gene Regulatory Networks with Sparse Structural Equation Models Exploiting Genetic Perturbations , 2013, PLoS Comput. Biol..

[9]  Jure Leskovec,et al.  Inferring networks of diffusion and influence , 2010, KDD.

[10]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[11]  Lieven De Lathauwer,et al.  New Uniqueness Conditions for the Canonical Polyadic Decomposition of Third-Order Tensors , 2015, SIAM J. Matrix Anal. Appl..

[12]  Richard A. Harshman,et al.  Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-model factor analysis , 1970 .

[13]  Morteza Mardani,et al.  Subspace Learning and Imputation for Streaming Big Data Matrices and Tensors , 2014, IEEE Transactions on Signal Processing.

[14]  Nikos D. Sidiropoulos,et al.  Adaptive Algorithms to Track the PARAFAC Decomposition of a Third-Order Tensor , 2009, IEEE Transactions on Signal Processing.

[15]  Anima Anandkumar,et al.  A tensor approach to learning mixed membership community models , 2013, J. Mach. Learn. Res..

[16]  Christos Faloutsos,et al.  Kronecker Graphs: An Approach to Modeling Networks , 2008, J. Mach. Learn. Res..

[17]  Nikos D. Sidiropoulos,et al.  Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Eric D. Kolaczyk,et al.  Statistical Analysis of Network Data: Methods and Models , 2009 .

[19]  André Lima Férrer de Almeida,et al.  Joint data and connection topology recovery in collaborative wireless sensor networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[20]  E. Todeva Networks , 2007 .

[21]  Georgios B. Giannakis,et al.  Inferring directed network topologies via tensor factorization , 2016, 2016 50th Asilomar Conference on Signals, Systems and Computers.

[22]  E. David,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World , 2010 .

[23]  Georgios B. Giannakis,et al.  Tracking dynamic piecewise-constant network topologies via adaptive tensor factorization , 2016, 2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[24]  Jure Leskovec,et al.  Tensor Spectral Clustering for Partitioning Higher-order Network Structures , 2015, SDM.

[25]  Bernhard Schölkopf,et al.  Structure and dynamics of information pathways in online media , 2012, WSDM.

[26]  Gene H. Golub,et al.  Matrix computations , 1983 .

[27]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[28]  Olaf Sporns,et al.  Complex network measures of brain connectivity: Uses and interpretations , 2010, NeuroImage.

[29]  Dino Ienco,et al.  Do more views of a graph help? Community detection and clustering in multi-graphs , 2013, Proceedings of the 16th International Conference on Information Fusion.

[30]  Nikos D. Sidiropoulos,et al.  Egonet tensor decomposition for community identification , 2016, 2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[31]  Jure Leskovec,et al.  On the Convexity of Latent Social Network Inference , 2010, NIPS.

[32]  Georgios B. Giannakis,et al.  Identifiability of sparse structural equation models for directed and cyclic networks , 2013, 2013 IEEE Global Conference on Signal and Information Processing.

[33]  Nikos D. Sidiropoulos,et al.  Blind PARAFAC receivers for DS-CDMA systems , 2000, IEEE Trans. Signal Process..

[34]  J. Kruskal Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics , 1977 .

[35]  Chong-Yung Chi,et al.  A Khatri-Rao subspace approach to blind identification of mixtures of quasi-stationary sources , 2013, Signal Process..

[36]  Joan S. Weszka,et al.  A survey of threshold selection techniques , 1978 .

[37]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[38]  A. Stegeman,et al.  On Kruskal's uniqueness condition for the Candecomp/Parafac decomposition , 2007 .

[39]  Georgios B. Giannakis,et al.  Sparse graphical modeling of piecewise-stationary time series , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[40]  Dinh-Tuan Pham,et al.  Blind separation of instantaneous mixtures of nonstationary sources , 2001, IEEE Trans. Signal Process..