Spectral methods for analyzing and visualizing networks : an introduction

Network analysis begins with data that describes the set of relationships among the members of a system. The goal of analysis is to obtain from the low-level relational data a higher-level description of the structure of the system which identifies various kinds of patterns in the set of relationships. These patterns will be based on the way individuals are related to other individuals in the network. Some approaches to network analysis look for clusters of individuals who are tightly connected to one another; some look for sets of individuals who have similar patterns of relations to the rest of the network. Other methods don’t “look for” anything in particular — instead, they construct a continuous multidimensional representation of the network in which the coordinates of the individuals can be further analyzed to obtain a variety of kinds of information about them and their relation to the rest of the network. One approach to this is to choose a set of axes in the multidimensional space occupied by the network and rotate them so that the first axis points in the direction of the greatest variability in the data; the second axis, orthogonal to the first, points in the direction of greatest remaining variability, and so on. This set of axes is a coordinate system that can be used to describe the relative positions of the set of points in the data. Most of the variability in the locations of points will be accounted for by the first few dimensions of this coordinate system. The coordinates of the points along each axis will be an eigenvector, and the length of the projection will be an eigenvalue. The set of all eigenvalues is the spectrum of the network. Spectral methods (eigendecomposition) have been a part of graph theory for over a century. Network researchers have used spectral methods either implicitly or explicitly since the late 1960's, when computers became generally accessible in most universities. The eigenvalues of a network are intimately connected to important topological features such as maximum distance across the network (diameter), presence of cohesive clusters, long paths and bottlenecks, and how random the network is. The associated eigenvectors can be used as a natural coordinate system for graph visualization; they also provide methods for discovering clusters and other local features. When combined with other, easily obtained network statistics (e.g., node degree), they can be used to describe a variety of network properties, such as degree of robustness (i.e., tolerance to removal of selected nodes or links), and other structural properties, and the relationship of these properties to node or link attributes in large, complex, multivariate networks. We introduce three types of spectral analysis for graphs and describe some of their mathematical properties. We discuss the strengths and weaknesses of each type and show how they can be used to understand network structure. These discussions are accompanied by interactive graphical displays of small (n=50) and moderately large (n=5000) networks. Throughout, we give special attention to sparse matrix methods which allow rapid, efficient storage and analysis of large networks. We briefly describe algorithms and analytic strategies that allow spectral analysis and identification of clusters in very large networks (n>1,000,000).

[1]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[2]  Dr. M. G. Worster Methods of Mathematical Physics , 1947, Nature.

[3]  M. Fisher On hearing the shape of a drum , 1966 .

[4]  M. Fiedler Algebraic connectivity of graphs , 1973 .

[5]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[6]  F. Harary,et al.  The spectral approach to determining the number of walks in a graph , 1979 .

[7]  B. Parlett The Symmetric Eigenvalue Problem , 1981 .

[8]  Ronald E. Rice,et al.  The NEGOPY network analysis program , 1981 .

[9]  B. Parlett,et al.  On estimating the largest eigenvalue with the Lanczos algorithm , 1982 .

[10]  I. Chavel Eigenvalues in Riemannian geometry , 1984 .

[11]  N. Alon,et al.  il , , lsoperimetric Inequalities for Graphs , and Superconcentrators , 1985 .

[12]  A. Sokal,et al.  Bounds on the ² spectrum for Markov chains and Markov processes: a generalization of Cheeger’s inequality , 1988 .

[13]  D. L. Powers Graph Partitioning by Eigenvectors , 1988 .

[14]  F. Bien Constructions of telephone networks by group representations , 1989 .

[15]  F. Chung Diameters and eigenvalues , 1989 .

[16]  B. Mohar,et al.  Eigenvalues and the max-cut problem , 1990 .

[17]  D. Cvetkovic,et al.  The largest eigenvalue of a graph: A survey , 1990 .

[18]  V. Sunder,et al.  The Laplacian spectrum of a graph , 1990 .

[19]  P. Diaconis,et al.  Geometric Bounds for Eigenvalues of Markov Chains , 1991 .

[20]  Andrew B. Kahng,et al.  New spectral methods for ratio cut partitioning and clustering , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[21]  J. A. López del Val,et al.  Principal Components Analysis , 2018, Applied Univariate, Bivariate, and Multivariate Statistics Using Python.

[22]  J. Friedman Some geometric aspects of graphs and their eigenfunctions , 1993 .

[23]  Russell Merris,et al.  The Laplacian Spectrum of a Graph II , 1994, SIAM J. Discret. Math..

[24]  Fan Chung Graham,et al.  An Upper Bound on the Diameter of a Graph from Eigenvalues Associated with its Laplacian , 1994, SIAM J. Discret. Math..

[25]  Alexander Lubotzky,et al.  Discrete groups, expanding graphs and invariant measures , 1994, Progress in mathematics.

[26]  Bruce Hendrickson,et al.  An Improved Spectral Graph Partitioning Algorithm for Mapping Parallel Computations , 1995, SIAM J. Sci. Comput..

[27]  T. Chow The -spectrum and spanning trees of tensor products of bipartite graphs , 1997 .

[28]  John Beidler,et al.  Data Structures and Algorithms , 1996, Wiley Encyclopedia of Computer Science and Engineering.

[29]  Chris Walshaw,et al.  Journal of Graph Algorithms and Applications a Multilevel Algorithm for Force-directed Graph-drawing , 2022 .

[30]  Andrew Seary,et al.  Eigen Analysis of Networks , 2000, J. Soc. Struct..

[31]  Ulrik Brandes,et al.  Journal of Graph Algorithms and Applications Visual Ranking of Link Structures , 2022 .

[32]  James Moody,et al.  Peer influence groups: identifying dense clusters in large networks , 2001, Soc. Networks.

[33]  A. Barabasi,et al.  Spectra of "real-world" graphs: beyond the semicircle law. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[34]  Inderjit S. Dhillon,et al.  Co-clustering documents and words using bipartite spectral graph partitioning , 2001, KDD '01.

[35]  Gordon F. Royle,et al.  Algebraic Graph Theory , 2001, Graduate texts in mathematics.

[36]  Michael William Newman,et al.  The Laplacian spectrum of graphs , 2001 .

[37]  K. Goh,et al.  Spectra and eigenvectors of scale-free networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.