Analysis of two-mode network data using nonnegative matrix factorization

Abstract Nonnegative matrix factorization has been offered as a fast and effective method for analyzing nonnegative two-mode proximity data. The goal is to structurally represent a nonnegative proximity matrix as the product of two lower-dimensional nonnegative matrices. Goodness of fit is typically measured as the sum of the squared deviations between the observed matrix elements and the estimated elements yielded by the product of the matrix factors. The preservation of nonnegativity of the factors has been touted as a major practical advantage over comparable decomposition methods (e.g., principal component analysis, singular-value decomposition, spectral analysis) because of its propensity for a more coherent additive interpretation. We provide descriptions of the nonnegative matrix factorization model and a rescaled gradient descent algorithm for estimating the factors. A small numerical example is provided along with application to three network matrices from the empirical literature.

[1]  Michael W. Berry,et al.  Algorithms and applications for approximate nonnegative matrix factorization , 2007, Comput. Stat. Data Anal..

[2]  P. Paatero,et al.  Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values† , 1994 .

[3]  P. Pattison,et al.  Composing a civic arena: Publics, projects, and social settings☆ , 2000 .

[4]  Charles Chubb Collapsing binary data for algebraic multidimensional representation , 1986 .

[5]  A. Householder,et al.  Discussion of a set of points in terms of their mutual distances , 1938 .

[6]  C. Coombs A theory of data. , 1965, Psychology Review.

[7]  Michael J. Brusco,et al.  An interactive multiobjective programming approach to combinatorial data analysis , 2001 .

[8]  Lawrence Hubert,et al.  Problems of seriation using a subject by item response matrix. , 1974 .

[9]  H. White,et al.  “Structural Equivalence of Individuals in Social Networks” , 2022, The SAGE Encyclopedia of Research Design.

[10]  Lawrence Hubert,et al.  Blockmodels from the bond energy approach , 1990 .

[11]  Martin G. Everett,et al.  Network analysis of 2-mode data , 1997 .

[12]  P. Groenen,et al.  The tunneling method for global optimization in multidimensional scaling , 1996 .

[13]  R. Breiger The Duality of Persons and Groups , 1974 .

[14]  W. Goodenough,et al.  Human Group , 1951, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[15]  M. Brusco,et al.  A variable neighborhood search method for generalized blockmodeling of two-mode binary matrices , 2007 .

[16]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[17]  M. Brusco,et al.  Integer Programs for One- and Two-Mode Blockmodeling Based on Prespecified Image Matrices for Structural and Regular Equivalence. , 2009, Journal of mathematical psychology.

[18]  Ronald L. Breiger,et al.  Lattices and dimensional representations: matrix decompositions and ordering structures , 2002, Soc. Networks.

[19]  L. Freeman Finding Social Groups: A Meta-Analysis of the Southern Women Data , 2003 .

[20]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[21]  P. Arabie,et al.  An algorithm for clustering relational data with applications to social network analysis and comparison with multidimensional scaling , 1975 .

[22]  John P. Boyd,et al.  Computing core/periphery structures and permutation tests for social relations data , 2004, Soc. Networks.

[23]  M. Brusco,et al.  Inducing a blockmodel structure of two-mode binary data using seriation procedures , 2006 .

[24]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[25]  A. Comrey The Minimum Residual Method of Factor Analysis , 1962 .

[26]  Vladimir Batagelj,et al.  Exploratory Social Network Analysis with Pajek , 2005 .

[27]  Patrick J. F. Groenen,et al.  Optimization Strategies for Two-Mode Partitioning , 2009, J. Classif..

[28]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[29]  Philippa Pattison,et al.  A factorization procedure for finite algebras , 1982 .

[30]  Martin G. Everett,et al.  Models of core/periphery structures , 2000, Soc. Networks.

[31]  C. Eckart,et al.  The approximation of one matrix by another of lower rank , 1936 .

[32]  David A. Smith,et al.  Computing continuous core/periphery structures for social relations data with MINRES/SVD , 2010, Soc. Networks.

[33]  L. Freeman Q-analysis and the structure of friendship networks , 1980 .

[34]  E. B. Andersen,et al.  Modern factor analysis , 1961 .

[35]  Chris H. Q. Ding,et al.  On the Equivalence of Nonnegative Matrix Factorization and Spectral Clustering , 2005, SDM.

[36]  Donald R. Baker A Structural Analysis of Social Work Journal Network , 1992 .

[37]  I. Jolliffe Principal Component Analysis , 2002 .

[38]  Patrik O. Hoyer,et al.  Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..

[39]  P. Pattison Algebraic models for social networks , 1993 .

[40]  P. Boeck,et al.  Hierarchical classes: Model and data analysis , 1988 .

[41]  Dawn Iacobucci,et al.  Social networks with two sets of actors , 1990 .

[42]  Vladimir Batagelj,et al.  Generalized blockmodeling of two-mode network data , 2004, Soc. Networks.

[43]  Jing Hua,et al.  Non-negative matrix factorization for semi-supervised data clustering , 2008, Knowledge and Information Systems.

[44]  Michael J. Brusco,et al.  Optimal Least-Squares Unidimensional Scaling: Improved Branch-and-Bound Procedures and Comparison to Dynamic Programming , 2005 .