论文信息 - Sparse matrix-variate Gaussian process blockmodels for network modeling

Sparse matrix-variate Gaussian process blockmodels for network modeling

We face network data from various sources, such as protein interactions and online social networks. A critical problem is to model network interactions and identify latent groups of network nodes. This problem is challenging due to many reasons. For example, the network nodes are interdependent instead of independent of each other, and the data are known to be very noisy (e.g., missing edges). To address these challenges, we propose a new relational model for network data, Sparse Matrix-variate Gaussian process Blockmodel (SMGB). Our model generalizes popular bilinear generative models and captures nonlinear network interactions using a matrix-variate Gaussian process with latent membership variables. We also assign sparse prior distributions on the latent membership variables to learn sparse group assignments for individual network nodes. To estimate the latent variables efficiently from data, we develop an efficient variational expectation maximization method. We compared our approaches with several state-of-the-art network models on both synthetic and real-world network datasets. Experimental results demonstrate SMGBs outperform the alternative approaches in terms of discovering latent classes or predicting unknown interactions.

[1] Michael I. Jordan,et al. An Introduction to Variational Methods for Graphical Models , 1999, Machine-mediated learning.

[2] Wei Chu,et al. Stochastic Relational Models for Discriminative Link Prediction , 2006, NIPS.

[3] Peter D. Hoff,et al. Latent Space Approaches to Social Network Analysis , 2002 .

[4] Peter D. Hoff,et al. Modeling homophily and stochastic equivalence in symmetric relational data , 2007, NIPS.

[5] S. Chib,et al. Bayesian analysis of binary and polychotomous response data , 1993 .

[6] Wei Chu,et al. Gaussian Process Models for Link Analysis and Transfer Learning , 2007, NIPS.

[7] Neil D. Lawrence,et al. Non-linear matrix factorization with Gaussian processes , 2009, ICML '09.

[8] A. Rukhin. Matrix Variate Distributions , 1999, The Multivariate Normal Distribution.

[9] Thomas L. Griffiths,et al. Discovering Latent Classes in Relational Data , 2004 .

[10] Edoardo M. Airoldi,et al. Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[11] Mark Girolami,et al. Variational Bayesian Multinomial Probit Regression with Gaussian Process Priors , 2006, Neural Computation.

[12] Zenglin Xu,et al. Sparse Matrix-Variate t Process Blockmodels , 2011, AAAI.

[13] Gal Chechik,et al. Euclidean Embedding of Co-occurrence Data , 2004, J. Mach. Learn. Res..

[14] T. Snijders,et al. Estimation and Prediction for Stochastic Blockmodels for Graphs with Latent Block Structure , 1997 .