Structural regularity exploration in multidimensional networks via Bayesian inference

Multidimensional networks, networks with multiple kinds of relations, widely exist in various fields in the real world, such as sociology, chemistry, biology and economics. One fundamental task of network analysis is to explore network structure, including assortative structure (i.e., community structure), disassortative structure (e.g., bipartite structure) and mixture structure, that is, to find structural regularities in networks. There are two aspects of structural regularity exploration: (1) group partition—how to partition nodes of networks into different groups, and (2) group number—how many groups in networks. Most existing structural regularity exploration methods for multidimensional networks need to pre-assume the structure type (e.g., the community structure) and to give the group number of networks, among which the structure type is a guide to group partition. However, the structure type and group number are usually unavailable in advance. To explore structural regularities in multidimensional networks well without pre-assuming which type of structure they have, we propose a novel feature aggregation method based on a mixture model and Bayesian theory, called the multidimensional Bayesian mixture (MBM) model. To automatically determine the group number of multidimensional networks, we further extend the MBM model using Bayesian nonparametric theory to a new model, called the multidimensional Bayesian nonparametric mixture (MBNPM) model. Experiments conducted on a number of synthetic and real multidimensional networks show that the MBM model outperforms other related models on most networks and the MBNPM model is comparable to the MBM model.

[1]  Kan Li,et al.  A Unified Model for Community Detection of Multiplex Networks , 2014, WISE.

[2]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.

[3]  Mark E. J. Newman,et al.  Structure and inference in annotated networks , 2015, Nature Communications.

[4]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[5]  Thomas L. Griffiths,et al.  Learning Systems of Concepts with an Infinite Relational Model , 2006, AAAI.

[6]  Jiangtao Peng,et al.  Error bounds of multi-graph regularized semi-supervised classification , 2009, Inf. Sci..

[7]  Francesco Calabrese,et al.  ABACUS: frequent pAttern mining-BAsed Community discovery in mUltidimensional networkS , 2013, Data Mining and Knowledge Discovery.

[8]  Thomas Seidl,et al.  Finding density-based subspace clusters in graphs with feature vectors , 2012, Data Mining and Knowledge Discovery.

[9]  Samuel Kaski,et al.  A simple infinite topic mixture for rich graphs and relational data , 2008 .

[10]  Karl Moll Community Detection in Multidimensional Social Networks , 2014 .

[11]  J. Coleman,et al.  The Diffusion of an Innovation Among Physicians , 1957 .

[12]  K. Foster,et al.  The ecology of the microbiome: Networks, competition, and stability , 2015, Science.

[13]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[14]  Radford M. Neal Slice Sampling , 2003, The Annals of Statistics.

[15]  P. Pattison,et al.  New Specifications for Exponential Random Graph Models , 2006 .

[16]  Ana L. N. Fred,et al.  Robust data clustering , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[17]  Nando de Freitas,et al.  An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[18]  Vincenza Carchiolo,et al.  Communities Unfolding in Multislice Networks , 2016, CompleNet.

[19]  Vito Latora,et al.  Structural measures for multiplex networks. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Mohammad Khoshneshin A graphical model for multi-relational social network analysis , 2013 .

[21]  Holger Hoffmann,et al.  RMiCS: a robust approach for mining coherent subgraphs in edge-labeled multi-layer graphs , 2013, SSDBM.

[22]  Huan Liu,et al.  Uncovering cross-dimension group structures in multi-dimensional networks , 2009, SDM 2009.

[23]  Padhraic Smyth,et al.  Modeling relational events via latent classes , 2010, KDD.

[24]  Buzhou Tang,et al.  Overlapping community detection in networks with positive and negative links , 2014 .

[25]  Guandong Xu,et al.  Community Detection in Multi-relational Social Networks , 2013, WISE.

[26]  E A Leicht,et al.  Mixture models and exploratory analysis in networks , 2006, Proceedings of the National Academy of Sciences.

[27]  Samuel J. Gershman,et al.  A Tutorial on Bayesian Nonparametric Models , 2011, 1106.2697.

[28]  Zoubin Ghahramani,et al.  An Infinite Latent Attribute Model for Network Data , 2012, ICML.

[29]  Giacomo Indiveri,et al.  Synthesizing cognition in neuromorphic electronic systems , 2013, Proceedings of the National Academy of Sciences.

[30]  Wei Chu,et al.  Relational Learning with Gaussian Processes , 2006, NIPS.

[31]  John Guiver,et al.  Learning to rank with SoftRank and Gaussian processes , 2008, SIGIR '08.

[32]  G. Mauri,et al.  Gene switching rate determines response to extrinsic perturbations in the self-activation transcriptional network motif , 2016, Scientific Reports.

[33]  Pedro M. Domingos,et al.  Learning the structure of Markov logic networks , 2005, ICML.

[34]  Giulio Caravagna,et al.  Gene switching rate determines response to extrinsic perturbations in a transcriptional network motif , 2014, 1410.6993.

[35]  Jian Yu,et al.  Combining a popularity-productivity stochastic block model with a discriminative-content model for general structure detection. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  Ludovic Denoyer,et al.  Classification and annotation in social corpora using multiple relations , 2011, CIKM '11.

[37]  Huan Liu,et al.  Community Detection in Multi-Dimensional Networks , 2015 .

[38]  Xueqi Cheng,et al.  Exploring the structural regularities in networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  Kristian Kersting,et al.  Multi-Relational Learning with Gaussian Processes , 2009, IJCAI.

[40]  Iadh Ounis,et al.  Proceedings of the 20th ACM international conference on Information and knowledge management , 2011, CIKM 2011.

[41]  S. Walker Invited comment on the paper "Slice Sampling" by Radford Neal , 2003 .

[42]  Peter J. Mucha,et al.  Multiscale, and Multiplex Networks Community Structure in Time-Dependent, , 2010 .