Network Estimation by Mixing: Adaptivity and More

Networks analysis has been commonly used to study the interactions between units of complex systems. One problem of particular interest is learning the network’s underlying connection pattern given a single and noisy instantiation. While many methods have been proposed to address this problem in recent years, they usually assume that the true model belongs to a known class, which is not verifiable in most real-world applications. Consequently, network modeling based on these methods either suffers from model misspecification or relies on additional model selection procedures that are not well understood in theory and can potentially be unstable in practice. To address this difficulty, we propose a mixing strategy that leverages available arbitrary models to improve their individual performances. The proposed method is computationally efficient and almost tuning-free; thus, it can be used as an off-the-shelf method for network modeling. We show that the proposed method performs equally well as the oracle estimate when the true model is included as individual candidates. More importantly, the method remains robust and outperforms all current estimates even when the models are misspecified. Extensive simulation examples are used to verify the advantage of the proposed mixing method. Evaluation of link prediction performance on 385 real-world networks from six domains also demonstrates the universal competitiveness of the mixing method across multiple domains.

[1]  M. Pensky,et al.  Sparse Popularity Adjusted Stochastic Block Model , 2019, J. Mach. Learn. Res..

[2]  Andrei Z. Broder,et al.  Workshop on Algorithms and Models for the Web Graph , 2007, WAW.

[3]  Jing Lei,et al.  Network Cross-Validation for Determining the Number of Communities in Network Data , 2014, 1411.1715.

[4]  Alexandre B. Tsybakov,et al.  Optimal Rates of Aggregation , 2003, COLT.

[5]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[6]  Zongming Ma,et al.  Optimal hypothesis testing for stochastic block models with growing degrees , 2017, ArXiv.

[7]  E. Levina,et al.  Estimating network edge probabilities by neighborhood smoothing , 2015, 1509.08588.

[8]  Jing Lei A goodness-of-fit test for stochastic block models , 2014, 1412.4857.

[9]  D. F. Saldana,et al.  How Many Communities Are There? , 2014, 1412.1684.

[10]  X ZhengAlice,et al.  A Survey of Statistical Network Models , 2010 .

[11]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[12]  Can M. Le Edge Sampling Using Local Network Information , 2021 .

[13]  P. Diaconis,et al.  Graph limits and exchangeable random graphs , 2007, 0712.2749.

[14]  David Haussler,et al.  Sequential Prediction of Individual Sequences Under General Loss Functions , 1998, IEEE Trans. Inf. Theory.

[15]  R. Tibshirani,et al.  Combining Estimates in Regression and Classification , 1996 .

[16]  A. Tsybakov,et al.  Aggregation for Gaussian regression , 2007, 0710.3654.

[17]  Jing Lei,et al.  Cross-Validation With Confidence , 2017, Journal of the American Statistical Association.

[18]  Yuguo Chen,et al.  A block model for node popularity in networks with community structure , 2018 .

[19]  Jiashun Jin,et al.  Optimal adaptivity of signed-polygon statistics for network testing , 2019, The Annals of Statistics.

[20]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[21]  P. Bickel,et al.  Likelihood-based model selection for stochastic block models , 2015, 1502.02069.

[22]  Jon M. Kleinberg,et al.  Subgraph frequencies: mapping the empirical and extremal geography of large graph collections , 2013, WWW.

[23]  Purnamrita Sarkar,et al.  Hypothesis testing for automated community detection in networks , 2013, ArXiv.

[24]  Purnamrita Sarkar,et al.  Provable Estimation of the Number of Blocks in Block Models , 2017, AISTATS.

[25]  Chao Gao,et al.  Achieving Optimal Misclassification Proportion in Stochastic Block Models , 2015, J. Mach. Learn. Res..

[26]  Harrison H. Zhou,et al.  Rate-optimal graphon estimation , 2014, 1410.5837.

[27]  Béla Bollobás,et al.  The phase transition in inhomogeneous random graphs , 2007, Random Struct. Algorithms.

[28]  Leo Breiman,et al.  Stacked regressions , 2004, Machine Learning.

[29]  Edoardo M. Airoldi,et al.  A Consistent Histogram Estimator for Exchangeable Graph Models , 2014, ICML.

[30]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[31]  Nitesh V. Chawla,et al.  New perspectives and methods in link prediction , 2010, KDD.

[32]  Discussion of ‘Network cross-validation by edge sampling’ , 2020 .

[33]  I. Dhillon,et al.  A New Projected Quasi-Newton Approach for the Nonnegative Least Squares Problem , 2006 .

[35]  Bin Yu,et al.  Spectral clustering and the high-dimensional stochastic blockmodel , 2010, 1007.1684.

[36]  Nikhil Srivastava,et al.  Graph sparsification by effective resistances , 2008, SIAM J. Comput..

[37]  Jiashun Jin,et al.  Estimating network memberships by simplex vertex hunting , 2017 .

[38]  O. Catoni The Mixture Approach to Universal Model Selection , 1997 .

[39]  Yuhong Yang Adaptive Regression by Mixing , 2001 .

[40]  A. Rinaldo,et al.  Consistency of spectral clustering in stochastic block models , 2013, 1312.2050.

[41]  S. Chatterjee,et al.  Matrix estimation by Universal Singular Value Thresholding , 2012, 1212.1247.

[42]  A. Juditsky,et al.  Functional aggregation for nonparametric regression , 2000 .

[43]  C. Steglich,et al.  Applying SIENA: An illustrative analysis of the co-evolution of adolescents’ friendship networks, taste in music, and alcohol consumption , 2006 .

[44]  Edoardo M. Airoldi,et al.  Stochastic blockmodel approximation of a graphon: Theory and consistent estimation , 2013, NIPS.

[45]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[46]  Can M. Le,et al.  Concentration and regularization of random graphs , 2015, Random Struct. Algorithms.

[47]  Zhuang Ma,et al.  Universal Latent Space Model Fitting for Large Networks with Edge Covariates , 2020, J. Mach. Learn. Res..

[48]  S. H. Khasteh,et al.  A survey on exponential random graph models: an application perspective , 2020, PeerJ Comput. Sci..

[49]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[50]  Matthias Hein,et al.  Non-negative least squares for high-dimensional linear models: consistency and sparse recovery without regularization , 2012, 1205.0953.

[51]  Charles X. Ling,et al.  Using AUC and accuracy in evaluating learning algorithms , 2005, IEEE Transactions on Knowledge and Data Engineering.

[52]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[53]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[54]  Peter D. Hoff,et al.  Modeling homophily and stochastic equivalence in symmetric relational data , 2007, NIPS.

[55]  Nikita Zhivotovskiy,et al.  Uniform Hanson-Wright type concentration inequalities for unbounded entries via the entropy method , 2018, 1812.03548.

[56]  E. Levina,et al.  Network cross-validation by edge sampling , 2016, Biometrika.

[57]  Edoardo M. Airoldi,et al.  A Survey of Statistical Network Models , 2009, Found. Trends Mach. Learn..

[58]  Matthias Hein,et al.  Sparse recovery by thresholded non-negative least squares , 2011, NIPS.

[59]  Tianxi Li,et al.  Informative core identification in complex networks , 2021, ArXiv.

[60]  Arash A. Amini,et al.  Adjusted chi-square test for degree-corrected block models , 2020, ArXiv.

[61]  Yi Yu,et al.  The restricted consistency property of leave-nv-out cross-validation for high-dimensional variable selection , 2018 .

[62]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .

[63]  Edoardo M. Airoldi,et al.  Stacking models for nearly optimal link prediction in complex networks , 2019, Proceedings of the National Academy of Sciences.

[64]  N. Meinshausen Sign-constrained least squares estimation for high-dimensional regression , 2012, 1202.0889.

[65]  Donghui Chen,et al.  Nonnegativity constraints in numerical analysis , 2009, The Birth of Numerical Analysis.

[66]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[67]  Karl Rohe,et al.  The blessing of transitivity in sparse and stochastic networks , 2013, 1307.2302.

[68]  Can M. Le,et al.  Estimating the number of communities in networks by spectral methods , 2015, ArXiv.

[69]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[70]  Purnamrita Sarkar,et al.  Hierarchical community detection by recursive bi-partitioning , 2018 .

[71]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[72]  D. Aldous Representations for partially exchangeable arrays of random variables , 1981 .

[73]  Yuhong Yang Mixing Strategies for Density Estimation , 2000 .

[74]  Kevin Lee,et al.  A review of dynamic network models with latent variables. , 2017, Statistics surveys.

[75]  Linear regression and its inference on noisy network-linked data , 2020, 2007.00803.

[76]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[77]  Chao Gao,et al.  Testing for Global Network Structure Using Small Subgraph Statistics , 2017, ArXiv.

[78]  Edward R. Scheinerman,et al.  Random Dot Product Graph Models for Social Networks , 2007, WAW.

[79]  Aad van der Vaart,et al.  The Cross-Validated Adaptive Epsilon-Net Estimator , 2006 .

[80]  László Lovász,et al.  Limits of dense graph sequences , 2004, J. Comb. Theory B.

[81]  Subhabrata Sen,et al.  Testing degree corrections in stochastic block models , 2017, Annales de l'Institut Henri Poincaré, Probabilités et Statistiques.

[82]  Harry Crane,et al.  Probabilistic Foundations of Statistical Network Analysis , 2018 .