GLAD: Learning Sparse Graph Recovery

Recovering sparse conditional independence graphs from data is a fundamental problem in machine learning with wide applications. A popular formulation of the problem is $\ell_1$-regularized maximum likelihood estimation, and many convex optimization algorithms have been designed to solve this formulation and recover the graph structure. Recently, there has been a surge of interest in learning algorithms directly from data, which in this setting means learning a mapping from the empirical covariance matrix to the sparse precision matrix. This is a challenging task, since symmetric positive definiteness (SPD) and sparsity of the output matrix are not easy to enforce in learned algorithms, and a direct mapping from data to the precision matrix may require many parameters. We propose a deep learning architecture, GLAD, which uses an Alternating Minimization (AM) algorithm as its model inductive bias and learns the model parameters via supervised learning. We show that GLAD learns a very compact and effective model for recovering sparse graphs from data.
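
The abstract does not spell out the model details, but the alternating minimization scheme it refers to is the standard split of the $\ell_1$-regularized log-determinant objective, $\min_\Theta -\log\det\Theta + \mathrm{tr}(S\Theta) + \rho\|\Theta\|_1$, into a closed-form log-determinant step and a soft-thresholding step. The PyTorch sketch below is a minimal illustration of unrolling such an AM loop into a trainable network; the class name `UnrolledAM` and the choice of learned per-iteration scalars are our assumptions (the actual GLAD model parameterizes these quantities with small neural networks), not the authors' released implementation.

```python
import torch
import torch.nn as nn


def soft_threshold(x, tau):
    # Elementwise soft-thresholding: the proximal operator of the l1 norm.
    return torch.sign(x) * torch.clamp(torch.abs(x) - tau, min=0.0)


class UnrolledAM(nn.Module):
    # Hypothetical unrolled alternating-minimization network for sparse
    # precision-matrix recovery. Here the per-iteration penalty rho_k and
    # quadratic weight lam_k are learned scalars, trained by supervised
    # learning; GLAD itself uses small networks for these quantities.

    def __init__(self, n_iters: int = 20):
        super().__init__()
        self.log_rho = nn.Parameter(torch.zeros(n_iters))  # rho_k = exp(.) > 0
        self.log_lam = nn.Parameter(torch.zeros(n_iters))  # lam_k = exp(.) > 0

    def forward(self, S: torch.Tensor) -> torch.Tensor:
        # S: (d, d) empirical covariance; returns an estimated sparse precision.
        d = S.shape[0]
        Z = torch.eye(d, dtype=S.dtype, device=S.device)  # init at identity
        for k in range(self.log_rho.shape[0]):
            rho, lam = self.log_rho[k].exp(), self.log_lam[k].exp()
            # Theta-step: closed-form minimizer of
            #   -logdet(T) + tr(S @ T) + ||T - Z||_F^2 / (2 * lam),
            # obtained by eigendecomposition, which keeps Theta symmetric PD.
            m, V = torch.linalg.eigh(Z - lam * S)
            theta_eig = 0.5 * (m + torch.sqrt(m ** 2 + 4.0 * lam))
            Theta = (V * theta_eig) @ V.T
            # Z-step: prox of rho * ||.||_1, enforcing elementwise sparsity.
            Z = soft_threshold(Theta, rho * lam)
        return Z
```

Such a model would be trained end to end, e.g. by minimizing the Frobenius distance between its output and the ground-truth precision matrix over synthetic (covariance, precision) pairs. Because the Theta-step is solved by eigendecomposition, symmetric positive definiteness holds for Theta at every iteration, while the soft-thresholding step supplies the sparsity.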
