Projection-free Graph-based Classifier Learning using Gershgorin Disc Perfect Alignment

In semi-supervised graph-based binary classifier learning, a subset of known labels x̂i are used to infer unknown labels, assuming that the label signal x is smooth with respect to a similarity graph specified by a Laplacian matrix. When restricting labels xi to binary values, the problem is NP-hard. While a conventional semidefinite programming (SDP) relaxation can be solved in polynomial time using, for example, the alternating direction method of multipliers (ADMM), the complexity of iteratively projecting a candidate matrix M onto the positive semi-definite (PSD) cone (M 0) remains high. In this paper, leveraging a recent linear algebraic theory called Gershgorin disc perfect alignment (GDPA), we propose a fast projection-free method by solving a sequence of linear programs (LP) instead. Specifically, we first recast the SDP relaxation to its SDP dual, where a feasible solution H 0 can be interpreted as a Laplacian matrix corresponding to a balanced signed graph sans the last node. To achieve graph balance, we split the last node into two that respectively contain the original positive and negative edges, resulting in a new Laplacian H̄. We repose the SDP dual for solution H̄, then replace the PSD cone constraint H̄ 0 with linear constraints derived from GDPA— sufficient conditions to ensure H̄ is PSD—so that the optimization becomes an LP per iteration. Finally, we extract predicted labels from our converged LP solution H̄. Experiments show that our algorithm enjoyed a 40× speedup on average over the next fastest scheme while retaining comparable label prediction performance.

[1]  Gene Cheung,et al.  Signed Graph Metric Learning via Gershgorin Disc Perfect Alignment , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  R. Varga Geršgorin And His Circles , 2004 .

[3]  Mikhail Belkin,et al.  Regularization and Semi-supervised Learning on Large Graphs , 2004, COLT.

[4]  Carlos D. Castillo,et al.  Biconvex Relaxation for Semidefinite Programming in Computer Vision , 2016, ECCV.

[5]  Gerald Matz,et al.  Signal Processing on Signed Graphs: Fundamentals and Potentials , 2020, IEEE Signal Processing Magazine.

[6]  Yang Zheng,et al.  Fast ADMM for Sum-of-Squares Programs Using Partial Orthogonality , 2017, IEEE Transactions on Automatic Control.

[7]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[8]  P. Vandergheynst,et al.  Semi-Supervised Learning with Spectral Graph Wavelets , 2011 .

[9]  David P. Williamson,et al.  Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming , 1995, JACM.

[10]  Ioannis A. Kakadiaris,et al.  An Overview and Empirical Comparison of Distance Metric Learning Methods , 2017, IEEE Transactions on Cybernetics.

[11]  Xiaochen Yang,et al.  Learning Local Metrics and Influential Regions for Classification , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Andrew V. Knyazev,et al.  Toward the Optimal Preconditioned Eigensolver: Locally Optimal Block Preconditioned Conjugate Gradient Method , 2001, SIAM J. Sci. Comput..

[13]  Jeff A. Bilmes,et al.  Label Selection on Graphs , 2009, NIPS.

[14]  Anton van den Hengel,et al.  A Fast Semidefinite Approach to Solving Binary Quadratic Problems , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Chunhua Shen,et al.  Large-Scale Binary Quadratic Optimization Using Semidefinite Relaxation and Applications , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Zhenguo Li,et al.  Pairwise constraint propagation by semidefinite programming for semi-supervised classification , 2008, ICML '08.

[17]  Enrico Magli,et al.  Graph Spectral Image Processing , 2018, Proceedings of the IEEE.

[18]  Stephen P. Boyd,et al.  Conic Optimization via Operator Splitting and Homogeneous Self-Dual Embedding , 2013, Journal of Optimization Theory and Applications.

[19]  Yang Zheng,et al.  Chordal decomposition in operator-splitting methods for sparse semidefinite programs , 2017, Mathematical Programming.

[20]  Gene Cheung,et al.  Graph Laplacian Regularization for Inverse Imaging: Analysis in the Continuous Domain , 2016, ArXiv.

[21]  Pascal Frossard,et al.  Learning Graphs From Data: A Signal Representation Perspective , 2018, IEEE Signal Processing Magazine.

[22]  Ronald R. Coifman,et al.  Multiscale Wavelets on Trees, Graphs and High Dimensional Data: Theory and Applications to Semi Supervised Learning , 2010, ICML.

[23]  Zhi-Quan Luo,et al.  Semidefinite Relaxation of Quadratic Optimization Problems , 2010, IEEE Signal Processing Magazine.

[24]  Gene Cheung,et al.  Robust Semisupervised Graph Classifier Learning With Negative Edge Weights , 2016, IEEE Transactions on Signal and Information Processing over Networks.

[25]  F. Harary,et al.  STRUCTURAL BALANCE: A GENERALIZATION OF HEIDER'S THEORY1 , 1977 .

[26]  Robert J. Vanderbei,et al.  An Interior-Point Method for Semidefinite Programming , 1996, SIAM J. Optim..

[27]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[28]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[29]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[30]  Jorge Nocedal,et al.  Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[31]  Sahin Albayrak,et al.  Spectral Analysis of Signed Graphs for Clustering, Prediction and Visualization , 2010, SDM.

[32]  Pierre Vandergheynst,et al.  Graph Signal Processing: Overview, Challenges, and Applications , 2017, Proceedings of the IEEE.