Signed Graph Metric Learning via Gershgorin Disc Alignment

Given a convex and differentiable objective $Q(\M)$ for a real, symmetric matrix $\M$ in the positive definite (PD) cone---used to compute Mahalanobis distances---we propose a fast general metric learning framework that is entirely projection-free. We first assume that $\M$ resides in a space $\cS$ of generalized graph Laplacian matrices (graph metric matrices) corresponding to balanced signed graphs. Unlike low-rank metric matrices common in the literature, $\cS$ includes the important diagonal-only matrices as a special case. The key theorem to circumvent full eigen-decomposition and enable fast metric matrix optimization is Gershgorin disc alignment (GDA): given graph metric matrix $\M \in \cS$ and diagonal matrix $§$, where $S_{ii} = 1/v_i$ and $\v$ is the first eigenvector of $\M$, we prove that Gershgorin disc left-ends of similar transform $\B = §\M §^{-1}$ are perfectly aligned at the smallest eigenvalue $\lambda_{\min}$. Using this theorem, we replace the PD cone constraint in the metric learning problem with tightest possible linear constraints per iteration, so that the alternating optimization of the diagonal / off-diagonal terms in $\M$ can be solved efficiently as linear programs via Frank-Wolfe iterations. We update $\v$ using Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) with warm start as matrix entries in $\M$ are optimized successively. Experiments show that our graph metric optimization is significantly faster than cone-projection methods, and produces competitive binary classification performance.

[1]  Lorenzo Torresani,et al.  Large Margin Component Analysis , 2006, NIPS.

[2]  Xiang Zhang,et al.  Metric Learning from Relative Comparisons by Minimizing Squared Residual , 2012, 2012 IEEE 12th International Conference on Data Mining.

[3]  Xianming Liu,et al.  Random Walk Graph Laplacian-Based Smoothness Prior for Soft Decoding of JPEG Images , 2016, IEEE Transactions on Image Processing.

[4]  Heng Huang,et al.  Matrix Variate Gaussian Mixture Distribution Steered Robust Metric Learning , 2018, AAAI.

[5]  Gene Cheung,et al.  Graph Laplacian Regularization for Image Denoising: Analysis in the Continuous Domain , 2016, IEEE Transactions on Image Processing.

[6]  Jiwen Lu,et al.  Discriminative Deep Metric Learning for Face Verification in the Wild , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  B. Scholkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[8]  Amir Globerson,et al.  Metric Learning by Collapsing Classes , 2005, NIPS.

[9]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[10]  Yann LeCun,et al.  Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11]  William J. Cook,et al.  Combinatorial optimization , 1997 .

[12]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[13]  Fei Wang,et al.  Feature Extraction by Maximizing the Average Neighborhood Margin , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Suvrit Sra,et al.  Geometric Mean Metric Learning , 2016, ICML.

[15]  Thorsten Joachims,et al.  Learning a Distance Metric from Relative Comparisons , 2003, NIPS.

[16]  Pierre Vandergheynst,et al.  Graph Signal Processing: Overview, Challenges, and Applications , 2017, Proceedings of the IEEE.

[17]  Xuelong Li,et al.  Parameter Free Large Margin Nearest Neighbor for Distance Metric Learning , 2017, AAAI.

[18]  Wen Gao,et al.  Graph-Based Joint Dequantization and Contrast Enhancement of Poorly Lit JPEG Images , 2019, IEEE Transactions on Image Processing.

[19]  Yuan Shi,et al.  Sparse Compositional Metric Learning , 2014, AAAI.

[20]  Andreas E. Savakis,et al.  Manifold Modeling with Learned Distance in Random Projection Space for Face Recognition , 2010, 2010 20th International Conference on Pattern Recognition.

[21]  Leif Kobbelt,et al.  Identifying Style of 3D Shapes using Deep Metric Learning , 2016, Comput. Graph. Forum.

[22]  Jiwen Lu,et al.  Deep Metric Learning for Visual Understanding: An Overview of Recent Advances , 2017, IEEE Signal Processing Magazine.

[23]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[24]  Stephen P. Boyd,et al.  Proximal Algorithms , 2013, Found. Trends Optim..

[25]  Harvey Mudd,et al.  Irreducible Graphs , 2005 .

[26]  Cong Li,et al.  Reduced-Rank Local Distance Metric Learning , 2013, ECML/PKDD.

[27]  Peng Li,et al.  Distance Metric Learning with Eigenvalue Optimization , 2012, J. Mach. Learn. Res..

[28]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Martin Jaggi,et al.  Revisiting Frank-Wolfe: Projection-Free Sparse Convex Optimization , 2013, ICML.

[30]  Xiaogang Wang,et al.  Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.

[31]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[32]  Bernard De Baets,et al.  Supervised distance metric learning through maximization of the Jeffrey divergence , 2017, Pattern Recognit..

[33]  Xiaochen Yang,et al.  Learning Local Metrics and Influential Regions for Classification , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Silvio Savarese,et al.  Deep Metric Learning via Lifted Structured Feature Embedding , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Andrew V. Knyazev,et al.  Toward the Optimal Preconditioned Eigensolver: Locally Optimal Block Preconditioned Conjugate Gradient Method , 2001, SIAM J. Sci. Comput..

[36]  Jure Leskovec,et al.  Signed networks in social media , 2010, CHI.

[37]  Richard D. Green,et al.  Object Recognition by Stochastic Metric Learning , 2014, SEAL.

[38]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[39]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Gang Wang,et al.  Multi-manifold deep metric learning for image set classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Shengcai Liao,et al.  Embedding Deep Metric for Person Re-identification: A Study Against Large Variations , 2016, ECCV.

[42]  Yang Song,et al.  Learning Fine-Grained Image Similarity with Deep Ranking , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Tat-Seng Chua,et al.  An efficient sparse metric learning in high-dimensional space via l1-penalized log-determinant regularization , 2009, ICML '09.

[44]  Vivek K Goyal,et al.  Foundations of Signal Processing , 2014 .

[45]  Alexandros Kalousis,et al.  Parametric Local Metric Learning for Nearest Neighbor Classification , 2012, NIPS.

[46]  Amaury Habrard,et al.  Regressive Virtual Metric Learning , 2015, NIPS.

[47]  Wen Gao,et al.  Graph-Based Blind Image Deblurring From a Single Photograph , 2018, IEEE Transactions on Image Processing.

[48]  Wei Hu,et al.  Graph Metric Learning via Gershgorin Disc Alignment , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[49]  Enrico Magli,et al.  Graph Spectral Image Processing , 2018, Proceedings of the IEEE.

[50]  Rongrong Ji,et al.  Low-Rank Similarity Metric Learning in High Dimensions , 2015, AAAI.

[51]  Kavita Bala,et al.  Learning visual similarity for product design with convolutional neural networks , 2015, ACM Trans. Graph..

[52]  Gert R. G. Lanckriet,et al.  Robust Structural Metric Learning , 2013, ICML.

[53]  Jun Li,et al.  Data-Adaptive Metric Learning with Scale Alignment , 2019, AAAI.

[54]  Peter F. Stadler,et al.  Nodal Domain Theorems and Bipartite Subgraphs , 2005 .

[55]  M. R. Rao,et al.  Combinatorial Optimization , 1992, NATO ASI Series.

[56]  Kihyuk Sohn,et al.  Improved Deep Metric Learning with Multi-class N-pair Loss Objective , 2016, NIPS.

[57]  Lijun Zhang,et al.  Efficient Stochastic Optimization for Low-Rank Distance Metric Learning , 2017, AAAI.

[58]  Yuanchao Bai,et al.  Reconstruction-cognizant Graph Sampling Using Gershgorin Disc Alignment , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[59]  P. Mahalanobis On the generalized distance in statistics , 1936 .

[60]  Jiwen Lu,et al.  Deep transfer metric learning , 2015, CVPR.

[61]  Feng Zhou,et al.  Fine-Grained Categorization and Dataset Bootstrapping Using Deep Metric Learning with Humans in the Loop , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[62]  Chunheng Wang,et al.  Deep nonlinear metric learning with independent subspace analysis for face verification , 2012, ACM Multimedia.

[63]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[64]  Yadong Mu,et al.  Fixed-Rank Supervised Metric Learning on Riemannian Manifold , 2016, AAAI.

[65]  R. Varga Geršgorin And His Circles , 2004 .

[66]  F. Harary,et al.  STRUCTURAL BALANCE: A GENERALIZATION OF HEIDER'S THEORY1 , 1977 .

[67]  Wei Hu,et al.  Fast Graph Metric Learning via Gershgorin Disc Alignment , 2020, ArXiv.

[68]  Gene Cheung,et al.  Alternating Binary Classifier and Graph Learning from Partial Labels , 2018, 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).

[69]  Wei Hu,et al.  Feature Graph Learning for 3D Point Cloud Denoising , 2019, IEEE Transactions on Signal Processing.

[70]  Cheng Deng,et al.  Robust Metric Learning on Grassmann Manifolds with Generalization Guarantees , 2019, AAAI.

[71]  Nir Ailon,et al.  Deep Metric Learning Using Triplet Network , 2014, SIMBAD.

[72]  Yuanchao Bai,et al.  Fast Graph Sampling Set Selection Using Gershgorin Disc Alignment , 2019, IEEE Transactions on Signal Processing.

[73]  Byoung-Tak Zhang,et al.  Generative Local Metric Learning for Nearest Neighbor Classification , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.