Boosted similarity learning based on discriminative graphs

Similarity measurement is crucial for unsupervised learning and semi-supervised learning. Unsupervised methods need a similarity to do clustering. Semi-supervised algorithms need a similarity to take advantage of unlabeled data. In this paper, we develop a boosted similarity learning algorithm. Based on the manifold assumption, our similarity is learned iteratively by a few discriminative graphs. So our similarity adopts the local structure information underlying the data. We propose “within graph-cluster scatter Sw” and “between graph-cluster scatter Sb”. Sw and Sb are used to analyze the discrimination of a given graph. Experimental results on both synthetic and public available data sets show that the proposed method outperforms the state-of-the-art approaches.

[1]  Hong Yu,et al.  Local density adaptive similarity measurement for spectral clustering , 2011, Pattern Recognit. Lett..

[2]  Xiaojin Zhu,et al.  Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[3]  Hamideh Afsarmanesh,et al.  Boosting for multiclass semi-supervised learning , 2014, Pattern Recognit. Lett..

[4]  G. Baudat,et al.  Generalized Discriminant Analysis Using a Kernel Approach , 2000, Neural Computation.

[5]  Chih-Cheng Hung,et al.  Semi-supervised multi-class Adaboost by exploiting unlabeled data , 2011, Expert Syst. Appl..

[6]  Ke Chen,et al.  Semi-Supervised Learning via Regularized Boosting Working on Multiple Semi-Supervised Assumptions , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Miguel Á. Carreira-Perpiñán,et al.  Proximity Graphs for Clustering and Manifold Learning , 2004, NIPS.

[8]  Tommy W. S. Chow,et al.  Graph Based Constrained Semi-Supervised Learning Framework via Label Propagation over Adaptive Neighborhood , 2015, IEEE Transactions on Knowledge and Data Engineering.

[9]  Pong C. Yuen,et al.  Similarity learning for semi-supervised multi-class boosting , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[11]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[12]  Pavel Pudil,et al.  Introduction to Statistical Pattern Recognition , 2006 .

[13]  Pong C. Yuen,et al.  Semi-supervised metric learning via topology preserving multiple semi-supervised assumptions , 2013, Pattern Recognit..

[14]  Joachim M. Buhmann,et al.  Path-Based Clustering for Grouping of Smooth Curves and Texture Segmentation , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Yi Liu,et al.  SemiBoost: Boosting for Semi-Supervised Learning , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Michael I. Jordan,et al.  Learning Spectral Clustering , 2003, NIPS.

[17]  Yongdong Zhang,et al.  On defining affinity graph for spectral clustering through ranking on manifolds , 2009, Neurocomputing.

[18]  Rong Jin,et al.  Semi-Supervised Boosting for Multi-Class Classification , 2008, ECML/PKDD.