Research of semi-supervised spectral clustering algorithm based on pairwise constraints

Clustering is often considered as an unsupervised data analysis method, but making full use of the prior information in the process of clustering will significantly improve the performance of the clustering algorithm. Spectral clustering algorithm can well use the prior pairwise constraint information to cluster and has become a new hot spot of machine learning research in recent years. In this paper, we propose an effective clustering algorithm, called a semi-supervised spectral clustering algorithm based on pairwise constraints, in which the similarity matrix of data points is adjusted and optimized by pairwise constraints. The experiments on real-world data sets demonstrate the effectiveness of this algorithm.

[1]  Shifei Ding,et al.  Research on Spectral Clustering algorithms and prospects , 2010, 2010 2nd International Conference on Computer Engineering and Technology.

[2]  Yu Xiao,et al.  Semi-Supervised Clustering Based on Affinity Propagation Algorithm: Semi-Supervised Clustering Based on Affinity Propagation Algorithm , 2009 .

[3]  Dan Klein,et al.  From Instance-level Constraints to Space-Level Constraints: Making the Most of Prior Knowledge in Data Clustering , 2002, ICML.

[4]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  S. S. Ravi,et al.  The complexity of non-hierarchical clustering with instance and cluster level constraints , 2007, Data Mining and Knowledge Discovery.

[6]  Xianchao Zhang,et al.  An improved spectral clustering algorithm based on random walk , 2011, Frontiers of Computer Science in China.

[7]  Ling Wang,et al.  Density-Sensitive Semi-Supervised Spectral Clustering , 2007 .

[8]  Claire Cardie,et al.  Clustering with Instance-Level Constraints , 2000, AAAI/IAAI.

[9]  Marie desJardins,et al.  Constrained Spectral Clustering under a Local Proximity Structure Assumption , 2005, FLAIRS.

[10]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[11]  Licheng Jiao,et al.  Spectral clustering with fuzzy similarity measure , 2011, Digit. Signal Process..

[12]  Qian Yun-tao Semi-supervised clustering based on spectral clustering , 2005 .

[13]  Joshua M. Lewis,et al.  Multi-view kernel construction , 2010, Machine Learning.

[14]  Ivor W. Tsang,et al.  Spectral Embedded Clustering: A Framework for In-Sample and Out-of-Sample Spectral Clustering , 2011, IEEE Transactions on Neural Networks.

[15]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[16]  Asli Çelikyilmaz,et al.  Soft-Link Spectral Clustering for Information Extraction , 2009, 2009 IEEE International Conference on Semantic Computing.

[17]  Yang Li-bin,et al.  Survey on Spectral Clustering Algorithms , 2008 .

[18]  Hongfei Lin,et al.  An improved spectral clustering algorithm based on local neighbors in kernel space , 2011, Comput. Sci. Inf. Syst..

[19]  M. Fiedler Algebraic connectivity of graphs , 1973 .

[20]  Y. Jiang,et al.  Spectral Clustering on Multiple Manifolds , 2011, IEEE Transactions on Neural Networks.

[21]  Jitendra Malik,et al.  Contour and Texture Analysis for Image Segmentation , 2001, International Journal of Computer Vision.

[22]  Xiao-Feng Lei,et al.  An Efficient Clustering Algorithm Based on Local Optimality of K -Means: An Efficient Clustering Algorithm Based on Local Optimality of K -Means , 2008 .

[23]  Edward Y. Chang,et al.  Parallel Spectral Clustering in Distributed Systems , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Yair Weiss,et al.  Segmentation using eigenvectors: a unifying view , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[25]  Gu Shi Research on Spectral Clustering in Machine Learning , 2007 .

[26]  Li Kun Some Developments on Semi-Supervised Clustering , 2009 .

[27]  Li Xia Active Semi-supervised Spectral Clustering Based on Pairwise Constraints , 2010 .

[28]  Yin Xue,et al.  Discriminative Semi-Supervised Clustering Analysis with Pairwise Constraints , 2008 .

[29]  Lin Fan,et al.  An Efficient Clustering Algorithm Based on Local Optimality of K -Means: An Efficient Clustering Algorithm Based on Local Optimality of K -Means , 2008 .

[30]  Shi Zhong,et al.  Semi-supervised model-based document clustering: A comparative study , 2006, Machine Learning.