Bayesian distance metric learning for discriminative fuzzy c-means clustering

Abstract A great number of machine learning algorithms strongly depend on the underlying distance metric for representing the important correlations of input data. Distance metric learning is defined as learning an appropriate similarity or distance metric for all input data pairs. Metric learning algorithms are of supervised and unsupervised categories with different deterministic and probabilistic approaches. One of the objectives of unsupervised metric learning is to project data points into a new space in such a way that high clustering accuracy is provided. This is obtainable by maximizing between-clusters separation. There exist some deterministic metric learning methods to serve this purpose. In this article, a probabilistic method for unsupervised distance metric learning is proposed which aims to maximize the separability among different clusters in the projected space. In this proposed method, distance metric learning and fuzzy c-means clustering are jointly formulated in a sense that FCM provides clusters, and distance metric learning algorithm applies the obtained clusters to materialize the maximum separability among all; moreover, Markov Chain Monte Carlo (MCMC) algorithm is applied to infer the latent variables. This proposed method, not only can obtain a low dimensional projection with specified number of dimensions, but also it can learn the proper number of reduced dimensions for each dataset in an automated sense. The experimental results reveal the out-performance of this method on different real-world datasets against its counterparts.

[1]  David J. Spiegelhalter,et al.  Introducing Markov chain Monte Carlo , 1995 .

[2]  I. Jolliffe Principal Component Analysis and Factor Analysis , 1986 .

[3]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[4]  Jianyu Yang,et al.  Metric learning based object recognition and retrieval , 2016, Neurocomputing.

[5]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[6]  Nan Jiang,et al.  Individual adaptive metric learning for visual tracking , 2016, Neurocomputing.

[7]  Neil D. Lawrence,et al.  Gaussian Process Latent Variable Models for Visualisation of High Dimensional Data , 2003, NIPS.

[8]  Yadong Wang,et al.  Improving fuzzy c-means clustering based on feature-weight learning , 2004, Pattern Recognit. Lett..

[9]  Cheng Wu,et al.  Discriminative clustering via extreme learning machine , 2015, Neural Networks.

[10]  Jieping Ye,et al.  Adaptive Distance Metric Learning for Clustering , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[12]  Lei Zhu,et al.  Unsupervised neighborhood component analysis for clustering , 2015, Neurocomputing.

[13]  Fei Wang,et al.  Survey on distance metric learning and dimensionality reduction in data mining , 2014, Data Mining and Knowledge Discovery.

[14]  Bo Du,et al.  LAM3L: Locally adaptive maximum margin metric learning for visual data classification , 2017, Neurocomputing.

[15]  Brian Kulis,et al.  Metric Learning: A Survey , 2013, Found. Trends Mach. Learn..

[16]  Feiping Nie,et al.  Discriminative Embedded Clustering: A Framework for Grouping High-Dimensional Data , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Paul D. Gader,et al.  Bayesian Fuzzy Clustering , 2015, IEEE Transactions on Fuzzy Systems.