A robust semi-supervised EM-based clustering algorithm with a reject option

In this paper we address the problem of semi-supervision in the framework of parametric clustering by using labeled and unlabeled data together Clustering algorithms can take advantage from few labeled instances in order to tune parameters, improve convergence and overcome local extrema due to bad initialization. We extend a robust parametric clustering algorithm able to manage outlier rejection to the semi-supervision approach. This is achieved by modifying the expectation-maximization algorithm. The proposed method shows good performance with respect to data structure discovering, even facing to outliers.

[1]  Man Ieee Systems,et al.  IEEE transactions on systems, man and cybernetics. Part B, Cybernetics , 1996 .

[2]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[3]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[4]  Christophe Saint-Jean,et al.  Clustering with EM: Complex Models vs. Robust Estimation , 2000, SSPR/SPR.

[5]  Ayhan Demiriz,et al.  Semi-Supervised Support Vector Machines , 1998, NIPS.

[6]  James C. Bezdek,et al.  Improved semi-supervised point-prototype clustering algorithms , 1998, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228).

[7]  B. Ripley,et al.  Robust Statistics , 2018, Wiley Series in Probability and Statistics.

[8]  Amine Bensaid,et al.  Semi-Supervised Hierarchical Clustering Algorithms , 1997, SCAI.

[9]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[10]  Witold Pedrycz,et al.  Fuzzy clustering with partial supervision , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[11]  G. Celeux,et al.  A Classification EM algorithm for clustering and two stochastic versions , 1992 .

[12]  James C. Bezdek,et al.  Semi-supervised Point Prototype Clustering , 1998, Int. J. Pattern Recognit. Artif. Intell..

[13]  James C. Bezdek,et al.  Partially supervised clustering for image segmentation , 1996, Pattern Recognit..