Silhouette index for determining optimal k-means clustering on images in different color models

Clustering process is an essential part of the image processing. Its aim to group the data according to having the same attributes or similarities of the images. Consequently, determining the number of the optimum clusters or the best (well-clustered) for the image in different color models is very crucial. This is because the cluster validation is fundamental in the process of clustering and it reflects the split between clusters. In this study, the k-means algorithm was used on three colors model: CIE Lab, RGB and HSV and the clustering process made up to k clusters. Next, the Silhouette Index (SI) is used to the cluster validation process, and this value is range between 0 to 1 and the greater value of SI illustrates the best of cluster separation. The results from several experiments show that the best cluster separation occurs when k=2 and the value of average SI is inversely proportional to the number of k cluster for all color model. The result shows in HSV color model the average SI decreased 14.11% from k = 2 to k = 8, 11.1% in HSV color model and 16.7% in CIE Lab color model. Comparisons are also made for the three color models and generally the best cluster separation is found within HSV, followed by the RGB and CIE Lab color models.

[1]  Kemal Polat,et al.  Brain MRI Segmentation based on Different Clustering Algorithms , 2016 .

[2]  Ujjwal Maulik,et al.  Performance Evaluation of Some Clustering Algorithms and Validity Indices , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Eréndira Rendón,et al.  Internal versus External cluster validation indexes , 2011 .

[4]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[5]  Shiv Ram Dubey,et al.  Infected Fruit Part Detection using K-Means Clustering Segmentation Technique , 2013, Int. J. Interact. Multim. Artif. Intell..

[6]  Hui Zhao,et al.  Feature Analysis Based on Edge Extraction and Median Filtering for CBIR , 2009, 2009 11th International Conference on Computer Modelling and Simulation.

[7]  S. Sathappan,et al.  A novel approach for content based image retrieval using hybrid filter techniques , 2013, 2013 8th International Conference on Computer Science & Education.

[8]  Nikhil R. Pal,et al.  Cluster validation using graph theoretic concepts , 1997, Pattern Recognit..

[9]  Frans Coenen,et al.  Best Clustering Configuration Metrics: Towards Multiagent Based Clustering , 2010, ADMA.

[10]  Richard Szeliski,et al.  Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.

[11]  Nancy M. Salem,et al.  Segmentation of white blood cells from microscopic images using K-means clustering , 2014, 2014 31st National Radio Science Conference (NRSC).

[12]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[13]  Hui Xiong,et al.  Understanding of Internal Clustering Validation Measures , 2010, 2010 IEEE International Conference on Data Mining.

[14]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[15]  Haiyan Qiao,et al.  A Data Clustering Tool with Cluster Validity Indices , 2009, 2009 International Conference on Computing, Engineering and Information.

[16]  Humera Tariq,et al.  K-Means Cluster Analysis for Image Segmentation , 2014 .

[17]  Ricardo J. G. B. Campello,et al.  Relative clustering validity criteria: A comparative overview , 2010, Stat. Anal. Data Min..

[18]  Vitoantonio Bevilacqua,et al.  Face Detection by Means of Skin Detection , 2008, ICIC.

[19]  Yambem Jina Chanu,et al.  Image Segmentation Using K -means Clustering Algorithm and Subtractive Clustering Algorithm , 2015 .

[20]  V. Seenivasagam,et al.  Color image segmentation using feedforward neural networks with FCM , 2016, Int. J. Autom. Comput..

[21]  Olatz Arbelaitz,et al.  An extensive comparative study of cluster validity indices , 2013, Pattern Recognit..

[22]  N. Anbazhagan,et al.  An Effective Method of Image Retrieval using Image Mining Techniques , 2010, ArXiv.

[23]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[24]  Pokkuluri Kiran Sree,et al.  Face Detection from still and Video Images using Unsupervised Cellular Automata with K means clustering algorithm , 2013, ArXiv.