Clustering techniques are unsupervised learning methods that group similar data objects together and separate them from dissimilar ones. They are therefore popular for data mining and pattern recognition. Their performance, however, is data dependent, so choosing the right clustering technique for a given dataset remains a research challenge. In this paper, we test the performance of a soft clustering technique, Fuzzy C-means (FCM), and a hard clustering technique, K-means (KM), on the Iris (150 x 4), Wine (178 x 13) and Lens (24 x 4) datasets. The distance measure, which quantifies the similarity between any two data points, is at the heart of any clustering algorithm. Two distance measures, Manhattan (MH) and Euclidean (ED), are used to observe how they influence overall clustering performance. Performance is compared on seven parameters: (i) sensitivity, (ii) specificity, (iii) precision, (iv) accuracy, (v) run time, (vi) average intra-cluster distance (i.e., compactness of the clusters) and (vii) inter-cluster distance (i.e., distinctiveness of the clusters). Based on the experimental results, the paper concludes that both KM and FCM perform well. However, KM outperforms FCM in terms of speed. The FCM-MH combination produces the most compact clusters, while the KM-ED combination yields the most distinct clusters.
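The two distance measures and the two cluster-quality parameters named above can be illustrated with a minimal sketch. The abstract does not give the exact definitions used in the paper, so the centroid-based formulas below are an assumption (mean point-to-own-centroid distance for compactness, mean centroid-to-centroid distance for distinctiveness). All function names and the toy data are hypothetical.

```python
import math
from itertools import combinations


def manhattan(a, b):
    """Manhattan (MH) distance: sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def euclidean(a, b):
    """Euclidean (ED) distance: square root of summed squared differences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def centroid(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def avg_intra_cluster_distance(clusters, dist):
    """Assumed compactness measure: mean distance from each point
    to the centroid of its own cluster (lower means more compact)."""
    total, count = 0.0, 0
    for pts in clusters:
        c = centroid(pts)
        for p in pts:
            total += dist(p, c)
            count += 1
    return total / count


def avg_inter_cluster_distance(clusters, dist):
    """Assumed distinctiveness measure: mean distance between all
    pairs of cluster centroids (higher means more distinct)."""
    cents = [centroid(pts) for pts in clusters]
    pairs = list(combinations(cents, 2))
    return sum(dist(a, b) for a, b in pairs) / len(pairs)


# Toy hard partition with two clusters (illustrative data, not from the paper).
clusters = [
    [(0.0, 0.0), (0.0, 2.0)],
    [(3.0, 4.0), (3.0, 6.0)],
]

for name, dist in (("MH", manhattan), ("ED", euclidean)):
    intra = avg_intra_cluster_distance(clusters, dist)
    inter = avg_inter_cluster_distance(clusters, dist)
    print(f"{name}: intra={intra:.3f} inter={inter:.3f}")
```

The sketch applies to a hard partition such as the one KM produces. For FCM, one would first have to assign each point to a cluster, for example by its highest membership value, which is again an assumption rather than the paper's stated procedure.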