A Clustering Performance Measure Based on Fuzzy Set Decomposition

Clustering is primarily used to uncover the true underlying structure of a given data set and, for this purpose, it is desirable to subject the same data to several different clustering algorithms. This paper attempts to put an order on the various partitions of a data set obtained from different clustering algorithms. The goodness of each partition is expressed by means of a performance measure based on a fuzzy set decomposition of the data set under consideration. Several experiments reported in here show that the proposed performance measure puts an order on different partitions of the same data which is consistent with the error rate of a classifier designed on the basis of the obtained cluster labelings.