Evaluating and Comparing Soft Partitions: An Approach Based on Dempster–Shafer Theory

In evidential clustering, cluster-membership uncertainty is represented by Dempster–Shafer mass functions. The notion of evidential partition generalizes other soft clustering structures such as fuzzy, possibilistic, or rough partitions. In this paper, we propose two extensions of the Rand index for evaluating and comparing evidential partitions, called similarity and consistency indices. The similarity index is suitable for measuring the closeness of two soft partitions, whereas the consistency index allows one to assess the agreement, or lack of conflict, between a soft partition and the true hard partition. Simulation experiments illustrate some applications of these indices.

[1]  Adrian E. Raftery,et al.  mclust Version 4 for R : Normal Mixture Modeling for Model-Based Clustering , Classification , and Density Estimation , 2012 .

[2]  James M. Keller,et al.  A possibilistic approach to clustering , 1993, IEEE Trans. Fuzzy Syst..

[3]  Anupam Joshi,et al.  Low-complexity fuzzy relational clustering algorithms for Web mining , 2001, IEEE Trans. Fuzzy Syst..

[4]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[5]  Olivier Colot,et al.  Introducing spatial neighbourhood in Evidential C-Means for segmentation of multi-source images: Application to prostate multi-parametric MRI , 2014, Inf. Fusion.

[6]  Hichem Frigui,et al.  Clustering and aggregation of relational data with applications to image database categorization , 2007, Pattern Recognit..

[7]  James M. Keller,et al.  Comparing Fuzzy, Probabilistic, and Possibilistic Partitions , 2010, IEEE Transactions on Fuzzy Systems.

[8]  Anne-Laure Jousselme,et al.  A proof for the positive definiteness of the Jaccard index matrix , 2013, Int. J. Approx. Reason..

[9]  Thierry Denoeux,et al.  EK-NNclus: A clustering procedure based on the evidential K-nearest neighbor rule , 2015, Knowl. Based Syst..

[10]  Eyke Hüllermeier,et al.  Comparing Fuzzy Partitions: A Generalization of the Rand Index and Related Measures , 2012, IEEE Transactions on Fuzzy Systems.

[11]  Quan Pan,et al.  Median evidential c-means algorithm and its application to community detection , 2015, Knowl. Based Syst..

[12]  Richard Weber,et al.  Soft clustering - Fuzzy and rough approaches and their extensions and derivatives , 2013, Int. J. Approx. Reason..

[13]  G. Klir,et al.  Uncertainty-based information: Elements of generalized information theory (studies in fuzziness and soft computing). , 1998 .

[14]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[15]  Ricardo J. G. B. Campello,et al.  A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment , 2007, Pattern Recognit. Lett..

[16]  Anbupalam Thalamuthu,et al.  Gene expression Evaluation and comparison of gene clustering methods in microarray analysis , 2006 .

[17]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[18]  Georg Peters,et al.  Is there any need for rough clustering? , 2015, Pattern Recognit. Lett..

[19]  Thierry Denoeux,et al.  ECM: An evidential version of the fuzzy c , 2008, Pattern Recognit..

[20]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[21]  Thierry Denoeux,et al.  EVCLUS: evidential clustering of proximity data , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[22]  D. Dubois,et al.  A NOTE ON MEASURES OF SPECIFICITY FOR FUZZY SETS , 1985 .

[23]  Thierry Denoeux,et al.  Fusion of multi-tracer PET images for dose painting , 2014, Medical Image Anal..

[24]  Paolo Giordani,et al.  A toolbox for fuzzy clustering using the R programming language , 2015, Fuzzy Sets Syst..

[25]  Thierry Denoeux,et al.  Evidential clustering of large dissimilarity data , 2016, Knowl. Based Syst..

[26]  Thierry Denoeux,et al.  RECM: Relational evidential c-means algorithm , 2009, Pattern Recognit. Lett..

[27]  Anne-Laure Jousselme,et al.  Distances in evidence theory: Comprehensive survey and generalizations , 2012, Int. J. Approx. Reason..

[28]  Thierry Denoeux,et al.  Beyond Fuzzy, Possibilistic and Rough: An Investigation of Belief Functions in Clustering , 2016, SMPS.

[29]  Pawan Lingras,et al.  Applying Rough Set Concepts to Clustering , 2012 .

[30]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[31]  Miin-Shen Yang,et al.  A Robust Automatic Merging Possibilistic Clustering Method , 2011, IEEE Transactions on Fuzzy Systems.

[32]  Roelof K. Brouwer Extending the rand, adjusted rand and jaccard indices to fuzzy partitions , 2008, Journal of Intelligent Information Systems.

[33]  Georg Peters,et al.  Rough clustering utilizing the principle of indifference , 2014, Inf. Sci..

[34]  Noureddine Zerhouni,et al.  Evidential evolving Gustafson-Kessel algorithm for online data streams partitioning using belief function theory , 2012, Int. J. Approx. Reason..

[35]  A. Ramer Uniqueness of information measure in the theory of evidence , 1987 .

[36]  Graham R. Wood,et al.  A multi-stage approach to clustering and imputation of gene expression profiles , 2007, Bioinform..

[37]  Thierry Denoeux A k -Nearest Neighbor Classification Rule Based on Dempster-Shafer Theory , 2008, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[38]  J. Bezdek Numerical taxonomy with fuzzy sets , 1974 .

[39]  Thierry Denoeux,et al.  Inner and Outer Approximation of Belief Structures Using a Hierarchical Clustering Approach , 2001, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[40]  Éloi Bossé,et al.  A new distance between two bodies of evidence , 2001, Inf. Fusion.