Clustering Data and Vague Concepts Using Prototype Theory Interpreted Label Semantics

Clustering analysis is well-used in data mining to group a set of observations into clusters according to their similarity, thus, the (dis)similarity measure between observations becomes a key feature for clustering analysis. However, classical clustering analysis algorithms cannot deal with observation contains both data and vague concepts by using traditional distance measures. In this paper, we proposed a novel (dis)similarity measure based on a prototype theory interpreted knowledge representation framework named label semantics. The new proposed measure is used to extend classical K-means algorithm for clustering data instances and the vague concepts represented by logical expressions of linguistic labels. The effectiveness of proposed measure is verified by experimental results on an image clustering problem, this measure can also be extended to cluster data and vague concepts represented by other granularities.

[1]  Jonathan Lawry,et al.  A framework for linguistic modelling , 2004, Artif. Intell..

[2]  Alvy Ray Smith,et al.  Color gamut transform pairs , 1978, SIGGRAPH.

[3]  A R Smith,et al.  Color Gamut Transformation Pairs , 1978 .

[4]  Zengchang Qin,et al.  Uncertainty Modeling for Data Mining , 2014, Advanced Topics in Science and Technology in China.

[5]  Miguel Á. Carreira-Perpiñán,et al.  Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[6]  Khaled Mellouli,et al.  Clustering Approach Using Belief Function Theory , 2006, AIMSA.

[7]  Zengchang Qin,et al.  Uncertainty Modeling for Data Mining: A Label Semantics Approach , 2015 .

[8]  B. F. Momin,et al.  Modifications in K-Means Clustering Algorithm , 2012 .

[9]  Zied Elouedi,et al.  A New Possibilistic Clustering Method: The Possibilistic K-Modes , 2011, AI*IA.

[10]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[11]  Jonathan Lawry,et al.  Uncertainty modelling for vague concepts: A prototype theory approach , 2009, Artif. Intell..

[12]  Weifeng Zhang,et al.  Clustering data and imprecise concepts , 2011, 2011 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2011).