Interval clustering using fuzzy and rough set theory

In many data mining applications, use of interval sets to represent clusters can be more appropriate than crisp representations. Interval set representation of a cluster consists of a lower bound and an upper bound. Objects in lower bound are definitely part of the cluster, and only belong to that cluster. Objects in the upper bound are possibly part of that cluster and potentially belong to another cluster. The interval sets make it possible to describe ambiguity in categorizing some of the objects. The interval clusters can be unsupervised counterparts of supervised rough sets. This paper describes two unsupervised algorithms for obtaining interval clusters. First algorithm is an extension of K-means based on properties of rough sets. The second algorithm is an extension of fuzzy C-means clustering. The paper describes conditions under which the fuzzy C-means clustering can lead to interval sets that obey some of the properties of rough sets. An experimental comparison of interval clusters from both the approaches is also provided.

[1]  Pawan Lingras,et al.  Rough set clustering for Web mining , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[2]  Hichem Frigui,et al.  Fuzzy and possibilistic shell clustering algorithms and their application to boundary detection and surface approximation. II , 1995, IEEE Trans. Fuzzy Syst..

[3]  Pawan Lingras,et al.  Interval set clustering of web users using modified Kohonen self-organizing maps based on the properties of rough sets , 2004, Web Intell. Agent Syst..

[4]  James M. Keller,et al.  A possibilistic approach to clustering , 1993, IEEE Trans. Fuzzy Syst..

[5]  Pawan Lingras,et al.  Interval Set Clustering of Web Users with Rough K-Means , 2004, Journal of Intelligent Information Systems.

[6]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[7]  Pawan Lingras,et al.  Unsupervised Rough Set Classification Using GAs , 2001, Journal of Intelligent Information Systems.

[8]  Gao Xinbo,et al.  Parameter optimization in FCM clustering algorithms , 2000, WCC 2000 - ICSP 2000. 2000 5th International Conference on Signal Processing Proceedings. 16th World Computer Congress 2000.

[9]  Andrzej Skowron,et al.  Information Granules in Distributed Environment , 1999, RSFDGrC.

[10]  James C. Bezdek,et al.  Efficient Implementation of the Fuzzy c-Means Clustering Algorithms , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  R.J. Hathaway,et al.  Switching regression models and fuzzy clustering , 1993, IEEE Trans. Fuzzy Syst..

[12]  Anupam Joshi,et al.  Robust Fuzzy Clustering Methods to Support Web Mining , 1998 .