A Framework of Three-Way Cluster Analysis

A new framework of clustering is proposed inspired by the theory of three-way decisions, which is an alternative formulation different from the ones used in the existing studies. The novel three-way representation intuitively shows which objects are fringe to the cluster and it is proposed for dealing with uncertainty clustering. Instead of using two regions to represent a cluster by a single set, a cluster is represented using three regions through a pair of sets, and there are three regions such as the core region, fringe region and trivial region. A cluster is therefore more realistically characterized by a set of core objects and a set of boundary objects. In this paper, we also illustrate an algorithm for incomplete data by using the proposed evaluation-based three-way cluster model. The preliminary experimental results show that the proposed method is effective for clustering incomplete data which is one kind of uncertainty data. Furthermore, this paper reviews some three-way clustering approaches and discusses some future perspectives and potential research topics based on the three-way cluster analysis.

[1]  Min Chen,et al.  Interval set clustering , 2011, Expert Syst. Appl..

[2]  Yiyu Yao,et al.  Three-Way Decisions and Cognitive Computing , 2016, Cognitive Computation.

[3]  Guoyin Wang,et al.  A tree-based incremental overlapping clustering method using the three-way decision theory , 2016, Knowl. Based Syst..

[4]  Yiyu Yao,et al.  Detecting and refining overlapping regions in complex networks with three-way decisions , 2016, Inf. Sci..

[5]  Bing Shi,et al.  Regression-based three-way recommendation , 2017, Inf. Sci..

[6]  Decui Liang,et al.  A novel three-way decision model based on incomplete information system , 2016, Knowl. Based Syst..

[7]  Zeshui Xu,et al.  Three-way decisions with intuitionistic fuzzy decision-theoretic rough sets based on point operators , 2017, Inf. Sci..

[8]  Jingtao Yao,et al.  Gini objective functions for three-way classifications , 2017, Int. J. Approx. Reason..

[9]  Yiyu Yao,et al.  Cost-sensitive three-way email spam filtering , 2013, Journal of Intelligent Information Systems.

[10]  P. Lingras,et al.  Interval clustering using fuzzy and rough set theory , 2004, IEEE Annual Meeting of the Fuzzy Information, 2004. Processing NAFIPS '04..

[11]  James C. Bezdek,et al.  Fuzzy c-means clustering of incomplete data , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[12]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[13]  Vladimir Estivill-Castro,et al.  Why so many clustering algorithms: a position paper , 2002, SKDD.

[14]  Yiyu Yao,et al.  Interval Set Cluster Analysis: A Re-formulation , 2009, RSFDGrC.

[15]  Dawei Li,et al.  Fuzzy clustering of incomplete data based on missing attribute interval size , 2015, 2015 IEEE 9th International Conference on Anti-counterfeiting, Security, and Identification (ASID).

[16]  Guoyin Wang,et al.  A Decision-Theoretic Rough Set Approach for Dynamic Data Mining , 2015, IEEE Transactions on Fuzzy Systems.

[17]  Cheng Wu,et al.  Nearest Neighbor Intervals Based AP Clustering Algorithm for Large Incomplete Data , 2015 .

[18]  Hong Yu,et al.  A Three-Way Decisions Clustering Algorithm for Incomplete Data , 2014, RSKT.

[19]  Huaxiong Li,et al.  Risk Decision Making Based on Decision-theoretic Rough Set: A Three-way View Decision Model , 2011 .

[20]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.

[21]  Nouman Azam,et al.  Analyzing uncertainties of probabilistic rough set regions with game-theoretic rough sets , 2014, Int. J. Approx. Reason..

[22]  Xi Chen,et al.  Cost-Sensitive Three-Way Decisions Model Based on CCA , 2014, RSCTC.

[23]  Ying Wang,et al.  Three-Way Decisions Method for Overlapping Clustering , 2012, RSCTC.

[24]  Yiyu Yao,et al.  An Outline of a Theory of Three-Way Decisions , 2012, RSCTC.

[25]  Decui Liang,et al.  A Novel Risk Decision Making Based on Decision-Theoretic Rough Sets Under Hesitant Fuzzy Information , 2015, IEEE Transactions on Fuzzy Systems.

[26]  Yiyu Yao,et al.  Interval sets and three-way concept analysis in incomplete contexts , 2016, International Journal of Machine Learning and Cybernetics.

[27]  Guoyin Wang,et al.  An automatic method to determine the number of clusters using decision-theoretic rough set , 2014, Int. J. Approx. Reason..

[28]  V. J. Rayward-Smith,et al.  Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition , 1999 .

[29]  Pawan Lingras,et al.  Interval Set Clustering of Web Users with Rough K-Means , 2004, Journal of Intelligent Information Systems.

[30]  Yao Li,et al.  TDUP: an approach to incremental mining of frequent itemsets with three-way-decision pattern updating , 2015, International Journal of Machine Learning and Cybernetics.