Evaluation of a clustering technique based on game theory

In this paper we focus on the task of clustering in data mining applications. We introduce a formulation of a new clustering algorithm by modelling the system as a cooperative game in strategic form using game theory. The goal is to partition a dataset into k clusters. Our approach has been applied to both simulated and real-world datasets. In addition, we have implemented functions based on the calculation of errors to track both similarity of the data within the same cluster and dissimilarity measure of the data elements between different clusters. Experimental results show that our algorithm is capable of providing a comprehensive description of the final solutions and it has good predictive capabilities.

[1]  George Karypis,et al.  C HAMELEON : A Hierarchical Clustering Algorithm Using Dynamic Modeling , 1999 .

[2]  Andrew W. Moore,et al.  Accelerating exact k-means algorithms with geometric reasoning , 1999, KDD '99.

[3]  Huaning Yan Determining the Best K for Clustering Transactional Datasets : A Coverage Density-based Approach , 2008 .

[4]  Jiawei Han,et al.  Efficient and Effective Clustering Methods for Spatial Data Mining , 1994, VLDB.

[5]  Jiong Yang,et al.  STING: A Statistical Information Grid Approach to Spatial Data Mining , 1997, VLDB.

[6]  Pietro Perona,et al.  A Factorization Approach to Grouping , 1998, ECCV.

[7]  Vipin Kumar,et al.  Chameleon: Hierarchical Clustering Using Dynamic Modeling , 1999, Computer.

[8]  Aoying Zhou,et al.  Density-Based Clustering over an Evolving Data Stream with Noise , 2006, SDM.

[9]  Dimitrios Gunopulos,et al.  Automatic subspace clustering of high dimensional data for data mining applications , 1998, SIGMOD '98.

[10]  Duc Truong Pham,et al.  Incorporating Delay and Feedback in Intelligent Manufacturing Strategy Selection , 2006 .

[11]  Charles Elkan,et al.  Using the Triangle Inequality to Accelerate k-Means , 2003, ICML.

[12]  Keke Chen,et al.  Determining the best K , 2009, Data Knowl. Eng..

[13]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.

[14]  Yi Wang,et al.  Combining data mining and Game Theory in manufacturing strategy analysis , 2007, J. Intell. Manuf..

[15]  Samuel,et al.  A Game-Theoretic Framework for Similarity-Based Data Clustering , 2009 .

[16]  Marcello Pelillo,et al.  Dominant Sets and Pairwise Clustering , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.