An Agent Oriented Approach for Implementation of the Range Method of Initial Centroids in K-Means Clustering Data Mining Algorithm

Intelligent agents are today accepted as powerful tools for data mining in a distributed environment. Artificial Intelligence (AI) algorithms are utilized to render software agents more intelligent. Knowledge discovery and data mining methods have been applied to discover hidden patterns and relations in complex datasets where clustering is one of the most important approaches used for this purpose. There exists a number of clustering algorithms among which the K-means algorithm is commonly used to find clusters due to its simplicity of implementation and fast execution. It appears extensively in the machine learning literature and in most data mining suite of tools. The algorithm is significantly sensitive to the selection of initial centroids. In this paper, we will present an agent oriented approach for implementation of the Range Method of initial centroids in K-means data mining algorithm. This Range Method is based on the actual sample datapoints. We have tested this method with both Euclidean and City Block (Manhattan) distances formulae on a different number of real life datasets.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  Matjaz Gams,et al.  An agent version of a cluster server , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[3]  Yoav,et al.  Agent Oriented Programming an overview of the framework and summary of recent research , 2005 .

[4]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[5]  Masoud Mohammadian Intelligent Agents for Data Mining and Information Retrieval , 2004 .

[6]  I. Davidson Understanding K-Means Non-hierarchical Clustering , 2002 .

[7]  H. Mannila,et al.  Data mining: machine learning, statistics, and databases , 1996, Proceedings of 8th International Conference on Scientific and Statistical Data Base Management.

[8]  Boriana L. Milenova Clustering Large Databases with Numeric and Nominal Values Using Orthogonal Projections , 2004 .

[9]  Bjorn Hermans,et al.  Intelligent software agents on the internet: An inventory of currently offered functionality in the , 1996 .

[10]  D. Edwards Data Mining: Concepts, Models, Methods, and Algorithms , 2003 .

[11]  Glenn Fung,et al.  A Comprehensive Overview of Basic Clustering Algorithms , 2001 .

[12]  George Cybenko,et al.  Mobile Agents: Motivations and State-of-the-Art Systems , 2000 .

[13]  Alison Cawsey,et al.  The essence of artificial intelligence , 1997 .

[14]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[15]  Joseph P. Bigus The agent building and learning environment , 2000, AGENTS '00.

[16]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.