Agent Based Frequent Set Meta Mining: Introducing EMADS

In this paper we: introduce EMADS, the Extendible Multi-Agent Data mining System, to support the dynamic creation of communities of data mining agents; explore the capabilities of such agents and demonstrate (by experiment) their application to data mining on distributed data. Although, EMADS is not restricted to one data mining task, the study described here, for the sake of brevity, concentrates on agent based Association Rule Mining (ARM), in particular what we refer to as frequent set meta mining (or Meta ARM). A full description of our proposed Meta ARM model is presented where we describe the concept of Meta ARM and go on to describe and analyse a number of potential solutions in the context of EMADS. Experimental results are considered in terms of: the number of data sources, the number of records in the data sets and the number of attributes represented.

[1]  Philip S. Yu,et al.  A Condensation Approach to Privacy Preserving Data Mining , 2004, EDBT.

[2]  Frans Coenen,et al.  Tree Structures for Mining Association Rules , 2004, Data Mining and Knowledge Discovery.

[3]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[4]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[5]  Rüdiger Schollmeier,et al.  A definition of peer-to-peer networking for the classification of peer-to-peer architectures and applications , 2001, Proceedings First International Conference on Peer-to-Peer Computing.

[6]  Frans Coenen,et al.  Frequent Set Meta Mining: Towards Multi-Agent Data Mining , 2007, SGAI Conf..

[7]  Yike Guo,et al.  Probing Knowledge in Distributed Data Mining , 1999, PAKDD.

[8]  Ramakrishnan Srikant,et al.  The Quest Data Mining System , 1996, KDD.

[9]  H. Sivakumar,et al.  Papyrus: A System for Data Mining over Local and Wide Area Clusters and Super-Clusters , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[10]  Agostino Poggi,et al.  Jade - a fipa-compliant agent framework , 1999 .

[11]  Salvatore J. Stolfo,et al.  JAM: Java Agents for Meta-Learning over Distributed Databases , 1997, KDD.

[12]  AgrawalRakesh,et al.  Mining association rules between sets of items in large databases , 1993 .

[13]  Y. Shoham Introduction to Multi-Agent Systems , 2002 .

[14]  Frans Coenen,et al.  Algorithms for computing association rules using a partial-support tree , 2000, Knowl. Based Syst..

[15]  Jiawei Han,et al.  Generalization and decision tree induction: efficient classification in data mining , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[16]  Ilker Hamzaoglu,et al.  Scalable, Distributed Data Mining - An Agent Architecture , 1997, KDD.

[17]  Philip K. Chan,et al.  Advances in Distributed and Parallel Knowledge Discovery , 2000 .