Using Distributed Data Mining and Distributed Artificial Intelligence for Knowledge Integration

In this paper we study Distributed Data Mining from a Distributed Artificial Intelligence perspective. Very often, databases are very large to be mined. Then Distributed Data Mining can be used for discovering knowledge (rule sets) generated from parts of the entire training data set. This process requires cooperation and coordination between the processors because incon-sistent, incomplete and useless knowledge can be generated, since each processor uses partial data. Cooperation and coordination are important issues in Distributed Artificial Intelligence and can be accomplished with different techniques: planning (centralized, partially distributed and distributed), negotiation, reaction, etc. In this work we discuss a coordination protocol for cooperative learning agents of a MAS developed previously, comparing it conceptually with other learning systems. This cooperative process is hierarchical and works under the coordination of a manager agent. The proposed model aims to select the best rules for integration into the global model without, however, decreasing its accuracy rate. We have also done experiments comparing accuracy and complexity of the knowledge generated by the cooperative agents.

[1]  Hillol Kargupta,et al.  Distributed Data Mining: Algorithms, Systems, and Applications , 2003 .

[2]  Philip K. Chan,et al.  Advances in Distributed and Parallel Knowledge Discovery , 2000 .

[3]  Nicholas R. Jennings,et al.  A Roadmap of Agent Research and Development , 2004, Autonomous Agents and Multi-Agent Systems.

[4]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[5]  Fabrício Enembreck,et al.  Multiagent-Based Model Integration , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops.

[6]  Thies Wittig,et al.  ARCHON: an architecture for multi-agent systems , 1992 .

[7]  Herna L. Viktor,et al.  Combining data mining and human expertise for making decisions, sense and policies , 2000 .

[8]  Simon Parsons,et al.  Principles of Data Mining by David J. Hand, Heikki Mannila and Padhraic Smyth, MIT Press, 546 pp., £34.50, ISBN 0-262-08290-X , 2004, The Knowledge Engineering Review.

[9]  Alex Alves Freitas,et al.  Mining Very Large Databases with Parallel Processing , 1997, The Kluwer International Series on Advances in Database Systems.

[10]  Nicholas R. Jennings,et al.  Foundations of distributed artificial intelligence , 1996, Sixth-generation computer technology series.

[11]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[12]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[13]  Michael Wooldridge,et al.  Autonomous agents and multi-agent systems , 2014 .

[14]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[15]  Nicholas R. Jennings,et al.  Coordination techniques for distributed artificial intelligence , 1996 .

[16]  Munindar P. Singh Book review: ARCHON: An Architecture for Multi-agent Systems. Edited by Wittig, Thies (Ellis Horwood Limited (distributed by Prentice Hall), 1992) , 1996, SGAR.

[17]  Michael Wooldridge,et al.  Reasoning about rational agents , 2000, Intelligent robots and autonomous agents.

[18]  Nong Ye,et al.  The Handbook of Data Mining , 2003 .

[19]  Ana L. C. Bazzan,et al.  A Multi-agent System to Facilitate Knowledge Discovery : an application to Bioinformatics , 2002 .

[20]  Ana L. C. Bazzan,et al.  Integrating Knowledge Through Cooperative Negotiation - A Case Study in Bioinformatics , 2005, AIS-ADM.

[21]  Jiming Liu,et al.  Autonomous Intelligent Systems: Agents and Data Mining: International Workshop, AIS-ADM 2005, St. Petersburg, Russia, June 6-8, 2005. Proceedings , 2005, AIS-ADM.

[22]  H. Sivakumar,et al.  Papyrus: A System for Data Mining over Local and Wide Area Clusters and Super-Clusters , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[23]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[24]  Ilker Hamzaoglu,et al.  Scalable, Distributed Data Mining - An Agent Architecture , 1997, KDD.

[25]  Robert E. Schapire,et al.  The Boosting Approach to Machine Learning An Overview , 2003 .

[26]  Salvatore J. Stolfo,et al.  JAM: Java Agents for Meta-Learning over Distributed Databases , 1997, KDD.

[27]  Ian H. Witten,et al.  Stacking Bagged and Dagged Models , 1997, ICML.