MALEF: Framework for distributed machine learning and data mining

Growing importance of distributed data mining techniques has recently attracted attention of researchers in multiagent domain. In this paper we present a novel framework MultiAgent Learning Framework (MALEF) designed for both the agent-based distributed machine learning as well as data mining. Proposed framework is based on: the exchange of meta-level descriptions of individual learning process; online reasoning about learning success and learning progress. This paper illustrates how MALEF framework can be used in practical system in which different learners use different datasets, hypotheses and learning algorithms. We describe our experimental results obtained using this system and review related work on the subject.

[1]  Sean Luke,et al.  Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.

[2]  Gerhard Weiß,et al.  A multiagent perspective of parallel and distributed machine learning , 1998, AGENTS '98.

[3]  Nicholas R. Jennings,et al.  Recommender systems: a market-based design , 2003, AAMAS '03.

[4]  Vlado Keselj,et al.  Document clustering using character N-grams: a comparative evaluation with term-based and word-based clustering , 2005, CIKM '05.

[5]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[6]  Vasant Honavar,et al.  Analysis and Synthesis of Agents That Learn from Distributed Dynamic Data Sources , 2001, Emergent Neural Computational Architectures Based on Neuroscience.

[7]  Pierre Dillenbourg,et al.  Collaborative Learning: Cognitive and Computational Approaches , 1999 .

[8]  Hillol Kargupta,et al.  Distributed Data Mining: Algorithms, Systems, and Applications , 2003 .

[9]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[10]  Nitesh V. Chawla,et al.  Creating ensembles of classifiers , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[11]  Salvatore J. Stolfo,et al.  JAM: Java Agents for Meta-Learning over Distributed Databases , 1997, KDD.

[12]  Lee A. Becker,et al.  Coactive Learning for Distributed Data Mining , 1998, KDD.

[13]  Michal Pechoucek,et al.  Market-Inspired Approach to Collaborative Learning , 2006, CIA.

[14]  Gregory F. Cooper,et al.  Model Averaging for Prediction with Discrete Bayesian Networks , 2004, J. Mach. Learn. Res..

[15]  Sati S. Sian,et al.  Extending Learning to Multiple Agents: Issues and a Model for Multi-Agent Machine Learning (MA-ML) , 1991, EWSL.

[16]  Ian H. Witten,et al.  Text mining: a new frontier for lossless compression , 1999, Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096).

[17]  Reid G. Smith,et al.  The Contract Net Protocol: High-Level Communication and Control in a Distributed Problem Solver , 1980, IEEE Transactions on Computers.

[18]  Matthias Klusch,et al.  Agent-Based Distributed Data Mining: The KDEC Scheme , 2003, AgentLink.

[19]  Pierre Dillenbourg,et al.  What is 'multi' in multi-agent learning? , 1999 .

[20]  Santiago Ontañón,et al.  Recycling data for multi-agent learning , 2005, ICML '05.

[21]  Joydeep Ghosh,et al.  A Consensus Framework for Integrating Distributed Clusterings Under Limited Knowledge Sharing , 2002 .

[22]  M. Larkin,et al.  Websites , 2001 .

[23]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  H. Sivakumar,et al.  Papyrus: A System for Data Mining over Local and Wide Area Clusters and Super-Clusters , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[25]  Eric Bauer,et al.  An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants , 1999, Machine Learning.

[26]  Foster J. Provost,et al.  Scaling Up: Distributed Machine Learning with Cooperation , 1996, AAAI/IAAI, Vol. 1.