iEnsemble: A Framework for Committee Machine Based on Multiagent Systems with Reinforcement Learning

Machine Learning is an area of Artificial Intelligence whose objective is to develop computational techniques for learning and to build systems able to acquire knowledge automatically. One of the main challenges for learning algorithms is to maximize generalization. The committee machine, a combination of more than one learning machine that is known in the literature as an ensemble, combined with agent theory, is therefore a promising approach to this challenge. In this context, this research proposes the iEnsemble framework, which models the ensemble as a multi-agent system architecture in which generalization, combination, and learning are carried out by agents performing their respective roles. In the proposal, each agent follows its own life cycle and also executes the iStacking algorithm. This algorithm is based on the Stacking method and uses reinforcement learning to define the result of the ensemble. To validate the initial proposal of the framework, some experiments were performed, and the results obtained and the limitations are presented.
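The abstract does not specify how iStacking applies reinforcement learning, so the following is only an illustrative sketch of the general idea, not the authors' algorithm. It assumes a combiner that keeps one value estimate per base learner, updates it with a stateless Q-learning (bandit-style) rule from a 0/1 reward, and reports the prediction of the highest-valued learner as the ensemble result. All names (`train_combiner`, `ensemble_predict`, the toy learners and data) are hypothetical.

```python
import random

def train_combiner(base_learners, samples, episodes=2000,
                   alpha=0.1, epsilon=0.2, seed=0):
    """Learn one value estimate per base learner (assumed scheme, not iStacking).

    Each episode draws a labelled sample, picks a learner epsilon-greedily,
    and rewards it with 1.0 if its prediction matches the label, else 0.0.
    """
    rng = random.Random(seed)
    q = [0.0] * len(base_learners)
    for _ in range(episodes):
        x, y = rng.choice(samples)
        if rng.random() < epsilon:
            # Explore: try a learner chosen uniformly at random.
            a = rng.randrange(len(base_learners))
        else:
            # Exploit: use the learner with the highest current estimate.
            a = max(range(len(q)), key=q.__getitem__)
        reward = 1.0 if base_learners[a](x) == y else 0.0
        # Incremental update toward the observed reward.
        q[a] += alpha * (reward - q[a])
    return q

def ensemble_predict(base_learners, q, x):
    """Ensemble result: the prediction of the highest-valued base learner."""
    best = max(range(len(q)), key=q.__getitem__)
    return base_learners[best](x)

# Toy problem (hypothetical): label is 1 when x >= 5, else 0.
samples = [(x, int(x >= 5)) for x in range(10)]

# Three stand-in "base learners" of differing quality.
learners = [
    lambda x: 0,            # always predicts 0
    lambda x: int(x >= 8),  # threshold placed too late
    lambda x: int(x >= 5),  # matches the true rule
]

q = train_combiner(learners, samples)
predictions = [ensemble_predict(learners, q, x) for x, _ in samples]
```

In this sketch the combiner learns which base learner to trust rather than fitting a meta-model on their outputs, which is one simple way to replace the usual Stacking meta-learner with a reward signal. A fuller treatment would condition the value estimates on the input or on the base learners' outputs.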
