Multi-agent systems with reinforcement hierarchical neuro-fuzzy models

This paper introduces a new multi-agent model for intelligent agents, called reinforcement learning hierarchical neuro-fuzzy multi-agent system. This class of model uses a hierarchical partitioning of the input space with a reinforcement learning algorithm to overcome limitations of previous RL methods. The main contribution of the new system is to provide a flexible and generic model for multi-agent environments. The proposed generic model can be used in several applications, including competitive and cooperative problems, with the autonomous capacity to create fuzzy rules and expand their own rule structures, extracting knowledge from the direct interaction between the agents and the environment, without any use of supervised algorithms. The proposed model was tested in three different case studies, with promising results. The tests demonstrated that the developed system attained good capacity of convergence and coordination among the autonomous intelligent agents.

[1]  Sean Kelly,et al.  The Definitive Guide to Project Management: The Fast Track to Getting the Job Done on Time and on Budget , 2004 .

[2]  Ricardo Tanscheit,et al.  Hierarchical type-2 neuro-fuzzy BSP model , 2011, Inf. Sci..

[3]  Tao Xiong,et al.  A combined SVM and LDA approach for classification , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[4]  M. Benda,et al.  On Optimal Cooperation of Knowledge Sources , 1985 .

[5]  Guillermo Ricardo Simari,et al.  Multiagent systems: a modern approach to distributed artificial intelligence , 2000 .

[6]  C. Ribeiro A Tutorial on Reinforcement Learning Techniques , 1999 .

[7]  Martin Dosedla Use of multi-agent systems in project management , 2009 .

[8]  C. Boutilier,et al.  Accelerating Reinforcement Learning through Implicit Imitation , 2003, J. Artif. Intell. Res..

[9]  Marley M. B. R. Vellasco,et al.  Reinforcement learning/spl I.bar/hierarchical neuro-fuzzy politree model for control of autonomous agents , 2004, Fourth International Conference on Hybrid Intelligent Systems (HIS'04).

[10]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[11]  Lionel Jouffe,et al.  Fuzzy inference system learning by reinforcement methods , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[12]  Wolfgang Rosenstiel,et al.  Organizing and Planning the ASIC Design Process by Means of a Multi-agent System , 2010, ICAART.

[13]  Constanta-Nicoleta Bodea,et al.  CONTRIBUTIONS TO MULTI-AGENT SYSTEMS IMPLEMENTATION FOR PROJECT SCHEDULING , 2010 .

[14]  Hamideh Afsarmanesh,et al.  Multi-agent-based agile scheduling , 1999, Robotics Auton. Syst..

[15]  Marley M. B. R. Vellasco,et al.  Methods for acceleration of learning process of Reinforcement Learning Neuro-Fuzzy Hierarchical Politree model , 2010, 2010 International Conference on Autonomous and Intelligent Systems, AIS 2010.

[16]  Marley M. B. R. Vellasco,et al.  Hierarchical neuro-fuzzy quadtree models , 2002, Fuzzy Sets Syst..

[17]  Nikos A. Vlassis,et al.  Utile Coordination: Learning Interdependencies Among Cooperative Agents , 2005, CIG.

[18]  P. Glorennec,et al.  Fuzzy Q-learning , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[19]  Vladimir Gorodetsky,et al.  Multi-agent Software Tool for Management of Design Process in Microelectronics , 2006, 2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology.

[20]  Marley M. B. R. Vellasco,et al.  Electric load forecasting: evaluating the novel hierarchical neuro-fuzzy BSP model , 2004 .

[21]  Rudolf Kruse,et al.  Neuro-fuzzy systems for function approximation , 1999, Fuzzy Sets Syst..

[22]  Jürgen Bode,et al.  Application of multiagent systems in project management , 2000 .

[23]  R. McAfee,et al.  Auctions and Bidding , 1986 .

[24]  Constanta-Nicoleta BODEA COMPLEX PROJECT SCHEDULING USING MULTI-AGENT METHODS : A CASE STUDY FOR RESEARCH PROJECTS , 2010 .

[25]  Michael Wooldridge,et al.  Agent technology: foundations, applications, and markets , 1998 .

[26]  Richard S. Sutton,et al.  Generalization in ReinforcementLearning : Successful Examples UsingSparse Coarse , 1996 .

[27]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[28]  Marley Vellasco,et al.  Modified Reinforcement Learning-Hierarchical Neuro-Fuzzy Politree Model for Control of Autonomous Agents , 2005 .

[29]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[30]  Li Gang A Study on the Project Scheduling Based on Multi-Agent Systems , 2005 .

[31]  Murray Turoff,et al.  The Delphi Method: Techniques and Applications , 1976 .

[32]  Ming Tan,et al.  Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.

[33]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[34]  Robert Fitch,et al.  Structural Abstraction Experiments in Reinforcement Learning , 2005, Australian Conference on Artificial Intelligence.

[35]  Gerhard Weiss,et al.  Multiagent systems: a modern approach to distributed artificial intelligence , 1999 .

[36]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[37]  D. Malcolm,et al.  Application of a Technique for Research and Development Program Evaluation , 1959 .

[38]  Ishwar K. Sethi,et al.  A Metric-Based Multi-Agent System for Software Project Management , 2009, 2009 Eighth IEEE/ACIS International Conference on Computer and Information Science.

[39]  Sandip Sen,et al.  Multiagent Coordination with Learning Classifier Systems , 1995, Adaption and Learning in Multi-Agent Systems.