Learning in Heterogeneous Environments: a Case Study

We propose a learning mechanism suitable for using in a heterogeneous environment, where interconnected systems present simultaneously a competitive and cooperative behaviour. By interconnected systems we understand systems that need to interact with each other and then act either in an individual as well as a global way: each of the systems observes itself and the environment, and acts according to its unique perspective; but a system can also need the help of others to complete its own tasks. In our case, systems are represented by agents, since we are studying environments where autonomy and pro-activity are essential features. The learning mechanism we propose here can be classified as a multi-agent learning mechanism, not only because there are multiple agents learning concurrently in the same environment but also because it allows them to understand how to improve their own performance without degrading the performance of the other agents. We tested our learning mechanism over the Disruption Management in Airline Operations Control Center application domain, where interconnected agents (namely the Aircraft Manager, the Passenger Manager and the Crew Manager) need to combine their expertise. The results show that our learning mechanism provides a good performance to the agents that operate simultaneously in cooperative and competitive situations.

[1]  Sean Luke,et al.  Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.

[2]  Kagan Tumer,et al.  Optimal Payoff Functions for Members of Collectives , 2001, Adv. Complex Syst..

[3]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[4]  Peter Stone,et al.  Reinforcement Learning for RoboCup Soccer Keepaway , 2005, Adapt. Behav..

[5]  A.G. Bakirtzis,et al.  A Reinforcement Learning Algorithm for Market Participants in FTR Auctions , 2007, 2007 IEEE Lausanne Power Tech.

[6]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[7]  R. Arkin,et al.  Behavioral diversity in learning robot teams , 1998 .

[8]  Ryszard Kowalczyk,et al.  Dynamic analysis of multiagent Q-learning with ε-greedy exploration , 2009, ICML '09.

[9]  Ana L. C. Bazzan,et al.  Learning in groups of traffic signals , 2010, Eng. Appl. Artif. Intell..

[10]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[11]  Ana Paula Rocha,et al.  Towards an autonomous and intelligent Airline Operations Control , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.

[12]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[13]  Manuela M. Veloso,et al.  Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[14]  Keiki Takadama,et al.  Toward Guidelines for Modeling Learning Agents in Multiagent-Based Simulation: Implications from Q-Learning and Sarsa Agents , 2004, MABS.

[15]  Ana Paula Rocha,et al.  An Electronic Market Architecture for the Formation of Virtual Enterprises , 1999, PRO-VE.