Multi-agent reinforcement learning for adaptive scheduling: application to multi-site company

Abstract In recent years, most companies have resorted to multi-site organization in an effort to improve their competitiveness and to adapt to current conditions. In this article, we propose a model for adaptive scheduling in multi-site companies. We adopt a multi-agent approach in which intelligent agents have reactive learning ability. This allows them to make accurate short-term decisions. Our model is implemented on a 3-tier architecture that ensures the security of the data exchanged between the various company sites. Experimentations on a real case study demonstrate the applicability and the effectiveness of our model concerning both optimality and reactivity.

[1]  Jacques Ferber,et al.  A meta-model for the analysis and design of organizations in multi-agent systems , 1998, Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160).

[2]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[3]  Nicolas Soulié,et al.  Cahiers du GRES Organisational and spatial determinants of the multi- unit firm: Evidence from the French industry , 2007 .

[4]  R. E. Miles,et al.  Managing 21st century network organizations , 1992 .

[5]  A. David,et al.  RATP : la métamorphose - Réalités et théorie du pilotage du changement , 1995 .

[6]  Jürgen Sauer,et al.  Towards agent-based multi-site scheduling , 2000, PuK.

[7]  D. Trentesaux,et al.  Pilotage hétérarchique des systèmes de production , 2002 .

[8]  Botond Kádár,et al.  Adaptation and Learning in Distributed Production Control , 2004 .

[9]  Florence Pirard A hybrid decision aid approach for supply networks of multi-site enterprises redesign and strategic planing / Une démarche hybride d'aide à la décision pour la reconfiguration et la planification stratégique des réseaux logistiques des entreprises multi-sites , 2005 .

[10]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[11]  Yuri N. Sotskov,et al.  Schedule execution for two-machine flow-shop with interval processing times , 2009, Math. Comput. Model..

[12]  Damien Trentesaux,et al.  Self-organization in distributed manufacturing control: state-of-the-art and future trends , 2002, IEEE International Conference on Systems, Man and Cybernetics.