论文信息 - Multi-agent reinforcement learning for adaptive scheduling: application to multi-site company

Multi-agent reinforcement learning for adaptive scheduling: application to multi-site company

Abstract In recent years, most companies have resorted to multi-site organization in an effort to improve their competitiveness and to adapt to current conditions. In this article, we propose a model for adaptive scheduling in multi-site companies. We adopt a multi-agent approach in which intelligent agents have reactive learning ability. This allows them to make accurate short-term decisions. Our model is implemented on a 3-tier architecture that ensures the security of the data exchanged between the various company sites. Experimentations on a real case study demonstrate the applicability and the effectiveness of our model concerning both optimality and reactivity.

Damien Trentesaux | Bouziane Beldjilali | Nassima Aissani

[1] Jacques Ferber,et al. A meta-model for the analysis and design of organizations in multi-agent systems , 1998, Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160).

[2] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .

[3] Nicolas Soulié,et al. Cahiers du GRES Organisational and spatial determinants of the multi- unit firm: Evidence from the French industry , 2007 .

[4] R. E. Miles,et al. Managing 21st century network organizations , 1992 .

[5] A. David,et al. RATP : la métamorphose - Réalités et théorie du pilotage du changement , 1995 .

[6] Jürgen Sauer,et al. Towards agent-based multi-site scheduling , 2000, PuK.

[7] D. Trentesaux,et al. Pilotage hétérarchique des systèmes de production , 2002 .

[8] Botond Kádár,et al. Adaptation and Learning in Distributed Production Control , 2004 .

[9] Florence Pirard. A hybrid decision aid approach for supply networks of multi-site enterprises redesign and strategic planing / Une démarche hybride d'aide à la décision pour la reconfiguration et la planification stratégique des réseaux logistiques des entreprises multi-sites , 2005 .

[10] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[11] Yuri N. Sotskov,et al. Schedule execution for two-machine flow-shop with interval processing times , 2009, Math. Comput. Model..

[12] Damien Trentesaux,et al. Self-organization in distributed manufacturing control: state-of-the-art and future trends , 2002, IEEE International Conference on Systems, Man and Cybernetics.