论文信息 - Elevator Control Using Reinforcement Learning to Select Strategy

Elevator Control Using Reinforcement Learning to Select Strategy

In this thesis, we investigated if reinforcement learning could be applied on elevator systems to improve performance. The performance was evaluated by the average squared waiting time for the passengers, and the buildings considered were apartment buildings.The problem of scheduling elevator cars is an NP-hard problem, and no optimal solution is known. Therefore, an approach where the system learns a strategy instead of using a heuristic, should be the easiest way to get near an optimal solution.A learning system was constructed, where the system was trained to use the best scheduling algorithm out of five in a given situation, based on the prevailing traffic. The purpose of this approach was to reduce the training time that was required in order to get good performance and to lower the complexity of the system.A simulator was then developed, in which the different algorithms were implemented and tested in four different scenarios, where the size of the building and the number of elevator cars varied. The results generated by the simulator showed that reinforcement learning is a great strategy to use in buildings with 16 floors and three or four elevator cars. However, reinforcement learning did not increase the performance in buildings with 10 floors and two to three elevator cars. A possible reason for this is that the variation in performance between the different scheduling algorithms was too small in these scenarios.

Anton Jansson | Kristoffer Uggla Lingvall | A. Jansson

[1] Andrew G. Barto,et al. Elevator Group Control Using Multiple Reinforcement Learning Agents , 1998, Machine Learning.

[2] Matthew Brand,et al. Decision-Theoretic Group Elevator Scheduling , 2003, ICAPS.

[3] Marja-Liisa Siikonen,et al. Elevator Group Control with Artificial Intelligence , 1997 .

[4] Risto Lahdelma,et al. Estimated Time of Arrival ( ETA ) Based Elevator Group Control Algorithm with More Accurate Estimation , 2004 .

[5] Jana Koehler,et al. An AI-Based Approach to Destination Control in Elevators , 2002, AI Mag..

[6] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7] David Thomas,et al. The Art in Computer Programming , 2001 .

[8] Thomas Strang,et al. Context-Aware Elevator Scheduling , 2007, 21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07).

[9] Peter B. Luh,et al. Group Elevator Scheduling with Advanced Traffic Information for Normal Operations and Coordinated Emergency Evacuation , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.