Optimal route selection in complex multi-stage supply chain networks using SARSA(λ)

Success of any supply chain highly depends on the appropriate use of transportation. It is always a tough decision to select an optimal route in the stages of a supply chain that will balance the reverse condition of cost and time. Q-learning is already getting implied to optimize routes, but it greatly lacks in designing a complete Markov Decision Process (MDP) with enough number of states and actions sets. In this paper, we propose an MDP that can fit the design of a proper multi-stage supply chain network and intend to solve the MDP with SARSA (X) algorithm. The algorithm was simulated in a java based environment and experimental results show that Q-learning produced higher chunks of rewards but SARSA (X) was faster in case of convergence speed.

[1]  Panos M. Pardalos,et al.  Supply Chain Optimization , 2007 .

[2]  A. Vazacopoulos,et al.  SUPPLY CHAIN OPTIMIZATION , 2005 .

[3]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4]  Dawei Lu,et al.  Fundamentals of supply chain management , 2011 .

[5]  J. Winch,et al.  Supply Chain Management: Strategy, Planning, and Operation , 2003 .

[6]  Michael Hugos,et al.  Essentials of Supply Chain Management , 2002 .

[7]  Turan Paksoy,et al.  A genetic algorithm approach for multi-objective optimization of supply chain networks , 2006, Comput. Ind. Eng..

[8]  Karan M. Gupta Performance Comparison of Sarsa ( λ ) and Watkin ’ s Q ( λ ) Algorithms , .

[9]  Arafat Habib,et al.  Reinforcement learning based autonomic virtual machine management in clouds , 2016, 2016 5th International Conference on Informatics, Electronics and Vision (ICIEV).

[10]  Ali Selamat,et al.  Route planning model of multi-agent system for a supply chain management , 2013, Expert Syst. Appl..

[11]  Michael Schwind,et al.  Supply Chain Yield Management based on Reinforcement Learning , 2003 .

[12]  Shoshanah Cohen,et al.  Strategic supply chain management : the five disciplines for top performance , 2005 .

[13]  Isis Truck,et al.  From Data Center Resource Allocation to Control Theory and Back , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[14]  S. Chopra,et al.  Supply Chain Management: Strategy, Planning & Operation , 2007 .