Uncertainty Estimation with Distributional Reinforcement Learning for Applications in Intelligent Transportation Systems: A Case Study

Reinforcement learning (RL) algorithms have been successfully used in the area of Intelligent Transportation Systems (ITS) for applications such as energy management strategies (EMS) of hybrid electric vehicles, autonomous driving, traffic light cycle control and bottleneck decongestion. In this work, we investigate a distributional RL algorithm on an EMS problem as a case study to show the benefits of estimating the uncertainty associated with different actions at different states. The uncertainty estimation is highly beneficial to ITS applications as randomness and uncertainty are intrinsic characteristics of real-world problems due to incomplete knowledge of the environment and the stochastic nature of some real-world systems and human behaviors. The modeled uncertainty has the form of a return distribution for taking an action at a certain state. We provide a case study to show that only considering the expected reward value would result in a loss of important information such as the spread. By modeling the return distribution, domain knowledge can be incorporated to make the results more interpretable. Also, risk-sensitive strategies can be developed to build more robust solutions using a chosen utility function.

[1]  Rémi Munos,et al.  Implicit Quantile Networks for Distributional Reinforcement Learning , 2018, ICML.

[2]  Xin Zhang,et al.  End to End Learning for Self-Driving Cars , 2016, ArXiv.

[3]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[5]  Jonathan P. How,et al.  Decision Making Under Uncertainty: Theory and Application , 2015 .

[6]  Alexandre M. Bayen,et al.  Lagrangian Control through Deep-RL: Applications to Bottleneck Decongestion , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[7]  Bo Gao,et al.  Driving Style Recognition for Intelligent Vehicle Control and Advanced Driver Assistance: A Survey , 2018, IEEE Transactions on Intelligent Transportation Systems.

[8]  Jieping Ye,et al.  Deep Reinforcement Learning with Knowledge Transfer for Online Rides Order Dispatching , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[9]  Zhu Han,et al.  A Deep Reinforcement Learning Network for Traffic Light Cycle Control , 2018, IEEE Transactions on Vehicular Technology.

[10]  Shashi Shekhar,et al.  A Deep Reinforcement Learning Framework for Energy Management of Extended Range Electric Delivery Vehicles , 2019, 2019 IEEE Intelligent Vehicles Symposium (IV).

[11]  Matthew W. Hoffman,et al.  Distributed Distributional Deterministic Policy Gradients , 2018, ICLR.

[12]  Marc G. Bellemare,et al.  A Distributional Perspective on Reinforcement Learning , 2017, ICML.

[13]  Yee Whye Teh,et al.  An Analysis of Categorical Distributional Reinforcement Learning , 2018, AISTATS.

[14]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.