Deep reinforcement learning based energy management for a hybrid electric vehicle

Abstract This research proposes a reinforcement learning-based algorithm and a deep reinforcement learning-based algorithm for energy management of a series hybrid electric tracked vehicle. Firstly, the powertrain model of the series hybrid electric tracked vehicle (SHETV) is constructed, then the corresponding energy management formulation is established. Subsequently, a new variant of reinforcement learning (RL) method Dyna, namely Dyna-H, is developed by combining the heuristic planning step with the Dyna agent and is applied to energy management control for SHETV. Its rapidity and optimality are validated by comparing with DP and conventional Dyna method. Facing the problem of the “curse of dimensionality” in the reinforcement learning method, a novel deep reinforcement learning algorithm deep Q-learning (DQL) is designed for energy management control, which uses a new optimization method (AMSGrad) to update the weights of the neural network. Then the proposed deep reinforcement learning control system is trained and verified by the realistic driving condition with high-precision, and is compared with the benchmark method DP and the traditional DQL method. Results show that the proposed deep reinforcement learning method realizes faster training speed and lower fuel consumption than traditional DQL policy does, and its fuel economy quite approximates to global optimum. Furthermore, the adaptability of the proposed method is confirmed in another driving schedule.

[1]  Simona Onori,et al.  A Comparative Analysis of Energy Management Strategies for Hybrid Electric Vehicles , 2011 .

[2]  Chunhua Zheng,et al.  An Energy Management Strategy of Hybrid Energy Storage Systems for Electric Vehicle Applications , 2018, IEEE Transactions on Sustainable Energy.

[3]  Benjamín Pla,et al.  Analytical Optimal Solution to the Energy Management Problem in Series Hybrid Electric Vehicles , 2018, IEEE Transactions on Vehicular Technology.

[4]  Xiaosong Hu,et al.  Combined Optimal Sizing and Control for a Hybrid Tracked Vehicle , 2012 .

[5]  Yuan Zou,et al.  Intelligent energy management for hybrid electric tracked vehicles using online reinforcement learning , 2019, Applied Energy.

[6]  H. JoséAntonioMartín,et al.  Dyna-H: A heuristic planning reinforcement learning algorithm applied to role-playing game strategy decision systems , 2011, Knowl. Based Syst..

[7]  Huei Peng,et al.  Comparative Study of Dynamic Programming and Pontryagin’s Minimum Principle on Energy Management for a Parallel Hybrid Electric Vehicle , 2013 .

[8]  Hongwen He,et al.  Deep reinforcement learning of energy management with continuous control strategy and traffic information for a series-parallel plug-in hybrid electric bus , 2019, Applied Energy.

[9]  Yan Xu,et al.  Data-Driven Load Frequency Control for Stochastic Power Systems: A Deep Reinforcement Learning Method With Continuous Action Search , 2019, IEEE Transactions on Power Systems.

[10]  Jingyuan Zhan,et al.  Hybrid-Trip-Model-Based Energy Management of a PHEV With Computation-Optimized Dynamic Programming , 2018, IEEE Transactions on Vehicular Technology.

[11]  J. Karl Hedrick,et al.  Dynamic Traffic Feedback Data Enabled Energy Management in Plug-in Hybrid Electric Vehicles , 2015, IEEE Transactions on Control Systems Technology.

[12]  Yuan Zou,et al.  Reinforcement Learning of Adaptive Energy Management With Transition Probability for a Hybrid Electric Tracked Vehicle , 2015, IEEE Transactions on Industrial Electronics.

[13]  M Maarten Steinbuch,et al.  Rule-based energy management strategies for hybrid vehicles , 2007 .

[14]  Junnian Wang,et al.  Calibration efficiency improvement of rule-based energy management system for a plug-in hybrid electric vehicle , 2017 .

[15]  Bin Jiao,et al.  Optimal Energy Management Strategy of a Plug-in Hybrid Electric Vehicle Based on a Particle Swarm Optimization Algorithm , 2015 .

[16]  Sergio Grammatico,et al.  A series-parallel hybrid electric powertrain for industrial vehicles , 2010, 2010 IEEE Vehicle Power and Propulsion Conference.

[17]  Kumeresan A. Danapalasingam,et al.  A review on hybrid electric vehicles architecture and energy management strategies , 2016 .

[18]  Xiaosong Hu,et al.  Pontryagin’s Minimum Principle based model predictive control of energy management for a plug-in hybrid electric bus , 2019, Applied Energy.

[19]  Ilya V. Kolmanovsky,et al.  Game Theory Controller for Hybrid Electric Vehicles , 2014, IEEE Transactions on Control Systems Technology.

[20]  Yijun Wang,et al.  An Optimization Strategy Based on Hybrid Algorithm of Adam and SGD , 2018 .

[21]  Yuan Zou,et al.  Reinforcement Learning–Based Energy Management Strategy for a Hybrid Electric Tracked Vehicle , 2015 .

[22]  Jean-Philippe Diguet,et al.  Reinforcement-Learning Approach Guidelines for Energy Management , 2019, J. Low Power Electron..

[23]  Hyunsoo Kim,et al.  Development of Brake System and Regenerative Braking Cooperative Control Algorithm for Automatic-Transmission-Based Hybrid Electric Vehicles , 2015, IEEE Transactions on Vehicular Technology.

[24]  Tomaž Katrašnik,et al.  Analytical method to evaluate fuel consumption of hybrid electric vehicles at balanced energy content of the electric storage devices , 2010 .

[25]  Yang Zhou,et al.  A survey on driving prediction techniques for predictive energy management of plug-in hybrid electric vehicles , 2019, Journal of Power Sources.

[26]  Richard S. Sutton,et al.  Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.

[27]  Hongwen He,et al.  Rule based energy management strategy for a series–parallel plug-in hybrid electric bus optimized by dynamic programming , 2017 .

[28]  Daniel F. Opila,et al.  Real-World Robustness for Hybrid Vehicle Optimal Energy Management Strategies Incorporating Drivability Metrics , 2014 .

[29]  Yang Li,et al.  A real-time energy management strategy combining rule-based control and ECMS with optimization equivalent factor for HEVs , 2017, 2017 Chinese Automation Congress (CAC).

[30]  Yuan Zou,et al.  A Real-Time Markov Chain Driver Model for Tracked Vehicles and Its Validation: Its Adaptability via Stochastic Dynamic Programming , 2017, IEEE Transactions on Vehicular Technology.

[31]  Yuan Zou,et al.  Reinforcement learning-based real-time energy management for a hybrid tracked vehicle , 2016 .

[32]  Christian Schroer,et al.  Deep Reinforcement Learning for Advanced Energy Management of Hybrid Electric Vehicles , 2018, ICAART.