Deep reinforcement learning of energy management with continuous control strategy and traffic information for a series-parallel plug-in hybrid electric bus

Abstract Hybrid electric vehicles offer an immediate solution for emissions reduction and fuel displacement at the current level of technology. Energy management strategies are critical for improving the fuel economy of hybrid electric vehicles. In this paper, we propose an energy management strategy for a series-parallel plug-in hybrid electric bus based on deep deterministic policy gradients. Specifically, deep deterministic policy gradients is an actor-critic, model-free reinforcement learning algorithm that can assign the optimal energy split of the bus over continuous spaces. We assume that the buses run on a fixed bus line, where the driving cycle is constrained by the traffic. Traffic information and the number of passengers are also incorporated into the energy management system. The deep reinforcement learning based energy management agent is trained on a large number of driving cycles generated from traffic simulation. Experiments on the simulated driving cycles show that the proposed approach outperforms a conventional reinforcement learning approach and performs close to the globally optimal dynamic programming solution. Moreover, it generalizes well to standard driving cycles that differ significantly from those on which it was trained. We also show some notable attributes of the learned energy management strategies through visualizations of the actor and critic. The main contribution of this study is to explore the incorporation of traffic information into hybrid electric vehicle energy management through advanced intelligent algorithms.
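To make the actor-critic structure named in the abstract more concrete, the following is a minimal sketch of three ingredients commonly found in deep deterministic policy gradients: a deterministic actor that outputs one continuous action, a critic that scores a state-action pair, and a slowly updated target copy used to form the bootstrapped learning target. Everything specific here is an assumption made for illustration and is not taken from the paper: the three-element state (battery state of charge, normalized speed, normalized passenger load), the scalar action read as a power-split ratio, the reward value, the linear function approximators standing in for deep networks, and the values of `gamma` and `tau`.

```python
import math

def actor(weights, state):
    """Deterministic policy: map a state to one continuous action in (0, 1)."""
    z = sum(w * s for w, s in zip(weights, state))
    return 1.0 / (1.0 + math.exp(-z))  # sigmoid keeps the assumed split ratio bounded

def critic(weights, state, action):
    """Linear action-value estimate Q(s, a); the last weight multiplies the action."""
    features = list(state) + [action]
    return sum(w * f for w, f in zip(weights, features))

def td_target(reward, next_state, target_actor_w, target_critic_w, gamma=0.99):
    """Bootstrapped target y = r + gamma * Q'(s', mu'(s')) from the target copies."""
    next_action = actor(target_actor_w, next_state)
    return reward + gamma * critic(target_critic_w, next_state, next_action)

def soft_update(target_w, online_w, tau=0.01):
    """Move the target weights a small step toward the online weights."""
    return [(1.0 - tau) * t + tau * o for t, o in zip(target_w, online_w)]

# Hypothetical state: [state of charge, normalized speed, normalized passenger load]
state = [0.6, 0.3, 0.5]
next_state = [0.59, 0.35, 0.5]
actor_w = [0.0, 0.0, 0.0]         # untrained actor
critic_w = [0.0, 0.0, 0.0, 0.0]   # untrained critic (three state weights + one action weight)

action = actor(actor_w, state)                       # continuous action, no discretization
y = td_target(-0.2, next_state, actor_w, critic_w)   # hypothetical reward of -0.2
new_target = soft_update([0.0, 0.0], [1.0, 2.0], tau=0.1)
```

Because the actor returns a real number rather than an index into a table of discrete actions, the energy split can vary smoothly, which is the property the abstract refers to as control over continuous spaces. A full agent would additionally need a replay buffer, exploration noise, and gradient updates for both actor and critic, none of which are shown in this sketch.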
