Autonomous PEV Charging Scheduling Using Dyna-Q Reinforcement Learning

This paper proposes a demand response method to reduce the long-term charging cost of single plug-in electric vehicles (PEV) while overcoming obstacles such as the stochastic nature of the user's driving behaviour, traffic condition, energy usage, and energy price. The problem is formulated as a Markov Decision Process (MDP) with an unknown transition probability matrix and solved using deep reinforcement learning (RL) techniques. The proposed method does not require any initial data on the PEV driver's behaviour and shows improvement on learning speed when compared to a pure model-free reinforcement learning method. A combination of model-based and model-free learning methods called Dyna-Q reinforcement learning is utilized in our strategy. Every time a real experience is obtained, the model is updated, and the RL agent will learn from both the real experience and “imagined” experiences from the model. Due to the vast amount of state space, a table-lookup method is impractical, and a value approximation method using deep neural networks is employed for estimating the long-term expected reward of all state-action pairs. An average of historical price and a long short-term memory (LSTM) network are used to predict future price. Simulation results demonstrate the effectiveness of this approach and its ability to reach an optimal policy quicker while avoiding state of charge (SOC) depletion during trips when compared to existing PEV charging schemes.

[1]  Ming Li,et al.  Electric Vehicle Route Selection and Charging Navigation Strategy Based on Crowd Sensing , 2017, IEEE Transactions on Industrial Informatics.

[2]  Jianping Pan,et al.  Networked Electric Vehicles for Green Intelligent Transportation , 2017, IEEE Communications Standards Magazine.

[3]  Yue Li,et al.  Intelligent Parking Garage EV Charging Scheduling Considering Battery Charging Characteristic , 2018, IEEE Transactions on Industrial Electronics.

[4]  Gabriela Hug,et al.  A fully distributed approach for plug-in electric vehicle charging , 2016, 2016 Power Systems Computation Conference (PSCC).

[5]  Jian Zhao,et al.  Risk-Based Day-Ahead Scheduling of Electric Vehicle Aggregator Using Information Gap Decision Theory , 2017, IEEE Transactions on Smart Grid.

[6]  Miguel A. Ortega-Vazquez,et al.  Optimal scheduling of electric vehicle charging and vehicle-to-grid services at household level including battery degradation and price uncertainty , 2014 .

[7]  Xuemin Shen,et al.  Task Time Allocation and Reward Scheme for PEV Charging Station Advertising , 2019, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[8]  Mohamed A. El-Sharkawi,et al.  Optimal Charging Strategies for Unidirectional Vehicle-to-Grid , 2011, IEEE Transactions on Smart Grid.

[9]  Lian Zhao,et al.  A Decentralized Load Balancing Approach for Neighbouring Charging Stations via EV Fleets , 2017, 2017 IEEE 86th Vehicular Technology Conference (VTC-Fall).

[10]  Lian Zhao,et al.  An SMDP-Based Prioritized Channel Allocation Scheme in Cognitive Enabled Vehicular Ad Hoc Networks , 2017, IEEE Transactions on Vehicular Technology.

[11]  Yanning Zhang,et al.  Smart and Resilient EV Charging in SDN-Enhanced Vehicular Edge Computing Networks , 2020, IEEE Journal on Selected Areas in Communications.

[12]  Ahmed E. Kamal,et al.  Reinforcement Learning Exploration Algorithms for Energy Harvesting Communications Systems , 2018, 2018 IEEE International Conference on Communications (ICC).

[13]  Mahmud Fotuhi-Firuzabad,et al.  Investigating the Impacts of Plug-in Hybrid Electric Vehicles on Power Distribution Systems , 2013, IEEE Transactions on Smart Grid.

[14]  Tom Holvoet,et al.  Reinforcement Learning of Heuristic EV Fleet Charging in a Day-Ahead Electricity Market , 2015, IEEE Transactions on Smart Grid.

[15]  Lian Zhao,et al.  DeepNOMA: A Unified Framework for NOMA Using Deep Multi-Task Learning , 2020, IEEE Transactions on Wireless Communications.

[16]  Khizir Mahmud,et al.  Electric vehicle charge scheduling using an artificial neural network , 2016, 2016 IEEE Innovative Smart Grid Technologies - Asia (ISGT-Asia).

[17]  Xuemin Shen,et al.  Decentralized PEV Power Allocation With Power Distribution and Transportation Constraints , 2020, IEEE Journal on Selected Areas in Communications.

[18]  Nei Kato,et al.  The Deep Learning Vision for Heterogeneous Network Traffic Control: Proposal, Challenges, and Future Perspective , 2017, IEEE Wireless Communications.

[19]  Marina Gonzalez Vaya,et al.  Optimal Bidding Strategy of a Plug-In Electric Vehicle Aggregator in Day-Ahead Electricity Markets Under Uncertainty , 2015, IEEE Transactions on Power Systems.

[20]  Florent Di Meglio,et al.  Optimal charging of vehicle-to-grid fleets via PDE aggregation techniques , 2015, 2015 American Control Conference (ACC).

[21]  Sergey Levine,et al.  Continuous Deep Q-Learning with Model-based Acceleration , 2016, ICML.

[22]  Zheng Wen,et al.  Optimal Demand Response Using Device-Based Reinforcement Learning , 2014, IEEE Transactions on Smart Grid.

[23]  B. Welford Note on a Method for Calculating Corrected Sums of Squares and Products , 1962 .

[24]  Haibo He,et al.  Model-Free Real-Time EV Charging Scheduling Based on Deep Reinforcement Learning , 2019, IEEE Transactions on Smart Grid.

[25]  Toru Namerikawa,et al.  Real-Time Pricing Mechanism for Electricity Market With Built-In Incentive for Participation , 2015, IEEE Transactions on Smart Grid.

[26]  Adriana Chis,et al.  Reinforcement Learning-Based Plug-in Electric Vehicle Charging With Forecasted Price , 2017, IEEE Transactions on Vehicular Technology.

[27]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[28]  Birgitte Bak-Jensen,et al.  Integration of Vehicle-to-Grid in the Western Danish Power System , 2011, IEEE Transactions on Sustainable Energy.

[29]  Abdullah Abusorrah,et al.  A Game Theoretic Approach to Risk-Based Optimal Bidding Strategies for Electric Vehicle Aggregators in Electricity Markets With Variable Wind Energy Resources , 2016, IEEE Transactions on Sustainable Energy.

[30]  Victor O. K. Li,et al.  Opportunistic Routing for Vehicular Energy Network , 2018, IEEE Internet of Things Journal.

[31]  Yuxuan Zhao,et al.  Development of Optimal Bidding Strategy for an Electric Vehicle Aggregator in a Real-Time Electricity Market , 2018, 2018 IEEE Innovative Smart Grid Technologies - Asia (ISGT Asia).

[32]  Jian-Min Xu,et al.  A Novel Contour Extraction Approach Based on Q-Learning , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[33]  Leehter Yao,et al.  A Real-Time Charging Scheme for Demand Response in Electric Vehicle Parking Station , 2017, IEEE Transactions on Smart Grid.

[34]  Xuemin Sherman Shen,et al.  Energy Management Framework for Mobile Vehicular Electric Storage , 2019, IEEE Network.

[35]  Nei Kato,et al.  State-of-the-Art Deep Learning: Evolving Machine Intelligence Toward Tomorrow’s Intelligent Network Traffic Control Systems , 2017, IEEE Communications Surveys & Tutorials.

[36]  Lei Zhao,et al.  Big Data Acquisition Under Failures in FiWi Enhanced Smart Grid , 2019, IEEE Transactions on Emerging Topics in Computing.

[37]  Di Wu,et al.  Two-Stage Energy Management for Office Buildings With Workplace EV Charging and Renewable Energy , 2017, IEEE Transactions on Transportation Electrification.

[38]  Marija Ilic,et al.  Optimal Autonomous Charging of Electric Vehicles with Stochastic Driver Behavior , 2014, 2014 IEEE Vehicle Power and Propulsion Conference (VPPC).

[39]  Hongwei Li,et al.  A Hybrid Machine Learning Model for Range Estimation of Electric Vehicles , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[40]  Gabriela Hug,et al.  Distributed cooperative charging for plug-in electric vehicles: A consensus+innovations approach , 2016, 2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[41]  Michael Nicholas,et al.  Studying the PEV market in california: Comparing the PEV, PHEV and hybrid markets , 2013, 2013 World Electric Vehicle Symposium and Exhibition (EVS27).