Coordinated control of gas supply system in PEMFC based on multi-agent deep reinforcement learning

Abstract In proton exchange membrane fuel cells (PEMFCs), the hydrogen supply system and air supply system jointly impact the output characteristics, and there is a coordination problem between these two systems. To solve this coordination problem, an intelligent control framework is presented that considers the coordination between the air flux controller and hydrogen flux controller in PEMFCs, and an ensemble imitation learning multi-trick deep deterministic policy gradient (EILMMA-DDPG) is advanced for this framework. The algorithm proposed here complies with an ensemble imitation learning policy, i.e., exploiting multiple reinforcement learning explorers that contain actor networks to perform distributed exploration in the environment, thereby improving the exploration efficiency. Moreover, a control algorithm explorer that contains various conventional control algorithms is presented to create model samples over a range of scenarios in an attempt to address sparse rewards and improve the training efficiency in conjunction with an experience probability replay mechanism. Next, multiple tricks are adopted to improve the overestimated Q value. Finally, a model-free intelligent control algorithm capable of coordinating controllers and exhibiting a better global searching ability is developed. In addition, the proposed algorithm is adopted in the control framework of the air and hydrogen supply system in PEMFCs. Furthermore, as revealed from the simulated results, the proposed intelligent control framework can more effectively control the oxygen excess rate (OER) and output voltage.

[1]  Romeo Ortega,et al.  Experimental Validation of a PEM Fuel-Cell Reduced-Order Model and a Moto-Compressor Higher Order Sliding-Mode Control , 2010, IEEE Transactions on Industrial Electronics.

[2]  Mohammad Hassan Khooban,et al.  A New Adaptive Type-II Fuzzy-Based Deep Reinforcement Learning Control: Fuel Cell Air-Feed Sensors Control , 2019, IEEE Sensors Journal.

[3]  Zhun Fan,et al.  Ensemble learning for optimal active power control of distributed energy resources and thermostatically controlled loads in an islanded microgrid , 2018, International Journal of Hydrogen Energy.

[4]  Ya-Xiong Wang,et al.  Performance Optimization for Open‐cathode Fuel Cell Systems with Overheating Protection and Air Starvation Prevention , 2017 .

[5]  Hua Wang,et al.  Performance analysis and improvement of a proton exchange membrane fuel cell using comprehensive intelligent control , 2008, 2008 International Conference on Electrical Machines and Systems.

[6]  Ya-Xiong Wang,et al.  Feedforward fuzzy-PID control for air flow regulation of PEM fuel cell system , 2015 .

[7]  David J. Friedman,et al.  Requirements for a Flexible and Realistic Air Supply Model for Incorporation into a Fuel Cell Vehicle (FCV) System Simulation , 1999 .

[8]  Michael A. Danzer,et al.  Model-based control of cathode pressure and oxygen excess ratio of a PEM fuel cell system , 2008 .

[9]  Baharuddin,et al.  A new method for optimal parameters identification of a PEMFC using an improved version of Monarch Butterfly Optimization Algorithm , 2020 .

[10]  Minhae Kwon,et al.  Intelligent IoT Connectivity: Deep Reinforcement Learning Approach , 2020, IEEE Sensors Journal.

[11]  Carlos Bordons,et al.  Nonlinear MPC for the airflow in a PEM fuel cell using a Volterra series model , 2012 .

[12]  Kwang Y. Lee,et al.  Data-driven oxygen excess ratio control for proton exchange membrane fuel cell , 2018, Applied Energy.

[13]  Ricardo Cajo,et al.  Experts agents in PEM fuel cell control , 2015, 2015 International Conference on Renewable Energy Research and Applications (ICRERA).

[14]  G. Uday Bhaskar Babu,et al.  A new control strategy for a higher order proton exchange membrane fuel cell system , 2020 .

[15]  Carlos Bordons,et al.  Design and experimental validation of a constrained MPC for the air feed of a fuel cell , 2009 .

[16]  Hanxin Zhu,et al.  Efficient experience replay based deep deterministic policy gradient for AGC dispatch in integrated energy system , 2021 .

[17]  James A. Adams,et al.  The Development of Ford's P2000 Fuel Cell Vehicle , 2000 .

[18]  Young-Bae Kim,et al.  Improving dynamic performance of proton-exchange membrane fuel cell system using time delay control , 2010 .

[19]  Z. Gajic,et al.  A Simple Sliding Mode Controller of a Fifth-Order Nonlinear PEM Fuel Cell Model , 2014, IEEE Transactions on Energy Conversion.

[20]  Maria Pina Serra,et al.  Nonlinear model predictive control methodology for efficiency and durability improvement in a fuel cell power system , 2016 .

[21]  P. Farhadi,et al.  PEMFC voltage control using PSO-tunned-PID controller , 2014, Proceedings of the 2014 IEEE NW Russia Young Researchers in Electrical and Electronic Engineering Conference.

[22]  Mustapha Hatti,et al.  Dynamic neural network controller model of PEM fuel cell system , 2009 .

[23]  Sun Yi,et al.  Adaptive control for robust air flow management in an automotive fuel cell system , 2017 .

[24]  Jiawen Li,et al.  A new adaptive controller based on distributed deep reinforcement learning for PEMFC air supply system , 2021 .

[25]  Zhu Xin-jian Adaptive Fuzzy PID Control of PEMFC System Electric Output Characteristic , 2005 .

[26]  Zhehan Yi,et al.  Deep-Reinforcement-Learning-Based Autonomous Voltage Control for Power Grid Operations , 2020, IEEE Transactions on Power Systems.

[27]  Yujie Wang,et al.  Robust fault diagnosis and fault tolerant control for PEMFC system based on an augmented LPV observer , 2020 .

[28]  Young-Bae Kim,et al.  Real-Time Control for Air Excess Ratio of a PEM Fuel Cell System , 2014, IEEE/ASME Transactions on Mechatronics.

[29]  Ralph E. White,et al.  A water and heat management model for proton-exchange-membrane fuel cells , 1993 .

[30]  Cédric Damour,et al.  Real-time implementation of a neural model-based self-tuning PID strategy for oxygen stoichiometry control in PEM fuel cell , 2014 .

[31]  Zhidong Qi,et al.  Dynamic modeling and fuzzy PID control study on proton exchange membrane fuel cell , 2010, 2010 The 2nd Conference on Environmental Science and Information Application Technology.

[32]  Qi Li,et al.  Nonlinear controller design based on cascade adaptive sliding mode control for PEM fuel cell air supply systems , 2019, International Journal of Hydrogen Energy.

[33]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[34]  Nigel P. Brandon,et al.  Modelling and Explicit MPC of PEM Fuel Cell Systems , 2010 .

[35]  Hamed Beirami,et al.  Optimal PID plus fuzzy controller design for a PEM fuel cell air feed system using the self-adaptive differential evolution algorithm , 2015 .

[36]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[37]  Guo-Ping Liu,et al.  Design and Implementation of On-Line Self-Tuning Control for PEM Fuel Cells , 2008 .

[38]  Lakhdar Khochemane,et al.  An adaptive fuzzy logic controller (AFLC) for PEMFC fuel cell , 2015 .

[39]  Fu-Cheng Wang,et al.  Multivariable robust PID control for a PEMFC system , 2010 .

[40]  Amornchai Arpornwichanop,et al.  Control structure design and robust model predictive control for controlling a proton exchange membrane fuel cell , 2017 .

[41]  Wei Liu,et al.  Optimal design of cathode flow channel for air-cooled PEMFC with open cathode , 2020 .