Thermal and Energy Management Based on Bimodal Airflow-Temperature Sensing and Reinforcement Learning

Multi-physical field sensing and machine learning have drawn great attention in various fields such as sensor networks, robotics, energy devices, smart buildings, intelligent system and so on. In this paper, we present a novel efficient method for thermal and energy management based on bimodal airflow-temperature sensing and reinforcement learning, which expedites an exploration process by self-learning and adjusts action policy only through actuators interacting with the environment, being free of the controlled object model and priori experiences. In general, training of reinforcement learning requires a large amount of data iterations, which takes a long time and is not suitable for real-time control. Here, we propose an approach to speed up the learning process by indicating the action adjustment direction. We adopt tailor-designed bimodal sensors to simultaneously detect airflow and temperature field, which provides comprehensive information for reinforcement learning. The proposed thermal and energy management incorporates bimodal parametric sensing with an improved actor-critic algorithm to realize self-learning control. Experiments of thermal and energy management in a multi-module integrated system validate the effectiveness of the proposed methodology, which demonstrate high efficiency, fast response, and good robustness in various control scenarios. The proposed methodology can be widely applied to thermal and energy management of diverse integrated systems.

[1]  Peng Liu,et al.  A Flexible Flow Sensor System and Its Characteristics for Fluid Mechanics Measurements , 2009, Sensors.

[2]  Stefan Becker,et al.  CAA of an Air-Cooling System for Electronic Devices , 2016 .

[3]  Tao Dong,et al.  Recent Developments in Optical Detection Technologies in Lab-on-a-Chip Devices for Biosensing Applications , 2014, Sensors.

[4]  Geert Deconinck,et al.  Battery Energy Management in a Microgrid Using Batch Reinforcement Learning , 2017 .

[5]  Weihua Gui,et al.  Temperature Uniformity Control of Large-Scale Vertical Quench Furnaces for Aluminum Alloy Thermal Treatment , 2016, IEEE Transactions on Control Systems Technology.

[6]  Chunsun Zhang,et al.  PCR microfluidic devices for DNA amplification. , 2006, Biotechnology advances.

[7]  Yuan Zou,et al.  Reinforcement Learning–Based Energy Management Strategy for a Hybrid Electric Tracked Vehicle , 2015 .

[8]  Alberto Cardoso,et al.  Affine Neural Network-Based Predictive Control Applied to a Distributed Solar Collector Field , 2014, IEEE Transactions on Control Systems Technology.

[9]  Cheng Ma,et al.  Self-Tuning Fully-Connected PID Neural Network System for Distributed Temperature Sensing and Control of Instrument with Multi-Modules , 2016, Sensors.

[10]  S. Patankar Airflow and Cooling in a Data Center , 2010 .

[11]  R. Z. Wang,et al.  Comfortable, high-efficiency heat pump with desiccant-coated, water-sorbing heat exchangers , 2017, Scientific Reports.

[12]  A. Pesaran,et al.  A parametric study on thermal management of an air-cooled lithium-ion battery module for plug-in hybrid electric vehicles , 2013 .

[13]  X. M. Xu,et al.  Research on the heat dissipation performance of battery pack based on forced air cooling , 2013 .

[14]  Peng Jiang,et al.  Smart Sensing Strip Using Monolithically Integrated Flexible Flow Sensor for Noninvasively Monitoring Respiratory Flow , 2015, Sensors.

[15]  Juliana de Souza Granja Barros,et al.  PID temperature controller in pig nursery: spatial characterization of thermal environment , 2018, International Journal of Biometeorology.

[16]  Rong Zhu,et al.  Temperature compensation for thermal anemometers using temperature sensors independent of flow sensors , 2011 .

[17]  T. Liao,et al.  A cell counting/sorting system incorporated with a microfabricated flow cytometer chip , 2006 .

[18]  Meng Xiao-feng,et al.  Temperature decoupling control of double-level air flow field dynamic vacuum system based on neural network and prediction principle , 2013 .

[19]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[20]  D. Pamela,et al.  Wireless Control and Automation of Hot Air Temperature in Oven for Sterilization using Fuzzy PID Controller and Adaptive Smith Predictor , 2017, Wirel. Pers. Commun..

[21]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[22]  Zhengping Li,et al.  One-step ultrasensitive detection of microRNAs with loop-mediated isothermal amplification (LAMP). , 2011, Chemical communications.

[23]  Fakhri Karray,et al.  Multisensor data fusion: A review of the state-of-the-art , 2013, Inf. Fusion.

[24]  Gian Luca Foresti,et al.  A Multimodal Learning System for Individuals with Sensorial, Neuropsychological, and Relational Impairments , 2013, J. Sensors.

[25]  Rongshun Chen,et al.  Optimal Self-Tuning PID Controller Based on Low Power Consumption for a Server Fan Cooling System , 2015, Sensors.

[26]  L. Cabeza,et al.  Free-cooling of buildings with phase change materials , 2004 .

[27]  Andreas Manz,et al.  Handheld real-time PCR device. , 2016, Lab on a chip.