Multi-task learning for the prediction of wind power ramp events with deep neural networks

In Machine Learning, the most common way to address a given problem is to optimize an error measure by training a single model to solve the desired task. However, sometimes it is possible to exploit latent information from other related tasks to improve the performance of the main one, resulting in a learning paradigm known as Multi-Task Learning (MTL). In this context, the high computational capacity of deep neural networks (DNN) can be combined with the improved generalization performance of MTL, by designing independent output layers for every task and including a shared representation for them. In this paper we exploit this theoretical framework on a problem related to Wind Power Ramps Events (WPREs) prediction in wind farms. Wind energy is one of the fastest growing industries in the world, with potential global spreading and deep penetration in developed and developing countries. One of the main issues with the majority of renewable energy resources is their intrinsic intermittency, which makes it difficult to increase the penetration of these technologies into the energetic mix. In this case, we focus on the specific problem of WPREs prediction, which deeply affect the wind speed and power prediction, and they are also related to different turbines damages. Specifically, we exploit the fact that WPREs are spatially-related events, in such a way that predicting the occurrence of WPREs in different wind farms can be taken as related tasks, even when the wind farms are far away from each other. We propose a DNN-MTL architecture, receiving inputs from all the wind farms at the same time to predict WPREs simultaneously in each of the farms locations. The architecture includes some shared layers to learn a common representation for the information from all the wind farms, and it also includes some specification layers, which refine the representation to match the specific characteristics of each location. Finally we modified the Adam optimization algorithm for dealing with imbalanced data, adding costs which are updated dynamically depending on the worst classified class. We compare the proposal against a baseline approach based on building three different independent models (one for each wind farm considered), and against a state-of-the-art reservoir computing approach. The DNN-MTL proposal achieves very good performance in WPREs prediction, obtaining a good balance for all the classes included in the problem (negative ramp, no ramp and positive ramp).

[1]  Matthias Scholz,et al.  Nonlinear Principal Component Analysis: Neural Network Models and Applications , 2008 .

[2]  Zong Woo Geem,et al.  A Coral Reefs Optimization algorithm with Harmony Search operators for accurate wind speed prediction , 2015 .

[3]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[4]  Pedro Antonio Gutiérrez,et al.  Combining Reservoir Computing and Over-Sampling for Ordinal Wind Power Ramp Prediction , 2017, IWANN.

[5]  O. Kisi,et al.  Solar radiation prediction using different techniques: model evaluation and comparison , 2016 .

[6]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[7]  José R. Dorronsoro,et al.  Deep Neural Networks for Wind and Solar Energy Prediction , 2017, Neural Processing Letters.

[8]  Pedro Antonio Gutiérrez,et al.  Robust estimation of wind power ramp events with reservoir computing , 2017 .

[9]  Massimiliano Pontil,et al.  Convex multi-task feature learning , 2008, Machine Learning.

[10]  Lawrence Carin,et al.  Multi-Task Learning for Classification with Dirichlet Process Priors , 2007, J. Mach. Learn. Res..

[11]  Dong Jiang,et al.  Evaluating the spatio-temporal variation of China's offshore wind resources based on remotely sensed wind field data , 2013 .

[12]  Jonathan Baxter,et al.  A Model of Inductive Bias Learning , 2000, J. Artif. Intell. Res..

[13]  Xuanjing Huang,et al.  Recurrent Neural Network for Text Classification with Multi-Task Learning , 2016, IJCAI.

[14]  Jinfu Chen,et al.  Wind Speed Prediction with Spatio–Temporal Correlation: A Deep Learning Approach , 2018 .

[15]  Jie Zhang,et al.  Conditional aggregated probabilistic wind power forecasting based on spatio-temporal correlation , 2019, Applied Energy.

[16]  L. Qin,et al.  Research on wind power ramp events prediction based on strongly convective weather classification , 2017 .

[17]  Pedro Antonio Gutiérrez,et al.  Multiclass Prediction of Wind Power Ramp Events Combining Reservoir Computing and Support Vector Machines , 2016, CAEPIA.

[18]  James W. Taylor,et al.  Probabilistic forecasting of wind power ramp events using autoregressive logit models , 2017, Eur. J. Oper. Res..

[19]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[20]  Á. Cuerva,et al.  Identifying wind power ramp causes from multivariate datasets: a methodological proposal and its application to reanalysis data , 2015 .

[21]  John Boland,et al.  Scenario generation and probabilistic forecasting analysis of spatio-temporal wind speed series with multivariate autoregressive volatility models , 2019, Applied Energy.

[22]  Andrew Kusiak,et al.  Optimisation of time window size for wind power ramps prediction , 2017 .

[23]  Massimiliano Pontil,et al.  The Benefit of Multitask Representation Learning , 2015, J. Mach. Learn. Res..

[24]  Nikolaos S. Thomaidis,et al.  Do current wind farms in Spain take maximum advantage of spatiotemporal balancing of the wind resource , 2016 .

[25]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Jie Zhang,et al.  Statistical Representation of Wind Power Ramps Using a Generalized Gaussian Mixture Model , 2018, IEEE Transactions on Sustainable Energy.

[27]  Sancho Salcedo-Sanz,et al.  Spatio-temporal analysis of wind resource in the Iberian Peninsula with data-coupled clustering , 2018 .

[28]  J. Thepaut,et al.  The ERA‐Interim reanalysis: configuration and performance of the data assimilation system , 2011 .

[29]  Andrzej Cichocki,et al.  Group Component Analysis for Multiblock Data: Common and Individual Feature Extraction , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[30]  Dirk J. Cannon,et al.  Using reanalysis data to quantify extreme wind power generation statistics: A 33 year case study in Great Britain , 2015 .

[31]  Jie Zhang,et al.  Wind Power Ramp Event Forecasting Using a Stochastic Scenario Generation Method , 2015, IEEE Transactions on Sustainable Energy.

[32]  Cristobal Gallego-Castillo,et al.  A review on the recent history of wind power ramp forecasting , 2015 .