An EEMD-BiLSTM Algorithm Integrated with Boruta Random Forest Optimiser for Significant Wave Height Forecasting along Coastal Areas of Queensland, Australia

Using advanced deep learning (DL) algorithms to forecast the significant wave height of coastal sea waves over a relatively short period can generate important information on wave impact and behaviour. This is vital for prior planning and decision making for events such as search and rescue operations and wave surges along the coast. Short-term 24 h forecasting could give relevant groups adequate time to take precautionary action. This study uses ocean wave features such as zero-upcrossing wave period (Tz), peak energy wave period (Tp), sea surface temperature (SST) and significant lags to forecast significant wave height (Hs). The dataset was collected from 2014 to 2019 at 30 min intervals along the coastal regions of major cities in Queensland, Australia. The novelty of this study is the development and application of a highly accurate hybrid Boruta random forest (BRF)–ensemble empirical mode decomposition (EEMD)–bidirectional long short-term memory (BiLSTM) algorithm to predict Hs. For Cairns, the EEMD-BiLSTM model outperforms all other models, with a higher Pearson’s correlation (R) of 0.9961 (BiLSTM: 0.991; EEMD-support vector regression (SVR): 0.9852; SVR: 0.9801) and a lower root mean square error (RMSE) of 0.0214 (BiLSTM: 0.0248; EEMD-SVR: 0.043; SVR: 0.0507). For Gold Coast, it likewise achieves a higher R of 0.9965 (BiLSTM: 0.9903; EEMD-SVR: 0.9953; SVR: 0.9935) and a lower RMSE of 0.0413 (BiLSTM: 0.075; EEMD-SVR: 0.0481; SVR: 0.057).
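The two scores used to compare the models can be illustrated with a minimal sketch. It assumes the standard definitions of Pearson's correlation (R) and of RMSE, taken here as the root mean square error; the abstract gives no formulas, so this is an assumption about the intended definitions rather than the study's own evaluation code. The function names and the sample Hs values are hypothetical and chosen only for illustration.

```python
import math


def pearson_r(observed, predicted):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(observed)
    if n != len(predicted) or n < 2:
        raise ValueError("need two sequences of equal length (at least 2 values)")
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_o == 0 or var_p == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_o * var_p)


def rmse(observed, predicted):
    """Root mean square error between two equal-length sequences."""
    n = len(observed)
    if n != len(predicted) or n == 0:
        raise ValueError("need two non-empty sequences of equal length")
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


# Hypothetical Hs values in metres, not data from the study.
hs_observed = [0.52, 0.61, 0.58, 0.70, 0.66]
hs_predicted = [0.50, 0.63, 0.57, 0.72, 0.64]

print("R    =", round(pearson_r(hs_observed, hs_predicted), 4))
print("RMSE =", round(rmse(hs_observed, hs_predicted), 4))
```

A higher R together with a lower RMSE indicates closer agreement between forecast and observed Hs, which is the sense in which the abstract ranks EEMD-BiLSTM above the benchmark models.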
