Nearshore Wave Predictions Using Data Mining Techniques during Typhoons: A Case Study near Taiwan’s Northeastern Coast

Seasonal typhoons provide energy into the wave field in summer and autumn in Taiwan. Typhoons lead to abundant wave energy near the coastal area and cause storm surges that can destroy offshore facilities. The potential for wave energy can be obtained from analyzing the wave height. To develop an effective model for predicting typhoon-induced wave height near coastal areas, this study employed various popular data mining models—namely k-nearest neighbors (kNN), linear regressions (LR), model trees (M5), multilayer perceptron (MLP) neural network, and support vector regression (SVR) algorithms—as forecasting techniques. The principal component analysis (PCA) was then performed to reduce the potential variables from the original data at the first stage of data preprocessing. The experimental site was the Longdong buoy off the northeastern coast of Taiwan. Data on typhoons that occurred during 2002–2011 and 2012–2013 were collected for training and testing, respectively. This study designed four PCA cases, namely EV1, TV90, TV95, and ORI: EV1 used eigenvalues higher than 1.0 as principal components; TV90 and TV95 used the total variance percentages of 90% and 95%, respectively; and ORI used the original data. The forecast horizons varying from 1 h to 6 h were evaluated. The results show that (1) in the PCA model’ cases, when the number of attributes decreases, computing time decreases and prediction error increases; (2) regarding classified wave heights, M5 provides excellent outcomes at the small wavelet wavelet level; MLP has favorable outcomes at the large wavelet and small/moderate wave levels; meanwhile, SVR gives optimal outcomes at the long wave and high/very high wave levels; and (3) for performance of lead times, MLP and SVR achieve more favorable relative weighted performance without consideration of computational complexity; however, MLP and SVR might obtain lower performance when computational complexity is considered.

[1]  Mevlut Ture,et al.  Comparison of dimension reduction methods using patient satisfaction data , 2007, Expert Syst. Appl..

[2]  Ryo Saegusa,et al.  Nonlinear principal component analysis to preserve the order of principal components , 2003, Neurocomputing.

[3]  Chih-Chiang Wei Surface Wind Nowcasting in the Penghu Islands Based on Classified Typhoon Tracks and the Effects of the Central Mountain Range of Taiwan , 2014 .

[4]  K. Taylor Summarizing multiple aspects of model performance in a single diagram , 2001 .

[5]  Paresh Chandra Deka,et al.  Discrete wavelet neural network approach in significant wave height forecasting for multistep lead time , 2012 .

[6]  Cheng‐Han Tsai,et al.  Wave measurements by pressure transducers using artificial neural networks , 2009 .

[7]  A. Etemad-Shahidi,et al.  COMPARISON BETWEEN M5 MODEL TREE AND NEURAL NETWORKS FOR PREDICTION OF SIGNIFICANT WAVE HEIGHT IN LAKE SUPERIOR , 2009 .

[8]  Hsien-Kuo Chang,et al.  Simulated wave-driven ANN model for typhoon waves , 2011, Adv. Eng. Softw..

[10]  Ming-Hsi Hsu,et al.  Predicting typhoon-induced storm surge tide with a two-dimensional hydrodynamic model and artificial neural network model , 2012 .

[11]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[12]  LinChitsan,et al.  Prediction of Influential Operational Compost Parameters for Monitoring Composting Process , 2016 .

[13]  Geoff Holmes,et al.  Generating Rule Sets from Model Trees , 1999, Australian Joint Conference on Artificial Intelligence.

[14]  Adam H. Monahan,et al.  Nonlinear Principal Component Analysis by Neural Networks: Theory and Application to the Lorenz System , 2000 .

[15]  Seong S. Chae,et al.  Effect of using principal coordinates and principal components on retrieval of clusters , 2006, Comput. Stat. Data Anal..

[16]  Xiangbo Feng,et al.  Wave spectra assimilation in typhoon wave modeling for the East China Sea , 2012 .

[17]  Stephan Trenn,et al.  Multilayer Perceptrons: Approximation Order and Necessary Number of Hidden Units , 2008, IEEE Transactions on Neural Networks.

[18]  Senay Asma,et al.  MLR and ANN models of significant wave height on the west coast of India , 2012, Comput. Geosci..

[19]  N. Hsu,et al.  Rainfall-Runoff Prediction Using Dynamic Typhoon Information and Surface Weather Characteristic Considering Monsoon Effects , 2015, Water Resources Management.

[20]  Helge Blaker,et al.  Minimax estimation in linear regression under restrictions , 2000 .

[21]  Kate Smith-Miles,et al.  On learning algorithm selection for classification , 2006, Appl. Soft Comput..

[22]  Andrew W. Moore,et al.  Locally Weighted Learning , 1997, Artificial Intelligence Review.

[23]  Andrew O. Finley,et al.  Efficient k-nearest neighbor searches for multi-source forest attribute mapping , 2008 .

[24]  A. B. M. Shawkat Ali,et al.  Prospects of renewable energy – a feasibility study in the Australian context , 2012 .

[25]  Patra,et al.  Auto-calibration and -compensation of a capacitive pressure sensor using multilayer perceptrons , 2000, ISA transactions.

[26]  John E Richards,et al.  Recovering dipole sources from scalp-recorded event-related-potentials using component analysis: principal component analysis and independent component analysis. , 2004, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[27]  Pat Morin,et al.  Output-Sensitive Algorithms for Computing Nearest-Neighbour Decision Boundaries , 2005, Discret. Comput. Geom..

[28]  Gm. Shafiullah,et al.  Hybrid renewable energy integration (HREI) system for subtropical climate in Central Queensland, Australia , 2016 .

[29]  Dina Makarynska,et al.  Artificial neural networks in wave predictions at the west coast of Portugal , 2005, Comput. Geosci..

[30]  Wei-Zhen Lu,et al.  Potential assessment of a neural network model with PCA/RBF approach for forecasting pollutant trends in Mong Kok urban air, Hong Kong. , 2004, Environmental research.

[31]  Hsiao-Chung Tsai,et al.  Maximum Covariance Analysis of Typhoon Surface Wind and Rainfall Relationships in Taiwan , 2009 .

[32]  Ahmadreza Zamani,et al.  Learning from data for wind–wave forecasting , 2008 .

[33]  C. Tsay Orography Effects on the Structure of Typhoons: Analyses of Two Typhoons Crossing Taiwan , 1994 .

[34]  Ian H. Witten,et al.  Induction of model trees for predicting continuous classes , 1996 .

[35]  Paul C. Liu,et al.  On the growth of ocean waves , 2007 .

[36]  Christopher S. Ruf,et al.  Bayesian Wind Speed Estimation Conditioned on Significant Wave Height for GNSS-R Ocean Observations , 2017 .

[37]  Chih-Pei Chang,et al.  Effects of Terrain on the Surface Structure of Typhoons over Taiwan , 1993 .

[38]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[39]  Chih-Chiang Wei,et al.  Comparing single- and two-segment statistical models with a conceptual rainfall-runoff model for river streamflow prediction during typhoons , 2016, Environ. Model. Softw..

[40]  H. Kaiser The Application of Electronic Computers to Factor Analysis , 1960 .

[41]  Godfried T. Toussaint,et al.  Geometric proximity graphs for improving nearest neighbor methods in instance-based learning and data mining , 2005, Int. J. Comput. Geom. Appl..

[42]  Akbar A. Javadi Estimation of air losses in compressed air tunneling using neural network , 2006 .

[43]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[44]  Ying-Hwa Kuo,et al.  Typhoons Affecting Taiwan: Current Understanding and Future Challenges , 1999 .

[45]  Y. Kuo,et al.  Rainfall Simulation Associated with Typhoon Herb (1996) near Taiwan. Part I: The Topographic Effect , 2001 .

[46]  R. X. Liu,et al.  Principal component regression analysis with SPSS , 2003, Comput. Methods Programs Biomed..

[47]  Melih Iphar,et al.  ANN and ANFIS performance prediction models for hydraulic impact hammers , 2012 .

[48]  F. Chiu,et al.  The spatial and temporal characteristics of the wave energy resources around Taiwan , 2013 .

[49]  Yi-Ching Liaw,et al.  Fast k-nearest neighbors search using modified principal axis search tree , 2010, Digit. Signal Process..

[50]  Chih-Chiang Wei Improvement of Typhoon Precipitation Forecast Efficiency by Coupling SSM/I Microwave Data with Climatologic Characteristics and Precipitation , 2013 .

[51]  Shreenivas Londhe,et al.  Soft computing approach for real-time estimation of missing wave heights , 2008 .

[52]  Chih-Chiang Wei,et al.  RBF Neural Networks Combined with Principal Component Analysis Applied to Quantitative Precipitation Forecast for a Reservoir Watershed during Typhoon Periods , 2012 .