Retrieval of missing values in water temperature series using a data-driven model

A measurement buoy with attached sensors has been deployed at our study area to monitor hydrodynamics, water properties, and water quality conditions. High-resolution temporal data have been collected and streamed into an online system that is accessible in nearly real-time. However, in certain circumstances the sensors may fail to provide continuous and high quality data. This results in gaps or corrupted values. The aim of this study was to reconstruct the faulty values. This paper proposes a method based on a data-driven model, using an Artificial Neural Network combined with a Genetic Algorithm to generate a synthetic data series. The generated data can be used as a patch for the incomplete measured data. Additional improvements were achieved by removing seasonal patterns from the original time series using a wavelet decomposition prior to the data-driven model training process. Comparisons with a standard missing-data imputation method using the Kohonen self-organizing map were made to further asses the performance of the proposed data-driven model. The algorithm was applied to water temperature data, but the same approach is applicable to other parameters of interest.

[1]  William Remus,et al.  Time series forecasting using neural networks: should the data be deseasonalized first? , 1999 .

[2]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[3]  Iyan E. Mulia,et al.  Hybrid ANN–GA model for predicting turbidity and chlorophyll-a concentrations , 2013 .

[4]  Bharat Bhushan,et al.  Biofouling: lessons from nature , 2012, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[5]  Pavel Tkalich,et al.  Temporal variability and climatology of hydrodynamic, water property and water quality parameters in the West Johor Strait of Singapore. , 2013, Marine pollution bulletin.

[6]  Pavel Tkalich,et al.  The Various Components of the Circulation in the Singapore Strait Region: Tidal, Wind and Eddy-driven Circulations and Their Relative Importance , 2010 .

[7]  Ioannis P. Vlahavas,et al.  Applying adaptive prediction to sea-water quality measurements , 2009, Expert Syst. Appl..

[8]  A. Malik,et al.  Artificial neural network modeling of the river water quality—A case study , 2009 .

[9]  William W. Hsieh,et al.  Forecasts of Tropical Pacific Sea Surface Temperatures by Neural Networks and Support Vector Regression , 2009 .

[10]  Louis Wehenkel,et al.  Data validation and missing data reconstruction using self-organizing map for water treatment , 2011, Neural Computing and Applications.

[11]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.

[12]  Tommy D. Dickey,et al.  Methods for Reducing Biofouling of Moored Optical Sensors , 2004 .

[13]  Walid Elshorbagy,et al.  Temperature-salinity modeling for Ruwais coastal area in United Arab Emirates. , 2013, Marine pollution bulletin.

[14]  Mohammad Firuz Ramli,et al.  Artificial neural network modeling of the water quality index for Kinta River (Malaysia) using water quality variables as predictors. , 2012, Marine pollution bulletin.

[15]  Guoqiang Peter Zhang,et al.  Time series forecasting using a hybrid ARIMA and neural network model , 2003, Neurocomputing.

[16]  Laurent Delauney,et al.  Biofouling protection for marine environmental sensors , 2009 .

[17]  Subimal Ghosh,et al.  Predicting Sea Surface Temperatures in the North Indian Ocean with Nonlinear Autoregressive Neural Networks , 2013 .

[18]  Vladan Babovic,et al.  Error correction of a predictive ocean wave model using local model approximation , 2005 .

[19]  Adebayo Adeloye,et al.  Infilling of missing rainfall and streamflow data in the Shire River basin, Malawi – A self organizing map approach , 2012 .

[20]  Georg Umgiesser,et al.  Climate change response of the Mar Menor coastal lagoon (Spain) using a hydrodynamic finite element model , 2012 .

[21]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[22]  Holger R. Maier,et al.  Review of Input Variable Selection Methods for Artificial Neural Networks , 2011 .

[23]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[24]  Erkki Oja,et al.  Engineering applications of the self-organizing map , 1996, Proc. IEEE.

[25]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[26]  S. Mallat Multiresolution approximations and wavelet orthonormal bases of L^2(R) , 1989 .

[27]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[28]  Laurence C. Breaker,et al.  Predicting offshore temperatures in Monterey Bay based on coastal observations using linear forecast models , 2009 .

[29]  Nasreen Islam Khan,et al.  Simulating and predicting river discharge time series using a wavelet‐neural network hybrid modelling approach , 2012 .

[30]  Giorgio Corani,et al.  Air quality prediction in Milan: feed-forward neural networks, pruned neural networks and lazy learning , 2005 .

[31]  Trevor Platt,et al.  Ocean response to attenuation of visible light by phytoplankton in the Gulf of St. Lawrence , 2011 .

[32]  Shie-Yui Liong,et al.  An ANN application for water quality forecasting. , 2008, Marine pollution bulletin.

[33]  Adebayo Adeloye,et al.  Kohonen self‐organizing map estimator for the reference crop evapotranspiration , 2011 .

[34]  Guoqiang Peter Zhang,et al.  Neural network forecasting for seasonal and trend time series , 2005, Eur. J. Oper. Res..

[35]  I. Pairaud,et al.  Hydrology and circulation in a coastal area off Marseille: Validation of a nested 3D model with observations , 2011 .

[36]  Durdu Ömer Faruk A hybrid neural network and ARIMA model for water quality time series prediction , 2010, Eng. Appl. Artif. Intell..