Evaluation of Seven Gap-Filling Techniques for Daily Station-Based Rainfall Datasets in South Ethiopia

Meteorological stations, mainly located in developing countries, have gigantic missing values in the climate dataset (rainfall and temperature). Ignoring the missing values from analyses has been used as a technique to manage it. However, it leads to partial and biased results in data analyses. Instead, filling the data gaps using the reference datasets is a better and widely used approach. Thus, this study was initiated to evaluate the seven gap-filling techniques in daily rainfall datasets in five meteorological stations of Wolaita Zone and the surroundings in South Ethiopia. The considered gap-filling techniques in this study were simple arithmetic means (SAM), normal ratio method (NRM), correlation coefficient weighing (CCW), inverse distance weighting (IDW), multiple linear regression (MLR), empirical quantile mapping (EQM), and empirical quantile mapping plus (EQM+). The techniques were preferred because of their computational simplicity and appreciable accuracies. Their performance was evaluated against mean absolute error (MAE), root mean square error (RMSE), skill scores (SS), and Pearson’s correlation coefficients (R). The results indicated that MLR outperformed other techniques in all of the five meteorological stations. It showed the lowest RMSE and the highest SS and R in all stations. Four techniques (SAM, NRM, CCW, and IDW) showed similar performance and were second-ranked in all of the stations with little exceptions in time series. EQM+ improved (not substantial) the performance levels of gap-filling techniques in some stations. In general, MLR is suggested to fill in the missing values of the daily rainfall time series. However, the second-ranked techniques could also be used depending on the required time series (period) of each station. The techniques have better performance in stations located in higher altitudes. The authors expect a substantial contribution of this paper to the achievement of sustainable development goal thirteen (climate action) through the provision of gap-filling techniques with better accuracy.

[1]  Z. L. Chuan,et al.  Determination of the best single imputation algorithm for missing rainfall data treatment , 2016 .

[2]  Carolina Guardiola-Albert,et al.  Estimating extremely large amounts of missing precipitation data , 2020 .

[3]  A. R. Peralta-Hernández Comparative analysis of indices of extreme rainfall events: Variations and trends from southern México , 2009 .

[4]  K. Nandalal,et al.  A Comparison of Methods of Estimating Missing Daily Rainfall Data , 2016 .

[5]  N. D. K. Dayawansa,et al.  A comparison of methods used in estimating missing rainfall data , 2007 .

[6]  J. Gironás,et al.  Spatial estimation of daily precipitation in regions with complex relief and scarce data using terrain orientation , 2014 .

[7]  N. Elagib,et al.  Gap filling and homogenization of climatological datasets in the headwater region of the Upper Blue Nile Basin, Ethiopia , 2017 .

[8]  Paulin Coulibaly,et al.  Comparison of neural network methods for infilling missing daily weather records , 2007 .

[9]  A. Fares,et al.  Comparison of Rainfall Interpolation Methods in a Mountainous Region of a Tropical Island , 2011 .

[10]  Y. Pachepsky,et al.  Reconstructing missing daily precipitation data using regression trees and artificial neural networks for SWAT streamflow simulation. , 2010 .

[11]  David C. Goodrich,et al.  Spatial interpolation of precipitation in a dense gauge network for monsoon storm events in the southwestern United States , 2008 .

[12]  Ali Reza Nafarzadegan,et al.  A modified distance-weighted approach for filling annual precipitation gaps: application to different climates of Iran , 2014, Theoretical and Applied Climatology.

[13]  Frédérique Seyler,et al.  A Quantile Mapping Bias Correction Method Based on Hydroclimatic Classification of the Guiana Shield , 2017, Sensors.

[14]  T. Giambelluca,et al.  Comparison of geostatistical approaches to spatially interpolate month‐year rainfall for the Hawaiian Islands , 2016 .

[15]  A. Berti,et al.  Comparison of Four Methods to Fill the Gaps in Daily Precipitation Data Collected by a Dense Weather Network , 2013 .

[16]  Wei Li,et al.  A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness , 2019, Wirel. Commun. Mob. Comput..

[17]  Bodo Ahrens,et al.  Distance in spatial interpolation of daily rain gauge data , 2005 .

[18]  Andualem Shigute Boke Comparative Evaluation of Spatial Interpolation Methods for Estimation of Missing Meteorological Variables over Ethiopia , 2017 .

[19]  Asaad M. Armanuos,et al.  Cross Assessment of Twenty-One Different Methods for Missing Precipitation Data Estimation , 2020, Atmosphere.

[20]  Comparison of the Relevance and the Performance of Filling in Gaps Methods in Climate Datasets , 2018, Advances in Intelligent Systems and Computing.

[21]  W. Y. Tang,et al.  Comparative studies of various missing data treatment methods - Malaysian experience , 1996 .

[22]  Madhuri S. Rishi,et al.  Performance of various techniques in estimating missing climatological data over snowbound mountainous areas of Karakoram Himalaya , 2018 .

[23]  Ramesh S. V. Teegavarapu,et al.  Infilling missing precipitation records using variants of spatial interpolation and data‐driven methods: use of optimal weighting parameters and nearest neighbour‐based corrections , 2018 .

[24]  Christian Bernhofer,et al.  Statistically downscaled climate dataset for East Africa , 2019, Scientific Data.

[25]  Andrew Kusiak,et al.  Assessment of different methods for estimation of missing data in precipitation studies , 2017 .

[26]  G. Mariéthoz,et al.  Missing Data Imputation for Multisite Rainfall Networks: A Comparison between Geostatistical Interpolation and Pattern-Based Estimation on Different Terrain Types , 2020 .

[27]  John O. Odiyo,et al.  Filling of missing rainfall data in Luvuvhu River Catchment using artificial neural networks , 2011 .

[28]  Jorge Luis Morales,et al.  Analysis of a new spatial interpolation weighting method to estimate missing data applied to rainfall records , 2019, Atmósfera.

[29]  Lan-hai Li,et al.  Reconstruction of hydrometeorological time series and its uncertainties for the Kaidu River Basin using multiple data sources , 2013, Theoretical and Applied Climatology.

[30]  Mahsa Hasanpour Kashani,et al.  Evaluation of efficiency of different estimation methods for missing climatological data , 2011, Stochastic Environmental Research and Risk Assessment.

[31]  F. Yuan,et al.  Comparison of Spatial Interpolation Schemes for Rainfall Data and Application in Hydrological Modeling , 2017 .

[32]  G. La Loggia,et al.  Comparative analysis of different techniques for spatial interpolation of rainfall data to create a serially complete monthly time series of precipitation for Sicily, Italy , 2011, Int. J. Appl. Earth Obs. Geoinformation.

[33]  SCDNA: a serially complete precipitation and temperature dataset for North America from 1979 to 2018 , 2020 .

[34]  Kuk‐Hyun Ahn,et al.  New gridded rainfall dataset over the Korean peninsula: Gap infilling, reconstruction, and validation , 2021, International Journal of Climatology.

[35]  Vicente Caselles,et al.  Multiple imputation of rainfall missing data in the Iberian Mediterranean context , 2017 .

[36]  T. Giambelluca,et al.  Characterizing the Uncertainty and Assessing the Value of Gap-Filled Daily Rainfall Data in Hawaii , 2020, Journal of Applied Meteorology and Climatology.

[37]  Lukas Gudmundsson,et al.  Technical Note: Downscaling RCM precipitation to the station scale using statistical transformations – a comparison of methods , 2012 .

[38]  Christian W. Dawson,et al.  SDSM - a decision support tool for the assessment of regional climate change impacts , 2002, Environ. Model. Softw..

[39]  M. Clark,et al.  Gridded Ensemble Precipitation and Temperature Estimates for the Contiguous United States , 2015 .

[40]  Ali Selamat,et al.  Infilling Missing Rainfall and Runoff Data for Sarawak, Malaysia Using Gaussian Mixture Model Based K-Nearest Neighbor Imputation , 2019, IEA/AIE.

[41]  M. Maugeri,et al.  Improving estimation of missing values in daily precipitation series by a probability density function‐preserving approach , 2010 .

[42]  Roslinazairimah Zakaria,et al.  Estimation of Missing Rainfall Data Using Spatial Interpolation and Imputation Methods , 2015 .

[43]  Jamshid Piri,et al.  Application of ANN and ANFIS models for reconstructing missing flow data , 2010, Environmental monitoring and assessment.

[44]  Ramesh S. V. Teegavarapu,et al.  Improved weighting methods, deterministic and stochastic data-driven models for estimation of missing precipitation records , 2005 .

[45]  C. Blanco,et al.  Assessment of satellite products for filling rainfall data gaps in the Amazon region , 2021, Natural Resource Modeling.

[46]  Kwangmin Kang,et al.  Interpolation of Missing Precipitation Data Using Kernel Estimations for Hydrologic Modeling , 2015 .

[47]  M. Clark,et al.  SC-Earth: A Station-based Serially Complete Earth Dataset from 1950 to 2019 , 2021, Journal of Climate.

[48]  Wan Norliyana Wan Ismail,et al.  Estimation of rainfall and stream flow missing data for Terengganu, Malaysia by using interpolation technique methods , 2017 .

[49]  B. Bonaccorso,et al.  A new approach for processing climate missing databases applied to daily rainfall data in Soummam watershed, Algeria , 2019, Heliyon.

[50]  R. Teegavarapu Spatial interpolation using nonlinear mathematical programming models for estimation of missing precipitation records , 2012 .

[51]  Vishal Singh,et al.  Data assimilation for constructing long-term gridded daily rainfall time series over Southeast Asia , 2019, Climate Dynamics.

[52]  P. Fiener,et al.  Comparison and evaluation of spatial interpolation schemes for daily rainfall in data scarce regions , 2012 .

[53]  Guillermo Trincado,et al.  Alternative approaches for estimating missing climate data: application to monthly precipitation records in South-Central Chile , 2018, Forest Ecosystems.

[54]  R. Stott,et al.  The World Bank , 2008, Annals of tropical medicine and parasitology.

[55]  A. Mekasha,et al.  Trends in daily observed temperature and precipitation extremes over three Ethiopian eco‐environments , 2014 .

[56]  S. Vicente‐Serrano,et al.  Gap Filling of Monthly Temperature Data and Its Effect on Climatic Variability and Trends , 2019, Journal of Climate.

[57]  S. Walker,et al.  Evaluation of an inverse distance weighting method for patching daily and dekadal rainfall over the Free State Province, South Africa , 2016 .

[58]  C. Stow High-resolution studies of rainfall on Norfolk Island. Part IV: observations of fractional time raining , 2002 .

[59]  Chen-Wuing Liu,et al.  Estimation of the spatial rainfall distribution using inverse distance weighting (IDW) in the middle of Taiwan , 2012, Paddy and Water Environment.

[60]  D. Alexakis,et al.  A Quantile Mapping Method to Fill in Discontinued Daily Precipitation Time Series , 2020 .

[61]  W. Z. W. Zin,et al.  Comparison of the Methods to Estimate Missing Values in Monthly Precipitation Data , 2017 .

[62]  Caroline Geck,et al.  The World Factbook , 2017 .

[63]  Xiao-Hua Zhou,et al.  Multiple imputation: review of theory, implementation and software , 2007, Statistics in medicine.