Inductive machine learning for improved estimation of catchment-scale snow water equivalent

Summary Infrastructure for the automatic collection of single-point measurements of snow water equivalent (SWE) is well-established. However, because SWE varies significantly over space, the estimation of SWE at the catchment scale based on a single-point measurement is error-prone. We propose low-cost, lightweight methods for near-real-time estimation of mean catchment-wide SWE using existing infrastructure, wireless sensor networks, and machine learning algorithms. Because snowpack distribution is highly nonlinear, we focus on Genetic Programming (GP), a nonlinear, white-box, inductive machine learning algorithm. Because we did not have access to near-real-time catchment-scale SWE data, we used available data as ground truth for machine learning in a set of experiments that are successive approximations of our goal of catchment-wide SWE estimation. First, we used a history of maritime snowpack data collected by manual snow courses. Second, we used distributed snow depth (HS) data collected automatically by wireless sensor networks. We compared the performance of GP against linear regression (LR), binary regression trees (BT), and a widely used basic method (BM) that naively assumes non-variable snowpack. In the first experiment set, GP and LR models predicted SWE with lower error than BM. In the second experiment set, GP had lower error than LR, but outperformed BT only when we applied a technique that specifically mitigated the possibility of over-fitting.

[1]  Albert Rango,et al.  Areal distribution of snow water equivalent evaluated by snow cover monitoring , 1981 .

[2]  Roy Schwaerzel Predicting Financial Time Series by Genetic Programming with Trigonometric Functions and High-Order Statistics , 2006 .

[3]  B. Alvera,et al.  Evaluation of spatial variability in snow water equivalent for a high mountain catchment , 2004 .

[4]  J. Dozier,et al.  Estimating the spatial distribution of snow water equivalent in an alpine basin using binary regression tree models: the impact of digital elevation data and independent variable selection , 2005 .

[5]  B. Santer,et al.  Attribution of declining Western U.S. Snowpack to human effects , 2008 .

[6]  W. P. Adams Areal differentiation of snow cover in east central Ontario , 1976 .

[7]  Roger C. Bales,et al.  SNOTEL representativeness in the Rio Grande headwaters on the basis of physiographics and remotely sensed snow cover persistence , 2006 .

[8]  Robert J. Gurney,et al.  Simulating wind-affected snow accumulations at catchment to basin scales , 2013 .

[9]  J. Landes Application of a J-Q Model for Fracture in the Ductile-Brittle Transition , 1997 .

[10]  D. Marks,et al.  The detection and correction of snow water equivalent pressure sensor errors , 2004 .

[11]  Tom Bylander,et al.  Financial time series prediction and evaluation by genetic programming with trigonometric functions and high-order statistics , 2006 .

[12]  Yoshinori Tsuchiya,et al.  Comparison of artificial synthesis methods of gene. , 2006, Nucleic acids symposium series.

[13]  Kelly Elder,et al.  Combining binary decision tree and geostatistical methods to estimate snow distribution in a mountain watershed , 2000 .

[14]  Thomas H. Painter,et al.  Mountain hydrology of the western United States , 2006 .

[15]  Michael Lehning,et al.  Statistical modelling of the snow depth distribution in open alpine terrain , 2013 .

[16]  Eric J. Fetzer,et al.  Extreme snowfall events linked to atmospheric rivers and surface air temperature via satellite measurements , 2010 .

[17]  Una-May O'Reilly,et al.  Computational Complexity Analysis of Genetic Programming - Initial Results and Future Directions , 2011 .

[18]  Yves Bühler,et al.  Sensitivity of snow avalanche simulations to digital elevation model quality and resolution , 2011, Annals of Glaciology.

[19]  John E. Dennis,et al.  Normal-Boundary Intersection: A New Method for Generating the Pareto Surface in Nonlinear Multicriteria Optimization Problems , 1998, SIAM J. Optim..

[20]  S. Glaser,et al.  Design and performance of a wireless sensor network for catchment‐scale snow and soil moisture measurements , 2012 .

[21]  Steven R. Fassnacht,et al.  Small scale spatial variability of snow density and depth over complex alpine terrain: Implications for estimating snow water equivalent , 2013 .

[22]  Regine Hock,et al.  Areal melt and discharge modelling of Storglaciären, Sweden , 1997, Annals of Glaciology.

[23]  Kelly Elder,et al.  Snow accumulation and distribution in an Alpine Watershed , 1991 .

[24]  Timothy E. Link,et al.  Subgrid variability of snow water equivalent at operational snow stations in the western USA , 2013 .

[25]  Christian Skalka,et al.  Snowcloud: A Complete Data Gathering System for Snow Hydrology Research , 2013, REALWSN.

[26]  M. Clark,et al.  Characteristics of the western United States snowpack from snowpack telemetry (SNOTEL) data , 1999 .

[27]  Younes Alila,et al.  The influence of forest and topography on snow accumulation and melt at the watershed-scale , 2007 .

[28]  Atsumu Ohmura,et al.  Physical Basis for the Temperature-Based Melt-Index Method , 2001 .

[29]  Michael D. Schmidt,et al.  Automated refinement and inference of analytical models for metabolic networks , 2011, Physical biology.

[30]  Danny Marks,et al.  Long‐term snow distribution observations in a mountain catchment: Assessing variability, time stability, and the representativeness of an index site , 2014 .

[31]  K. Deb,et al.  Understanding knee points in bicriteria problems and their implications as preferred solution principles , 2011 .

[32]  Kang-Tsung Chang,et al.  Modelling snow accumulation with a geographic information system , 2000, Int. J. Geogr. Inf. Sci..

[33]  Vidroha Debroy,et al.  Genetic Programming , 1998, Lecture Notes in Computer Science.

[34]  Steven D. Glaser,et al.  Sensor placement strategies for snow water equivalent (SWE) estimation in the American River basin , 2013 .

[35]  Trent McConaghy,et al.  FFX: Fast, Scalable, Deterministic Symbolic Regression Technology , 2011 .

[36]  R. Stouffer,et al.  Stationarity Is Dead: Whither Water Management? , 2008, Science.

[37]  J. Dozier Mountain hydrology, snow color, and the fourth paradigm , 2011 .

[38]  C. David Moeser,et al.  Development, Analysis and Use of a Distributed Wireless Sensor Network for Quantifying Spatial Trends of Snow Depth and Snow Water Equivalence Around Meteorological Stations With and Without Snow Sensing Equipment , 2010 .

[39]  Thomas H. Painter,et al.  MULTISPECTRAL AND HYPERSPECTRAL REMOTE SENSING OF ALPINE SNOW PROPERTIES , 2004 .

[40]  B. Ostendorf,et al.  GIS-based modelling of spatial pattern of snow cover duration in an alpine area , 2001 .

[41]  D. F. Berisford,et al.  NASA Airborne Snow Observatory: Measuring Spatial Distribution of Snow Water Equivalent and Snow Albedo , 2015 .

[42]  Michael Lehning,et al.  Persistence in intra‐annual snow depth distribution: 1. Measurements and topographic control , 2011 .

[43]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[44]  Chris Derksen,et al.  Estimating Snow Water Equivalent Using Snow Depth Data and Climate Classes , 2010 .

[45]  L. A. Logan Basin-wide water equivalent estimation from snowpack depth measurements , .

[46]  M. Lehning,et al.  Seasonal small‐scale spatial variability in alpine snowfall and snow accumulation , 2013 .

[47]  Kelly Elder,et al.  Spatial Snow Modeling of Wind-Redistributed Snow Using Terrain-Based Parameters , 2002 .

[48]  J. Frolik,et al.  APPLICATION OF A WIRELESS SENSOR NETWORK FOR DISTRIBUTED SNOW WATER EQUIVALENCE ESTIMATION , 2012 .

[49]  Michael Lehning,et al.  Evaluation of modelled snow depth and snow water equivalent at three contrasting sites in Switzerland using SNOWPACK simulations driven by different meteorological data input , 2014 .

[50]  Karl Rittger,et al.  Spatial estimates of snow water equivalent in the Sierra Nevada , 2012 .

[51]  Robert J. Gurney,et al.  An efficient method for distributing wind speeds over heterogeneous terrain , 2009 .

[52]  Una-May O'Reilly,et al.  Computational complexity analysis of simple genetic programming on two problems modeling isolated program semantics , 2010, FOGA '11.

[53]  Roger C. Bales,et al.  Snow water equivalent interpolation for the Colorado River Basin from snow telemetry (SNOTEL) data , 2003 .

[54]  P. Bartelt,et al.  A physical SNOWPACK model for the Swiss avalanche warning: Part I: numerical model , 2002 .

[55]  Roger C. Bales,et al.  Scaling snow observations from the point to the grid element: Implications for observation network design , 2005 .

[56]  H. Tabari,et al.  Predicting Spatial Distribution of Snow Water Equivalent Using Multivariate Non-linear Regression and Computational Intelligence Methods , 2011 .

[57]  James B. Domingo,et al.  A spatially distributed energy balance snowmelt model for application in mountain basins , 1999 .

[58]  Kelly Elder,et al.  Comparison of spatial interpolation methods for estimating snow distribution in the Colorado Rocky Mountains , 2002 .

[59]  Indraneel Das On characterizing the “knee” of the Pareto curve based on Normal-Boundary Intersection , 1999 .

[60]  Kelly Elder,et al.  Estimating the spatial distribution of snow water equivalence in a montane watershed , 1998 .

[61]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[62]  Jason H. Moore,et al.  Genetic Programming Theory and Practice IX , 2011 .

[63]  Hossein Tabari,et al.  Comparison of artificial neural network and combined models in estimating spatial distribution of snow depth and snow water equivalent in Samsami basin of Iran , 2010, Neural Computing and Applications.