Understanding Dominant Factors for Precipitation over the Great Lakes Region

Statistical modeling of local precipitation involves understanding local, regional and global factors informative of precipitation variability in a region. Modern machine learning methods for feature selection can potentially be explored for identifying statistically significant features from pool of potential predictors of precipitation. In this work, we consider sparse regression, which simultaneously performs feature selection and regression, followed by random permutation tests for selecting dominant factors. We consider average winter precipitation over Great Lakes Region in order to identify its dominant influencing factors. Experiments show that global climate indices, computed at different temporal lags, offer predictive information for winter precipitation. Further, among the dominant factors identified using randomized permutation tests, multiple climate indices indicate the influence of geopotential height patterns on winter precipitation. Using composite analysis, we illustrate that certain patterns are indeed typical in high and low precipitation years, and offer plausible scientific reasons for variations in precipitation. Thus, feature selection methods can be useful in identifying influential climate processes and variables, and thereby provide useful hypotheses over physical mechanisms affecting local precipitation.

[1]  Subimal Ghosh,et al.  Statistical downscaling of GCM simulations to streamflow using relevance vector machine , 2008 .

[2]  Andrew Sturman,et al.  El Niño Southern Oscillation and Climatic Variability , 1998 .

[3]  Jeff S. Waldstreicher,et al.  Winter Weather Forecasting throughout the Eastern United States. Part I: An Overview , 1995 .

[4]  Larry M. McMillin,et al.  Adjusting Satellite Radiances by Regression with an Orthogonal Transformation to a Prior Estimate , 1989 .

[5]  F. Giorgi,et al.  Reduction of future monsoon precipitation over China: comparison between a high resolution RCM simulation and the driving GCM , 2008 .

[6]  Jen-Ping Chen,et al.  Temperature dependence of global precipitation extremes , 2009 .

[7]  Nagiza F. Samatova,et al.  Toward Data-driven, Semi-automatic Inference of Phenomenological Physical Models: Application to Eastern Sahel Rainfall , 2012, SDM.

[8]  D. Randall,et al.  Climate models and their evaluation , 2007 .

[9]  A. Mysterud,et al.  Review article. Studying climate effects on ecology through the use of climate indices: the North Atlantic Oscillation, El Niño Southern Oscillation and beyond , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[10]  Taha B. M. J. Ouarda,et al.  Automated regression-based statistical downscaling tool , 2008, Environ. Model. Softw..

[11]  Timothy DelSole,et al.  A Bayesian Framework for Multimodel Regression , 2007 .

[12]  John F. B. Mitchell,et al.  The next generation of scenarios for climate change research and assessment , 2010, Nature.

[13]  Xiaolan L. Wang,et al.  Downscaling GCM information to regional scales: a non-parametric multivariate regression approach , 1995 .

[14]  Vidyashankar Sivakumar,et al.  Estimation with Norm Regularization , 2014, NIPS.

[15]  Wenhao Yu,et al.  Supplementary material , 2015 .

[16]  Jeff S. Waldstreicher,et al.  Winter Weather Forecasting throughout the Eastern United States. Part IV: Lake Effect Snow , 1995 .

[17]  Debasish Das Bayesian sparse regression with application to data-driven understanding of climate , 2015 .

[18]  Jieping Ye,et al.  Moreau-Yosida Regularization for Grouped Tree Structure Learning , 2010, NIPS.

[19]  Gemma C. Garriga,et al.  Permutation Tests for Studying Classifier Performance , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[20]  Thomas Karl,et al.  Regional and national monthly, seasonal, and annual temperature weighted by area, 1895-1983 , 1984 .

[21]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[22]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[23]  D. Chambers,et al.  Anomalous warming in the Indian Ocean coincident with El Niño , 1999 .

[24]  David D. Parrish,et al.  NORTH AMERICAN REGIONAL REANALYSIS , 2006 .

[25]  Martin J. Wainwright,et al.  A unified framework for high-dimensional analysis of $M$-estimators with decomposable regularizers , 2009, NIPS.

[26]  C. Piani,et al.  Statistical bias correction for daily precipitation in regional climate models over Europe , 2010 .

[27]  Malaquias Peña,et al.  Consolidation of Multimodel Forecasts by Ridge Regression: Application to Pacific Sea Surface Temperature , 2008 .

[28]  Slobodan P. Simonovic,et al.  Inverse flood risk modelling under changing climatic conditions , 2007 .

[29]  Thomas E. Nichols,et al.  Nonparametric permutation tests for functional neuroimaging: A primer with examples , 2002, Human brain mapping.

[30]  Nitesh V. Chawla,et al.  Comparing Predictive Power in Climate Data: Clustering Matters , 2011, SSTD.

[31]  Bin Wang,et al.  Intercomparison of the climatological variations of Asian summer monsoon precipitation simulated by 10 GCMs , 2002 .