A Framework For Hydroclimate Prediction and Discovery Using Object-Oriented Data

Author(s): Sellars, Scott | Advisor(s): Sorooshian, Soroosh; Gao, Xiaogang | Abstract: This thesis introduces a new object-oriented precipitation data set and explores statistical methods that can be used for predicting monthly precipitation and discovering the impact of climate variability on precipitation. The object-oriented data set consists of segmented, near global, satellite precipitation data characterized into four-dimensional (4D) objects (longitude, latitude, time and intensity). We use the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks (PERSIANN) .25-degree dataset, which covers from 60N to 60S and from March 1st, 2000 to January 1st, 2011 as our source data. This data set is the called PERSIANN-CONNected objECT (CONNECT) and is stored in a PostgreSQL database. Using this novel data set we propose a prediction and discovery framework that 1) empirically studies the monthly precipitation systems, 2) builds accurate prediction models, and 3) estimates the relevance of the features included in a data matrix of attributes. We use four machine learning models, 1) Lasso, 2) Elastic Net, 3) Gradient Boosting Trees, and 4) Extremely Random Trees, combined with model validation, using a leave one out (LOO) prediction strategy and confidence estimation using bootstrap resampling that is applied to a precipitation prediction problem. Our case study focuses on a subset population of 626 Western U.S. precipitation systems. The study shows the joint interactions of the selected climate phenomena: 1) Arctic Oscillation (AO), 2) El Nino Southern Oscillation (ENSO) and 3) Madden Julian Oscillation (MJO) on these 626 precipitation systems by analyzing the increased/decreased likelihood of having precipitation systems occurring over the Western U.S. In addition, this dissertation finds that the machine learning methods produce accurate monthly precipitation frequency predictions, comparable to climatology at different monthly lead times and identify relevant features that correspond to interacting modes of climate, such as the Western Hemisphere Warm Pool (WHWP), Atlantic Meridional Mode Sea Surface Temperatures (AMMSST), North Pacific Index (NP) and the South West Monsoon Index (SWMONSOON) leading to alternate physical explanations of Western U.S. precipitation variability. Given the importance of monthly prediction in water resource planning and management, this framework provides an approach to understanding Western U.S. precipitation, and even more importantly, an approach that can be applicable to study precipitation around the world.

[1]  Daniel R. Cayan,et al.  ENSO and Hydrologic Extremes in the Western United States , 1999 .

[2]  Padhraic Smyth,et al.  Cluster Analysis of Typhoon Tracks. Part I: General Properties , 2007 .

[3]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[4]  J. Wallace,et al.  Annular Modes in the Extratropical Circulation. Part I: Month-to-Month Variability* , 2000 .

[5]  R. Stull An Introduction to Boundary Layer Meteorology , 1988 .

[6]  Duane E. Waliser,et al.  Predictability and forecasting , 2012 .

[7]  J. Janowiak,et al.  CMORPH: A Method that Produces Global Precipitation Estimates from Passive Microwave and Infrared Data at High Spatial and Temporal Resolution , 2004 .

[8]  D. Stephenson,et al.  How useful are teleconnection patterns for explaining variability in extratropical storminess , 2007 .

[9]  Object-Oriented Analysis Computational Earth Science: Big Data Transformed Into Insight , 2013 .

[10]  K. Emanuel,et al.  The poleward migration of the location of tropical cyclone maximum intensity , 2014, Nature.

[11]  E. Fetzer,et al.  Does the Madden–Julian Oscillation Influence Wintertime Atmospheric Rivers and Snowpack in the Sierra Nevada? , 2012 .

[12]  James W. Hurrell,et al.  Decadal atmosphere-ocean variations in the Pacific , 1994 .

[13]  Roy W. Koch,et al.  Surface Climate and Streamflow Variability in the Western United States and Their Relationship to Large‐Scale Circulation Indices , 1991 .

[14]  Kathleen D. White,et al.  Climate Change and Water Resources Management: A Federal Perspective , 2009 .

[15]  R. Sutton,et al.  Atlantic Ocean Forcing of North American and European Summer Climate , 2005, Science.

[16]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[17]  Edward R. Cook,et al.  North American drought: Reconstructions, causes, and consequences , 2007 .

[18]  D. Enfield,et al.  A Further Study of the Tropical Western Hemisphere Warm Pool , 2003 .

[19]  J. Janowiak,et al.  The Version 2 Global Precipitation Climatology Project (GPCP) Monthly Precipitation Analysis (1979-Present) , 2003 .

[20]  G. McCabe,et al.  DECADAL VARIATIONS IN THE STRENGTH OF ENSO TELECONNECTIONS WITH PRECIPITATION IN THE WESTERN UNITED STATES , 1999 .

[21]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[22]  Thomas C. Piechota,et al.  Drought and Regional Hydrologic Variation in the United States: Associations with the El Niño-Southern Oscillation , 1996 .

[23]  B. Brown,et al.  The Method for Object-Based Diagnostic Evaluation (MODE) Applied to Numerical Forecasts from the 2005 NSSL/SPC Spring Program , 2009 .

[24]  J. Namias,et al.  Large-Scale Air-Sea Interactions and Short-Period Climatic Fluctuatioins , 1981, Science.

[25]  Mark A. Cane,et al.  The evolution of El Nino, past and future , 2005 .

[26]  Padhraic Smyth,et al.  Probabilistic clustering of extratropical cyclones using regression mixture models , 2007 .

[27]  Ercan Kahya,et al.  The relationships between U.S. streamflow and La Niña Events , 1994 .

[28]  Malcolm K. Hughes,et al.  Long-Term Variability in the El Niño/Southern Oscillation and Associated Teleconnections , 2000 .

[29]  S. Sorooshian,et al.  Extreme Natural Hazards, Disaster Risks and Societal Implications: Satellite-based remote sensing estimation of precipitation for early warning systems , 2014 .

[30]  B. Brown,et al.  Object-Based Verification of Precipitation Forecasts. Part II: Application to Convective Rain Systems , 2006 .

[31]  Marion Mittermaier,et al.  Using MODE to explore the spatial and temporal characteristics of cloud cover forecasts from high‐resolution NWP models , 2013 .

[32]  A. Barnston,et al.  Clustering of eastern North Pacific tropical cyclone tracks: ENSO and MJO effects , 2008 .

[33]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[34]  M. Dettinger Atmospheric Rivers as Drought Busters on the U.S. West Coast , 2013 .

[35]  Y. Hong,et al.  The TRMM Multisatellite Precipitation Analysis (TMPA): Quasi-Global, Multiyear, Combined-Sensor Precipitation Estimates at Fine Scales , 2007 .

[36]  S. Sorooshian,et al.  CONFIDENCE BUILDERS Evaluating Seasonal Climate Forecasts from User Perspectives , 2002 .

[37]  D. Wilks Review of Probability , 2019, Statistical Methods in the Atmospheric Sciences.

[38]  Casey Brown,et al.  Managing Climate Risk in Water Supply Systems , 2013 .

[39]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[40]  Minghua Zhang,et al.  Climate Models: An Assessment of Strengths and Limitations , 2008 .

[41]  Michael D. Dettinger,et al.  Primary Modes and Predictability of Year-to-Year Snowpack Variations in the Western United States from Teleconnections with Pacific Ocean Climate , 2002 .

[42]  David B. Stephenson,et al.  Serial Clustering of Extratropical Cyclones , 2006 .

[43]  M. Wheeler,et al.  An All-Season Real-Time Multivariate MJO Index: Development of an Index for Monitoring and Prediction , 2004 .

[44]  Nagiza F. Samatova,et al.  Data Guided Discovery of Dynamic Climate Dipoles , 2011, CIDU.

[45]  D. Vimont,et al.  Analogous Pacific and Atlantic Meridional Modes of Tropical Atmosphere-Ocean Variability* , 2004 .

[46]  J. Garbrecht,et al.  Integration of Climate Information and Forecasts Into Western US Water Supply Forecasts , 2005 .

[47]  D. Stephenson,et al.  Storm track signature in total ozone during northern hemisphere winter , 1998 .

[48]  R. Reynolds,et al.  The NCEP/NCAR 40-Year Reanalysis Project , 1996, Renewable Energy.

[49]  Mojib Latif,et al.  A review of ENSO prediction studies , 1994 .

[50]  E. Fetzer,et al.  The 2010/2011 snow season in California's Sierra Nevada: Role of atmospheric rivers and modes of large‐scale variability , 2013 .

[51]  Mathew Barlow,et al.  ENSO, Pacific Decadal Variability, and U.S. Summertime Precipitation, Drought, and Stream Flow , 2001 .

[52]  S. Sorooshian,et al.  Evaluation of Official Western U.S. Seasonal Water Supply Outlooks, 1922–2002 , 2004 .

[53]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[54]  Hyemi Kim,et al.  Assessment of MJO Predictability for Boreal Winter with Various Statistical and Dynamical Models , 2010 .

[55]  E. Zipser,et al.  Four Years of Tropical ERA-40 Vorticity Maxima Tracks. Part II: Differences between Developing and Nondeveloping Disturbances , 2009 .

[56]  Suzana J. Camargo,et al.  Climate Modulation of North Atlantic Hurricane Tracks , 2010 .

[57]  A. H. Murphy,et al.  Probability Forecasting in Meteorology , 1984 .

[58]  S. Sorooshian,et al.  Precipitation Estimation from Remotely Sensed Information Using Artificial Neural Networks , 1997 .

[59]  Soroosh Sorooshian,et al.  USING CLIMATE FORECASTS FOR WATER MANAGEMENT: ARIZONA AND THE 1997–1998 EL NIÑO 1 , 2001 .

[60]  N. Mantua,et al.  The Pacific Decadal Oscillation , 2002 .

[61]  Elizabeth E. Ebert,et al.  Toward Better Understanding of the Contiguous Rain Area (CRA) Method for Spatial Forecast Verification , 2009 .

[62]  J. Betancourt,et al.  A tree‐ring based reconstruction of the Atlantic Multidecadal Oscillation since 1567 A.D. , 2004 .

[63]  Eric Gilleland,et al.  Verifying Forecasts Spatially , 2010 .

[64]  J. Namias,et al.  Persistence of North Pacific Sea Surface Temperature and Atmospheric Flow Patterns , 1988 .

[65]  D. Stephenson,et al.  On the frequency of heavy rainfall for the Midwest of the United States , 2011 .

[66]  F. Martin Ralph,et al.  Meteorological Characteristics and Overland Precipitation Impacts of Atmospheric Rivers Affecting the West Coast of North America Based on Eight Years of SSM/I Satellite Observations , 2008 .

[67]  Stanley B. Goldenberg,et al.  Documentation o f a Highly ENSO-Related SST Région in thé Equatorial Pacific , 1997 .

[68]  M. Dettinger,et al.  Atmospheric Rivers, Floods and the Water Resources of California , 2011 .

[69]  Yong Zhu,et al.  A Proposed Algorithm for Moisture Fluxes from Atmospheric Rivers , 1998 .

[70]  Susan Joslyn,et al.  Progress and challenges in forecast verification , 2013 .

[71]  S. Sorooshian,et al.  Geometrical Characterization of Precipitation Patterns , 2011 .

[72]  Alberto M. Mestas-Nuñez,et al.  The Atlantic Multidecadal Oscillation and its relation to rainfall and river flows in the continental U.S. , 2001 .

[73]  S. Sorooshian,et al.  Evaluation of PERSIANN system satellite-based estimates of tropical rainfall , 2000 .

[74]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[75]  Kevin I. Hodges,et al.  A General Method for Tracking Analysis and Its Application to Meteorological Data , 1994 .

[76]  J. David Neelin,et al.  ENSO theory , 1998 .

[77]  P. R. Julian,et al.  Observations of the 40-50-day tropical oscillation - a review , 1994 .

[78]  J. Janowiak,et al.  A Gridded Hourly Precipitation Data Base for the United States , 1996 .

[79]  Michael A. Palecki,et al.  Pacific and Atlantic Ocean influences on multidecadal drought frequency in the United States , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[80]  W. Briggs Statistical Methods in the Atmospheric Sciences , 2007 .

[81]  Kuolin Hsu,et al.  Satellites Track Precipitation of Super Typhoon Haiyan , 2014 .

[82]  James Correia,et al.  Forecasting Tornado Pathlengths Using a Three-Dimensional Object Identification Algorithm Applied to Convection-Allowing Forecasts , 2012 .

[83]  Barbara G. Brown,et al.  New verification approaches for convective weather forecasts , 2004 .

[84]  Tony Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery , 2009 .