Topological data analysis and machine learning for recognizing atmospheric river patterns in large climate datasets

Abstract. Identifying weather patterns that frequently lead to extreme weather events is a crucial first step in understanding how they may vary under different climate change scenarios. Here, we propose an automated method for recognizing atmospheric rivers (ARs) in climate data using topological data analysis and machine learning. The method provides useful information about topological features (shape characteristics) and statistics of ARs. We illustrate this method by applying it to outputs of version 5.1 of the Community Atmosphere Model version 5.1 (CAM5.1) and the reanalysis product of the second Modern-Era Retrospective Analysis for Research and Applications (MERRA-2). An advantage of the proposed method is that it is threshold-free – there is no need to determine any threshold criteria for the detection method – when the spatial resolution of the climate model changes. Hence, this method may be useful in evaluating model biases in calculating AR statistics. Further, the method can be applied to different climate scenarios without tuning since it does not rely on threshold conditions. We show that the method is suitable for rapidly analyzing large amounts of climate model and reanalysis output data.

[1]  Prabhat,et al.  Application of Deep Convolutional Neural Networks for Detecting Extreme Weather in Climate Datasets , 2016, ArXiv.

[2]  Yongti Zhu,et al.  Tropospheric rivers? – A pilot study , 1992 .

[3]  S. Schubert,et al.  MERRA: NASA’s Modern-Era Retrospective Analysis for Research and Applications , 2011 .

[4]  Herbert Edelsbrunner,et al.  Computational Topology - an Introduction , 2009 .

[5]  Vitaliy Kurlin A fast persistence-based segmentation of noisy 2D clouds with provable guarantees , 2016, Pattern Recognit. Lett..

[6]  Surendra Byna,et al.  TECA: Petascale Pattern Recognition for Climate Science , 2015, CAIP.

[7]  Soroosh Sorooshian,et al.  Genesis, Pathways, and Terminations of Intense Global Water Vapor Transport in Association with Large-Scale Climate Patterns , 2017 .

[8]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[9]  S. Coast,et al.  Atmospheric Rivers as Drought Busters on the U . , 2013 .

[10]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[11]  Eric J. Fetzer,et al.  Extreme snowfall events linked to atmospheric rivers and surface air temperature via satellite measurements , 2010 .

[12]  Robert E. Tarjan,et al.  Efficiency of a Good But Not Linear Set Union Algorithm , 1972, JACM.

[13]  Paul A. Ullrich,et al.  TempestExtremes: a framework for scale-insensitive pointwise feature tracking on unstructured grids , 2016 .

[14]  M. Dettinger,et al.  Storms, floods, and the science of atmospheric rivers , 2011 .

[15]  Prabhat,et al.  The effect of horizontal resolution on simulation quality in the Community Atmospheric Model, CAM5.1 , 2014 .

[16]  G. Villarini,et al.  The nexus between atmospheric rivers and extreme precipitation across Europe , 2013 .

[17]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[18]  G. Wick,et al.  Satellite and CALJET Aircraft Observations of Atmospheric Rivers over the Eastern North Pacific Ocean during the Winter of 1997/98 , 2004 .

[19]  Herbert Edelsbrunner,et al.  Persistent Homology: Theory and Practice , 2013 .

[20]  Yongti Zhu,et al.  Atmospheric rivers and bombs , 1994 .

[21]  Yongti Zhu,et al.  Tropospheric rivers: A one‐year record and a possible application to ice core data , 1994 .

[22]  R. Trigo,et al.  Natural Hazards and Earth System Sciences The 20 February 2010 Madeira flashfloods : synoptic analysis and extreme rainfall assessment , 2012 .

[23]  Vitaliy Kurlin,et al.  A one‐dimensional homologically persistent skeleton of an unstructured point cloud in any metric space , 2015, SGP '15.

[24]  Daniel Walton,et al.  Atmospheric River Tracking Method Intercomparison Project (ARTMIP): project goals and experimental design , 2018, Geoscientific Model Development.

[25]  M. Dettinger,et al.  The coming megafloods. , 2012, Scientific American.

[26]  M. Dettinger Atmospheric Rivers as Drought Busters on the U.S. West Coast , 2013 .

[27]  Prabhat,et al.  ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events , 2016, NIPS.

[28]  M. Dettinger,et al.  Atmospheric Rivers, Floods and the Water Resources of California , 2011 .

[29]  Yong Zhu,et al.  A Proposed Algorithm for Moisture Fluxes from Atmospheric Rivers , 1998 .

[30]  Fernando Nogueira,et al.  Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning , 2016, J. Mach. Learn. Res..

[31]  Gunnar E. Carlsson,et al.  Topology and data , 2009 .

[32]  Jeffrey D. Ullman,et al.  Set Merging Algorithms , 1973, SIAM J. Comput..

[33]  Michael D. Dettinger,et al.  CalWater Field Studies Designed to Quantify the Roles of Atmospheric Rivers and Aerosols in Modulating U.S. West Coast Precipitation in a Changing Climate , 2016 .

[34]  Miroslav Kubat,et al.  An Introduction to Machine Learning , 2015, Springer International Publishing.

[35]  Bin Zhao,et al.  The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2). , 2017, Journal of climate.

[36]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[37]  R. Ghrist Barcodes: The persistent topology of data , 2007 .

[38]  Gunnar E. Carlsson,et al.  Topological pattern recognition for point cloud data* , 2014, Acta Numerica.

[39]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.