Classification of Emerging Extreme Event Tracks in Multivariate Spatio-Temporal Physical Systems Using Dynamic Network Structures: Application to Hurricane Track Prediction

Understanding extreme events, such as hurricanes or forest fires, is of paramount importance because of their adverse impacts on human beings. Such events often propagate in space and time. Predicting--even a few days in advance--what locations will get affected by the event tracks could benefit our society in many ways. Arguably, simulations from first principles, where underlying physics-based models are described by a system of equations, provide least reliable predictions for variables characterizing the dynamics of these extreme events. Data-driven model building has been recently emerging as a complementary approach that could learn the relationships between historically observed or simulated multiple, spatio-temporal ancillary variables and the dynamic behavior of extreme events of interest. While promising, the methodology for predictive learning from such complex data is still in its infancy. In this paper, we propose a dynamic networks-based methodology for in-advance prediction of the dynamic tracks of emerging extreme events. By associating a network model of the system with the known tracks, our method is capable of learning the recurrent network motifs that could be used as discriminatory signatures for the event's behavioral class. When applied to classifying the behavior of the hurricane tracks at their early formation stages inWestern Africa region, our method is able to predict whether hurricane tracks will hit the land of the North Atlantic region at least 10-15 days lead lag time in advance with more than 90% accuracy using 10-fold cross-validation. To the best of our knowledge, no comparable methodology exists for solving this problem using data-driven models

[1]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[2]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[3]  Suzana J. Camargo,et al.  Climate Modulation of North Atlantic Hurricane Tracks , 2010 .

[4]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Tracking Hurricanes , 1966 .

[6]  R. Elsberry,et al.  Tropical Cyclone Track Characteristics as a Function of Large-Scale Circulation Anomalies , 1991 .

[7]  金田 重郎,et al.  C4.5: Programs for Machine Learning (書評) , 1995 .

[8]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[9]  Yoshio Kurihara,et al.  Improvements in the GFDL Hurricane Prediction System , 1995 .

[10]  Edward R. Dougherty,et al.  Is cross-validation valid for small-sample microarray classification? , 2004, Bioinform..

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Padhraic Smyth,et al.  Probabilistic clustering of extratropical cyclones using regression mixture models , 2007 .

[13]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[14]  Shaogang Gong,et al.  Support vector regression and classification based multi-view face detection and recognition , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[15]  Norbert Marwan,et al.  The backbone of the climate network , 2009, 1002.2100.

[16]  Nagiza F. Samatova,et al.  Automatic and statistically robust spatio-temporal detection and tracking of fusion plasma turbulent fronts , 2010 .

[17]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[18]  R. Reynolds,et al.  The NCEP/NCAR 40-Year Reanalysis Project , 1996, Renewable Energy.

[19]  W. Collins,et al.  The NCEP–NCAR 50-Year Reanalysis: Monthly Means CD-ROM and Documentation , 2001 .

[20]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[21]  George Karypis,et al.  Frequent subgraph discovery , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[22]  Paul J. Roebber,et al.  The architecture of the climate network , 2004 .

[23]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[24]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[25]  Upmanu Lall,et al.  Classifying North Atlantic Tropical Cyclone Tracks by Mass Moments , 2009 .

[26]  James B. Elsner,et al.  Spatial Variations in Major U.S. Hurricane Activity: Statistics and a Physical Mechanism , 2000 .