DisCo: Physics-Based Unsupervised Discovery of Coherent Structures in Spatiotemporal Systems

Extracting actionable insight from complex unlabeled scientific data is an open challenge and key to unlocking data-driven discovery in science. Complementary and alternative to supervised machine learning approaches, unsupervised physics-based methods based on behavior-driven theories hold great promise. Due to computational limitations, practical application on real-world domain science problems has lagged far behind theoretical development. However, powerful modern supercomputers provide the opportunity to narrow the gap between theory and practical application. We present our first step towards bridging this divide - DisCo - a high-performance distributed workflow for the behavior-driven local causal state theory. DisCo provides a scalable unsupervised physics-based representation learning method that decomposes spatiotemporal systems into their structurally relevant components, which are captured by the latent local causal state variables. In several firsts we demonstrate the efficacy of DisCo in capturing physically meaningful coherent structures from observational and simulated scientific data. To the best of our knowledge, DisCo is also the first application software developed entirely in Python to scale to over 1000 machine nodes, providing good performance along with ensuring domain scientists' productivity. Our capstone experiment, using newly developed and optimized DisCo workflow and libraries, performs unsupervised spacetime segmentation analysis of CAM5.1 climate simulation data, processing an unprecedented 89.5 TB in 6.6 minutes end-to-end using 1024 Intel Haswell nodes on the Cori supercomputer obtaining 91% weak-scaling and 64% strong-scaling efficiency. This enables us to achieve state-of-the-art unsupervised segmentation of coherent spatiotemporal structures in complex fluid flows.

[1]  Gerik Scheuermann,et al.  Multifield visualization using local statistical complexity , 2007, IEEE Transactions on Visualization and Computer Graphics.

[2]  Wei-keng Liao,et al.  Scalable parallel OPTICS data clustering using graph algorithmic techniques , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[3]  Cosma Rohilla Shalizi Optimal Nonlinear Prediction of Random Fields on Networks , 2003, DMCS.

[4]  Matthias Nießner,et al.  Spherical CNNs on Unstructured Grids , 2019, ICLR.

[5]  Prabhat,et al.  The effect of horizontal resolution on simulation quality in the Community Atmospheric Model, CAM5.1 , 2014 .

[6]  Georg M. Goerg,et al.  LICORS: Light Cone Reconstruction of States for Non-parametric Forecasting of Spatio-Temporal Systems , 2012, 1206.2398.

[7]  Karthik Kashinath,et al.  Segmenting and Tracking Extreme Climate Events using Neural Networks , 2017 .

[8]  Ludovic C. Gillet,et al.  Mass Spectrometry Applied to Bottom-Up Proteomics: Entering the High-Throughput Era for Hypothesis Testing. , 2016, Annual review of analytical chemistry.

[9]  J Anthony Movshon,et al.  Putting big data to good use in neuroscience , 2014, Nature Neuroscience.

[10]  P. Anderson More is different. , 1972, Science.

[11]  Hans-Peter Kriegel,et al.  A survey on unsupervised outlier detection in high‐dimensional numerical data , 2012, Stat. Anal. Data Min..

[12]  James H. Faghmous,et al.  A Big Data Guide to Understanding Climate Change: The Case for Theory-Guided Data Science , 2014, Big Data.

[13]  Matthias Troyer,et al.  Solving the quantum many-body problem with artificial neural networks , 2016, Science.

[14]  Concha Bielza,et al.  Machine Learning in Bioinformatics , 2008, Encyclopedia of Database Systems.

[15]  M. Farazmand An adjoint-based approach for finding invariant solutions of Navier–Stokes equations , 2015, Journal of Fluid Mechanics.

[16]  K. Emanuel The dependence of hurricane intensity on climate , 1987, Nature.

[17]  Prabhat,et al.  Exascale Deep Learning for Climate Analytics , 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.

[18]  Steven L. Brunton,et al.  On dynamic mode decomposition: Theory and applications , 2013, 1312.0041.

[19]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[20]  Liao Wei-keng,et al.  Evaluation of K-means data clustering algorithm on Intel Xeon Phi , 2016 .

[21]  Hans-Peter Kriegel,et al.  A Fast Parallel Clustering Algorithm for Large Spatial Databases , 1999, Data Mining and Knowledge Discovery.

[22]  Julian Borrill,et al.  Python in the NERSC Exascale Science Applications Program for Data , 2017, PyHPC@SC.

[23]  R. Marimont,et al.  Nearest Neighbour Searches and the Curse of Dimensionality , 1979 .

[24]  Mohammad Farazmand,et al.  A critical comparison of Lagrangian methods for coherent structure detection. , 2017, Chaos.

[25]  I. Jolliffe Principal Component Analysis , 2002 .

[26]  Morris Riedel,et al.  HPDBSCAN: highly parallel DBSCAN , 2015, MLHPC@SC.

[27]  Byunghan Lee,et al.  Deep learning in bioinformatics , 2016, Briefings Bioinform..

[28]  J. Overpeck,et al.  Climate Data Challenges in the 21st Century , 2011, Science.

[29]  Surendra Byna,et al.  TECA: A Parallel Toolkit for Extreme Climate Analysis , 2012, ICCS.

[30]  George Haller,et al.  Geodesic Transport Barriers in Jupiter's Atmosphere: A Video-Based Analysis , 2014, SIAM Rev..

[31]  J. Crutchfield Between order and chaos , 2011, Nature Physics.

[32]  Daniel Walton,et al.  Atmospheric River Tracking Method Intercomparison Project (ARTMIP): project goals and experimental design , 2018, Geoscientific Model Development.

[33]  Gordon Bell,et al.  Beyond the Data Deluge , 2009, Science.

[34]  Leland McInnes,et al.  hdbscan: Hierarchical density based clustering , 2017, J. Open Source Softw..

[35]  Joachim Denzler,et al.  Deep learning and process understanding for data-driven Earth system science , 2019, Nature.

[36]  P. Holmes,et al.  Turbulence, Coherent Structures, Dynamical Systems and Symmetry , 1996 .

[37]  Jürgen Kurths,et al.  Identifying causal gateways and mediators in complex spatio-temporal systems , 2015, Nature Communications.

[38]  Nicola Jones,et al.  How machine learning could help to improve climate forecasts , 2017, Nature.

[39]  Prabhat,et al.  Deep Neural Networks for Physics Analysis on low-level whole-detector data at the LHC , 2017, Journal of Physics: Conference Series.

[40]  Vedika Khemani,et al.  Machine Learning Out-of-Equilibrium Phases of Matter. , 2017, Physical review letters.

[41]  A. Woeikof Tropical cyclones. , 1884, Science.

[42]  James P. Crutchfield,et al.  The dreams of theory , 2014 .

[43]  E Mjolsness,et al.  Machine learning for science: state of the art and future prospects. , 2001, Science.

[44]  Wei Chu,et al.  PS-DBSCAN: An Efficient Parallel DBSCAN Algorithm Based on Platform Of AI (PAI) , 2017, ArXiv.

[45]  Klaus Sutner,et al.  Computation theory of cellular automata , 1998 .

[46]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[47]  George Sugihara,et al.  Equation-free mechanistic ecosystem forecasting using empirical dynamic modeling , 2015, Proceedings of the National Academy of Sciences.

[48]  Satish Karra,et al.  Unsupervised Machine Learning Based on Non-Negative Tensor Factorization for Analyzing Reactive-Mixing , 2018, J. Comput. Phys..

[49]  P. Grassberger Toward a quantitative theory of self-generated complexity , 1986 .

[50]  C. Moore,et al.  Automatic filters for the detection of coherent structure in spatiotemporal systems. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[51]  Michael F. Wehner,et al.  Towards Direct Simulation of Future Tropical Cyclone Statistics in a High-Resolution Global Atmospheric Model , 2010 .

[52]  Brenden P. Epps Review of Vortex Identification Methods , 2017 .

[53]  Max Welling,et al.  Gauge Equivariant Convolutional Networks and the Icosahedral CNN 1 , 2019 .

[54]  James P. Crutchfield,et al.  Computational Mechanics: Pattern and Prediction, Structure and Simplicity , 1999, ArXiv.

[55]  Surendra Byna,et al.  BD-CATS: big data clustering at trillion particle scale , 2015, SC15: International Conference for High Performance Computing, Networking, Storage and Analysis.

[56]  Hector Zenil,et al.  Causal deconvolution by algorithmic generative models , 2019, Nature Machine Intelligence.

[57]  G. Haller,et al.  Defining coherent vortices objectively from the vorticity , 2015, Journal of Fluid Mechanics.

[58]  G. Haller Lagrangian Coherent Structures , 2015 .

[59]  Prabhat,et al.  CosmoFlow: Using Deep Learning to Learn the Universe at Scale , 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.

[60]  P. Webster,et al.  Changes in Tropical Cyclone Number, Duration, and Intensity in a Warming Environment , 2005, Science.

[61]  Daniel W. Davies,et al.  Machine learning for molecular and materials science , 2018, Nature.

[62]  Guangwen Yang,et al.  Large-Scale Hierarchical k-means for Heterogeneous Many-Core Supercomputers , 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.

[63]  Adam Rupe,et al.  Spacetime Symmetries, Invariant Sets, and Additive Subdynamics of Cellular Automata , 2018, ArXiv.

[64]  Alexis Tantet,et al.  An early warning indicator for atmospheric blocking events using transfer operators. , 2015, Chaos.

[65]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[66]  Clarence W. Rowley,et al.  A Data–Driven Approximation of the Koopman Operator: Extending Dynamic Mode Decomposition , 2014, Journal of Nonlinear Science.

[67]  Adam Rupe,et al.  Local Causal States and Discrete Coherent Structures , 2018, Chaos.

[68]  Cosma Rohilla Shalizi,et al.  The LICORS Cabinet: Nonparametric Algorithms for Spatio-temporal Prediction , 2015, ArXiv.