Analyzing collective motion with machine learning and topology

We use topological data analysis and machine learning to study a seminal model of collective motion in biology [M. R. D’Orsogna et al., Phys. Rev. Lett. 96, 104302 (2006)]. This model describes agents interacting nonlinearly via attractive-repulsive social forces and gives rise to collective behaviors such as flocking and milling. To classify the emergent collective motion in a large library of numerical simulations and to recover model parameters from the simulation data, we apply machine learning techniques to two different types of input. First, we input time series of order parameters traditionally used in studies of collective motion. Second, we input measures based on topology that summarize the time-varying persistent homology of simulation data over multiple scales. This topological approach does not require prior knowledge of the expected patterns. For both unsupervised and supervised machine learning methods, the topological approach outperforms the one that is based on traditional order parameters.

[1]  Janet E. Jones On the Determination of Molecular Fields. I. From the Variation of the Viscosity of a Gas with Temperature , 1924 .

[2]  H. Banks,et al.  Parameter estimation techniques for transport equations with application to population dispersal and tissue bulk flow models , 1983, Journal of mathematical biology.

[3]  Ian T. Jolliffe,et al.  Principal Component Analysis , 1986, Springer Series in Statistics.

[4]  A. Czirók,et al.  Collective Motion , 1999, physics/9902023.

[5]  L. Edelstein-Keshet,et al.  Complexity, pattern, and evolutionary trade-offs in animal aggregation. , 1999, Science.

[6]  W. Rappel,et al.  Self-organization in systems of self-propelled particles. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  I. Couzin,et al.  Collective memory and spatial sorting in animal groups. , 2002, Journal of theoretical biology.

[8]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[9]  I. Couzin,et al.  Effective leadership and decision-making in animal groups on the move , 2005, Nature.

[10]  J. Vincent,et al.  Biomimetics: its practice and theory , 2006, Journal of The Royal Society Interface.

[11]  A. Bertozzi,et al.  Self-propelled particles with soft-core interactions: patterns, stability, and collapse. , 2006, Physical review letters.

[12]  A. Bertozzi,et al.  State Transitions and the Continuum Limit for a 2D Interacting, Self-Propelled Particle System , 2006, nlin/0606031.

[13]  David Cohen-Steiner,et al.  Stability of Persistence Diagrams , 2007, Discret. Comput. Geom..

[14]  Irene Giardina,et al.  Collective behavior in animal groups: Theoretical models and empirical studies , 2008, HFSP journal.

[15]  M. Aldana,et al.  New tools for characterizing swarming systems: A comparison of minimal models , 2008 .

[16]  Bharat Bhushan,et al.  Biomimetics: lessons from nature–an overview , 2009, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[17]  D. Levy,et al.  Particle, kinetic and fluid models for phototaxis , 2009 .

[18]  T. Vicsek,et al.  Collective Motion , 1999, physics/9902023.

[19]  Andrew M. Stuart,et al.  Inverse problems: A Bayesian perspective , 2010, Acta Numerica.

[20]  D. Sumpter Collective Animal Behavior , 2010 .

[21]  Leah Edelstein-Keshet,et al.  Inferring individual rules from collective behavior , 2010, Proceedings of the National Academy of Sciences.

[22]  G. Carlsson,et al.  Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival , 2011, Proceedings of the National Academy of Sciences.

[23]  Moo K. Chung,et al.  Topological Data Analysis , 2012 .

[24]  Frédéric Chazal,et al.  Geometric Inference for Probability Measures , 2011, Found. Comput. Math..

[25]  V. Isaeva Self-organization in biological systems , 2012, Biology Bulletin.

[26]  T. Kolokolnikov,et al.  A minimal model of predator–swarm interactions , 2014, Journal of The Royal Society Interface.

[27]  Sivaraman Balakrishnan,et al.  Confidence sets for persistence diagrams , 2013, The Annals of Statistics.

[28]  Complex contagions for topological data analysis of networks , 2014, 1408.1168.

[29]  N. Ouellette,et al.  Velocity correlations in laboratory insect swarms , 2015 .

[30]  Adam R Ferguson,et al.  Topological data analysis for discovery in preclinical spinal cord injury and traumatic brain injury , 2015, Nature Communications.

[31]  E. Pastalkova,et al.  Clique topology reveals intrinsic geometric structure in neural correlations , 2015, Proceedings of the National Academy of Sciences.

[32]  Tom Halverson,et al.  Topological Data Analysis of Biological Aggregation Models , 2014, PloS one.

[33]  T. Chou,et al.  Swarming in viscous fluids: Three-dimensional patterns in swimmer- and force-induced flows. , 2016, Physical review. E.

[34]  L. Wasserman Topological Data Analysis , 2016, 1609.08227.

[35]  Frédéric Chazal,et al.  Robust Topological Inference: Distance To a Measure and Kernel Distance , 2014, J. Mach. Learn. Res..

[36]  Mason A. Porter,et al.  A roadmap for the computation of persistent homology , 2015, EPJ Data Science.

[37]  P. Degond,et al.  An age-structured continuum model for myxobacteria , 2017, Mathematical Models and Methods in Applied Sciences.

[38]  Alex Mogilner,et al.  Mechanical positioning of multiple nuclei in muscle cells , 2018, PLoS Comput. Biol..

[39]  Christopher J. Tralie,et al.  Ripser.py: A Lean Persistent Homology Library for Python , 2018, J. Open Source Softw..

[40]  Pierre Degond,et al.  Quaternions in Collective Dynamics , 2017, Multiscale Model. Simul..

[41]  R. Baker,et al.  Mechanistic models versus machine learning, a fight worth fighting for the biological community? , 2018, Biology Letters.

[42]  Ming Zhong,et al.  Nonparametric inference of interaction laws in systems of agents from trajectory data , 2018, Proceedings of the National Academy of Sciences.

[43]  Lori Ziegelmeier,et al.  A topological approach to selecting models of biological experiments , 2019, PloS one.

[44]  Robert M. Haralick,et al.  Topological Structure of Linear Manifold Clustering , 2019, MLDM.