Beyond tracking: using deep learning to discover novel interactions in biological swarms

Most deep-learning frameworks for understanding biological swarms are designed to fit perceptive models of group behavior to individual-level data (e.g., spatial coordinates of identified features of individuals) that have been separately gathered from video observations. Despite considerable advances in automated tracking, these methods are still very expensive or unreliable when tracking large numbers of animals simultaneously. Moreover, this approach assumes that the human-chosen features include sufficient features to explain important patterns in collective behavior. To address these issues, we propose training deep network models to predict system-level states directly from generic graphical features from the entire view, which can be relatively inexpensive to gather in a completely automated fashion. Because the resulting predictive models are not based on human-understood predictors, we use explanatory modules (e.g., Grad-CAM) that combine information hidden in the latent variables of the deep-network model with the video data itself to communicate to a human observer which aspects of observed individual behaviors are most informative in predicting group behavior. This represents an example of augmented intelligence in behavioral ecology – knowledge co-creation in a human–AI team. As proof of concept, we utilize a 20-day video recording of a colony of over 50 Harpegnathos saltator ants to showcase that, without any individual annotations provided, a trained model can generate an “importance map” across the video frames to highlight regions of important behaviors, such as dueling (which the AI has no a priori knowledge of), that play a role in the resolution of reproductive-hierarchy re-formation. Based on the empirical results, we also discuss the potential use and current challenges to further develop the proposed framework as a tool to discover behaviors that have not yet been considered crucial to understand complex social dynamics within biological collectives.

[1]  C. Peeters,et al.  Worker reproduction in the ponerine ant Ophthalmopone berthoudi: an alternative form of eusocial organization , 1985, Behavioral Ecology and Sociobiology.

[2]  Taeyeong Choi,et al.  Identification of Abnormal States in Videos of Ants Undergoing Social Phase Change , 2020, AAAI.

[3]  M. Shah,et al.  Abnormal crowd behavior detection using social force model , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Greg J. Stephens,et al.  Towards Dense Object Tracking in a 2D Honeybee Hive , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[5]  Stephen C. Pratt,et al.  A Simple Behavioral Model Predicts the Emergence of Complex Animal Hierarchies , 2015, The American Naturalist.

[6]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[7]  Matthias Bethge,et al.  Using DeepLabCut for 3D markerless pose estimation across species and behaviors , 2018 .

[8]  Joseph Redmon,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[9]  Mattia G. Bergomi,et al.  idtracker.ai: tracking all individuals in small or large collectives of unmarked animals , 2019, Nature Methods.

[10]  Luc Van Gool,et al.  Temporal Segment Networks: Towards Good Practices for Deep Action Recognition , 2016, ECCV.

[11]  Jacob M. Graving,et al.  DeepPoseKit, a software toolkit for fast and robust animal pose estimation using deep learning , 2019, bioRxiv.

[12]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[13]  B. Hölldobler,et al.  Worker policing limits the number of reproductives in a ponerine ant , 1999, Proceedings of the Royal Society of London. Series B: Biological Sciences.