DeepLabCut: markerless pose estimation of user-defined body parts with deep learning

Quantifying behavior is crucial for many applications in neuroscience. Videography provides easy methods for the observation and recording of animal behavior in diverse settings, yet extracting particular aspects of a behavior for further analysis can be highly time consuming. In motor control studies, humans or other animals are often marked with reflective markers to assist with computer-based tracking, but markers are intrusive, and the number and location of the markers must be determined a priori. Here we present an efficient method for markerless pose estimation based on transfer learning with deep neural networks that achieves excellent results with minimal training data. We demonstrate the versatility of this framework by tracking various body parts in multiple species across a broad collection of behaviors. Remarkably, even when only a small number of frames are labeled (~200), the algorithm achieves excellent tracking performance on test frames that is comparable to human accuracy.Using a deep learning approach to track user-defined body parts during various behaviors across multiple species, the authors show that their toolbox, called DeepLabCut, can achieve human accuracy with only a few hundred frames of training data.

[1]  Gebräuchliche Fertigarzneimittel,et al.  V , 1893, Therapielexikon Neurologie.

[2]  N. A. Bernshteĭn The co-ordination and regulation of movements , 1967 .

[3]  David A. Winter,et al.  Biomechanics and Motor Control of Human Movement , 1990 .

[4]  著者なし 16 , 1871, Animals at the End of the World.

[5]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[6]  Ilan Golani,et al.  SEE: a tool for the visualization and analysis of rodent exploratory behavior , 2001, Neuroscience & Biobehavioral Reviews.

[7]  Branko Ristic,et al.  Beyond the Kalman Filter: Particle Filters for Tracking Applications , 2004 .

[8]  Daniel P. Huttenlocher,et al.  Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[9]  N. Sousa,et al.  A hitchhiker's guide to behavioral analysis in laboratory rodents , 2006, Genes, brain, and behavior.

[10]  George Loizou,et al.  Computer vision and pattern recognition , 2007, Int. J. Comput. Math..

[11]  Sonja J. Prohaska,et al.  “Genes” , 2008, Theory in Biosciences.

[12]  Dima Damen,et al.  Recognizing linked events: Searching the space of feasible explanations , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  N. Tinbergen On aims and methods of Ethology , 2010 .

[14]  Michael J. Black,et al.  Decoding Complete Reach and Grasp Actions from Local Primary Motor Cortex Populations , 2010, The Journal of Neuroscience.

[15]  Pietro Perona,et al.  Cascaded pose regression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[17]  Geoffrey E. Hinton,et al.  Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition-' Washington , D . C . , June , 1983 OPTIMAL PERCEPTUAL INFERENCE , 2011 .

[18]  Damon Afkari,et al.  ? ? ? ? ? ? ? ? ? ? ? ? ? 30 ? ? ? ? ? ? ? ? ? ? ? ? ? ? , 2011 .

[19]  U. Bhalla,et al.  Rats track odour trails accurately using a multi-layered strategy with near-optimal sampling , 2012, Nature Communications.

[20]  A. Cressant,et al.  Computerized video analysis of social interactions in mice , 2012, Nature Methods.

[21]  Shay B. Cohen,et al.  Advances in Neural Information Processing Systems 25 , 2012, NIPS 2012.

[22]  W. Marsden I and J , 2012 .

[23]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[24]  Johannes E. Schindelin,et al.  Fiji: an open-source platform for biological-image analysis , 2012, Nature Methods.

[25]  Greg J. Stephens,et al.  Automated Tracking of Animal Posture and Movement during Exploration and Sensory Orientation Behaviors , 2012, PloS one.

[26]  T. Ono,et al.  A 3D-Video-Based Computerized Analysis of Social and Sexual Interactions in Rats , 2013, PloS one.

[27]  Pietro Perona,et al.  Automated image-based tracking and its application in ecology. , 2014, Trends in ecology & evolution.

[28]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[29]  Silvestro Micera,et al.  Closed-loop neuromodulation of spinal sensorimotor circuits controls refined locomotion after complete spinal cord injury , 2014, Science Translational Medicine.

[30]  Joseph J. Paton,et al.  Big behavioral data: psychology, ethology and the foundations of neuroscience , 2014, Nature Neuroscience.

[31]  Christian Szegedy,et al.  DeepPose: Human Pose Estimation via Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  David J. Anderson,et al.  Toward a Science of Computational Ethology , 2014, Neuron.

[33]  A. Pérez-Escudero,et al.  idTracker: tracking individuals in a group by automatic identification of unmarked animals , 2014, Nature Methods.

[34]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[35]  Bernt Schiele,et al.  2D Human Pose Estimation: New Benchmark and State of the Art Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  João Fayad,et al.  A quantitative framework for whole-body coordination reveals specific deficits in freely walking ataxic mice , 2015, eLife.

[37]  M. Gahr,et al.  Tap dancing birds: the multimodal mutual courtship display of males and females in a socially monogamous songbird , 2015, Scientific Reports.

[38]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[39]  Kristin Branson,et al.  Cortex commands the performance of skilled movement , 2015, eLife.

[40]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[42]  Jumpei Matsumoto,et al.  A Markerless 3D Computerized Motion Capture System Incorporating a Skeleton Model for Monkeys , 2016, PloS one.

[43]  Varun Ramakrishna,et al.  Convolutional Pose Machines , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Bernt Schiele,et al.  DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model , 2016, ECCV.

[45]  Peter V. Gehler,et al.  DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Matthias Bethge,et al.  DeepGaze II: Reading fixations from deep features trained on object recognition , 2016, ArXiv.

[47]  Nicholas J. Wade,et al.  Capturing Motion and Depth Before Cinematography , 2016, Journal of the history of the neurosciences.

[48]  Kristin Branson,et al.  Machine vision methods for analyzing social interactions , 2017, Journal of Experimental Biology.

[49]  Karl Deisseroth,et al.  Integration of optogenetics with complementary methodologies in systems neuroscience , 2017, Nature Reviews Neuroscience.

[50]  Bernt Schiele,et al.  ArtTrack: Articulated Multi-Person Tracking in the Wild , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Michael Unser,et al.  FlyLimbTracker: An active contour based approach for leg segment tracking in unmarked, freely behaving Drosophila , 2016, bioRxiv.

[52]  Omid Haji Maghsoudi,et al.  Superpixels based marker tracking vs. hue thresholding in rodent biomechanics application , 2017, 2017 51st Asilomar Conference on Signals, Systems, and Computers.

[53]  Samy Bengio,et al.  Understanding deep learning requires rethinking generalization , 2016, ICLR.

[54]  Yoram Ben-Shaul,et al.  OptiMouse: a comprehensive open source program for reliable detection and analysis of mouse body and nose positions , 2017, BMC Biology.

[55]  Benjamin F. Grewe,et al.  Neuronal Representation of Social Information in the Medial Amygdala of Awake Behaving Mice , 2017, Cell.

[56]  Andrew Zisserman,et al.  Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[57]  M. A. MacIver,et al.  Neuroscience Needs Behavior: Correcting a Reductionist Bias , 2017, Neuron.

[58]  Mackenzie W. Mathis,et al.  Somatosensory Cortex Plays an Essential Role in Forelimb Motor Adaptation in Mice , 2017, Neuron.

[59]  Andrew Zisserman,et al.  Detect to Track and Track to Detect , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[60]  R. Sarpong,et al.  Bio-inspired synthesis of xishacorenes A, B, and C, and a new congener from fuscol† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c9sc02572c , 2019, Chemical science.

[61]  Gordon J. Berman,et al.  Measuring behavior across scales , 2017, BMC Biology.

[62]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.