Towards Automatic Detection of Animals in Camera-Trap Images

In recent years the world's biodiversity is declining on an unprecedented scale. Many species are endangered and remaining populations need to be protected. To overcome this agitating issue, biologist started to use remote camera devices for wildlife monitoring and estimation of remaining population sizes. Unfortunately, the huge amount of data makes the necessary manual analysis extremely tedious and highly cost intensive. In this paper we re-train and apply two state-of-the-art deep-learning based object detectors to localize and classify Serengeti animals in camera-trap images. Furthermore, we thoroughly evaluate both algorithms on a self-established dataset and show that the combination of the results of both detectors can enhance overall mean average precision. In contrast to previous work our approach is not only capable of classifying the main species in images but can also detect them and therefore count the number of individuals which is in fact an important information for biologists, ecologists, and wildlife epidemiologists.

[1]  Margaret Kosmala,et al.  Automatically identifying wild animals in camera trap images with deep learning , 2017, ArXiv.

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  C. Lintott,et al.  Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna , 2015, Scientific Data.

[4]  C. Carbone,et al.  Surveys using camera traps: are we looking to a brighter future? , 2008 .

[5]  T. Burghardt Visual Animal Biometrics , 2008 .

[6]  Derek Hoiem,et al.  Diagnosing Error in Object Detectors , 2012, ECCV.

[7]  Zhi Zhang,et al.  Animal Detection From Highly Cluttered Natural Scenes Using Spatiotemporal Object Region Proposals and Patch Verification , 2016, IEEE Transactions on Multimedia.

[8]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[9]  S. Stuart,et al.  Wildlife in a changing world : an analysis of the 2008 IUCN red list of threatened species , 2009 .

[10]  Veronica Maidel,et al.  This Image Intentionally Left Blank : Mundane Images Increase Citizen Science Participation , 2015 .

[11]  H. Kühl,et al.  Animal biometrics: quantifying and detecting phenotypic appearance. , 2013, Trends in ecology & evolution.

[12]  Heesung Kwon,et al.  Dynamic belief fusion for object detection , 2015, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[13]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[14]  Ming-Hsuan Yang,et al.  Weakly Supervised Object Localization with Progressive Domain Adaptation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Karina Figueroa,et al.  Fast Automatic Detection of Wildlife in Images from Trap Cameras , 2014, CIARP.

[16]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[18]  Alexander Loos,et al.  An automated chimpanzee identification system using face detection and recognition , 2013, EURASIP J. Image Video Process..

[19]  Jesús Francisco Vargas-Bonilla,et al.  Towards automatic wild animal monitoring: Identification of animal species in camera-trap images using very deep convolutional neural networks , 2016, Ecol. Informatics.

[20]  Yang Wang,et al.  Attention Networks for Weakly Supervised Object Localization , 2016, BMVC.

[21]  Arthur P. Dempster,et al.  Upper and Lower Probabilities Induced by a Multivalued Mapping , 1967, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[22]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Jiangping Wang,et al.  Automated identification of animal species in camera trap images , 2013, EURASIP J. Image Video Process..