Automatically Identifying of animals in the wilderness: Comparative studies between CNN and C-Capsule Network

The evolution of machine learning and computer vision in technology has driven a lot of improvements and innovation into several domains. We see it being applied for credit decisions, insurance quotes, malware detection, fraud detection, email composition, and any other area having enough information to allow the machine to learn patterns. Over the years the number of sensors, cameras and cognitive pieces of equipment placed in the wilderness have been growing exponentially. However, the resources(human) to leverage these data into something meaningful are not improving at the same rate. For instance, a team of scientist volunteers took 8.4 years, 17000 hours at a rate of 40 hours/ week to label 3.2 million images from the Serengeti wild park for our research, we are going to focus on wild data, and keep proving that deep learning can do better and faster than the human equivalent labour for the same task. Moreover, this is also an opportunity to present some custom Capsule Networks architectures to the deep learning community while solving the above-mentioned critical problem. Incidentally, we are going to take advantage of these data to make a comparative study on multiple Deep learning models. Specifically, VGG-net, RES-net and a custom made Convolutional-Capsule Network. We benchmark our work with the Serengeti project where Mohammed Mohammed Sadegh et al. recently published a 92% top-1 accuracy [15] and Gomez et al. had a 58% top-1 accuracy [8]. We successfully reached 96.4% top-1 accuracy on the same identification task. Concurrently, we reach up to 79.48% top-1 testing accuracy on a big complex dataset using capsule network, which out-perform the best results of Capsule networks on a complex dataset from Edgar Xi et al. with 71% testing accuracy [5,23,18].

[1]  Robertas Alzbutas,et al.  Convolutional capsule network for classification of breast cancer histology images , 2018, ICIAR.

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  Yu Sun,et al.  Convolutional Recurrent Neural Networks for Observation-Centered Plant Identification , 2018, J. Electr. Comput. Eng..

[4]  Trevor Darrell,et al.  Timely Object Recognition , 2012, NIPS.

[5]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Rinat Mukhometzianov,et al.  CapsNet comparative performance evaluation for image classification , 2018, ArXiv.

[7]  James J. DiCarlo,et al.  How Does the Brain Solve Visual Object Recognition? , 2012, Neuron.

[8]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[9]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Margaret Kosmala,et al.  Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning , 2017, Proceedings of the National Academy of Sciences.

[11]  Yang Jin,et al.  Capsule Network Performance on Complex Data , 2017, ArXiv.

[12]  Xindong Wu,et al.  Object Detection With Deep Learning: A Review , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[13]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[14]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[15]  German Díez,et al.  Animal Identification in Low Quality Camera-Trap Images Using Very Deep Convolutional Neural Networks and Confidence Thresholds , 2016, ISVC.

[16]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.