Bird Detection in Agriculture Environment using Image Processing and Neural Network

This paper addresses the problem of detecting pest birds in the agriculture field. So far, various research methods have been proposed for bird detection. However, bird detection in a low-lying field is still challenging since it is difficult to distinguish objects in the dynamic background including moving leaves and clouds. To recognize objects in the dynamic background, deep learning has been used due to its efficiency. However, detecting objects in a video with deep learning is not fast enough since the process needs high computational cost. In addition, if the object is tiny, the accuracy becomes lower. To solve these problems, we propose adapting deep learning to certain cropped small areas of a frame where there is a high possibility of bird presence, based on the result of the image processing. First, the moving objects are extracted by conducting the background subtraction based on Gaussian Mixture Model. Then, color extraction and median filter remove the unnecessary objects in the agricultural environment. Lastly, the minimized moving objects are classified with the neural network object classifier. The experimental results demonstrate that applying neural networks to specific areas of an image brings a higher accuracy than the whole original frame.

[1]  Rogelio Hasimoto-Beltran,et al.  Video Background Subtraction in Complex Environments , 2014 .

[2]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[3]  Pietro Perona,et al.  Building a bird recognition app and large scale dataset with citizen scientists: The fine print in fine-grained dataset collection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Zoran Zivkovic,et al.  Improved adaptive Gaussian mixture model for background subtraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[5]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[6]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[8]  Vijay Vasudevan,et al.  Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  Hong Zhang,et al.  Real-time bird detection based on background subtraction , 2012, Proceedings of the 10th World Congress on Intelligent Control and Automation.

[10]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[12]  Pietro Perona,et al.  The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[13]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[14]  Ting Liu,et al.  Recent advances in convolutional neural networks , 2015, Pattern Recognit..

[15]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[16]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[17]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[18]  Massimo Piccardi,et al.  Background subtraction techniques: a review , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[19]  Takeshi Naemura,et al.  Detection of small birds in large images by combining a deep detector with semantic segmentation , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[20]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Ohad Shamir,et al.  Are ResNets Provably Better than Linear Predictors? , 2018, NeurIPS.

[22]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Ferdinand van der Heijden,et al.  Efficient adaptive density estimation per image pixel for the task of background subtraction , 2006, Pattern Recognit. Lett..

[24]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Subhransu Maji,et al.  Fine-Grained Visual Classification of Aircraft , 2013, ArXiv.

[26]  Steven Verstockt,et al.  Automatic detection, tracking and counting of birds in marine video content , 2016, 2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA).

[27]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.