Background extraction method for analysis of natural images captured by camera traps

Introduction: Automatic detection of animals, particularly birds, on images captured in the wild by camera traps remains an unsolved task due to the shooting and weather conditions. Such observations generate thousands or millions of images which are impossible to analyze manually. Wildlife sanctuaries and national parks normally use cheap camera traps. Their low quality images require careful multifold processing prior to the recognition of animal species. Purpose: Developing a background extraction method based on Gaussian mixture model in order to locate an object of interest under any time/season/meteorological conditions. Results: We propose a background extraction method based on a modified Gaussian mixture model. The modification uses truncated pixel values (in low bites) to decrease the dependence on the illumination changes or shadows. After that, binary masks are created and processed instead of real intensity values. The proposed method is aimed for background estimation of natural scenes in wildlife sanctuaries and national parks. Structural elements (trunks of growing and/or fallen trees) are considered slowly changeable during the seasons, while other textured areas are simulated by texture patterns corresponding to the current season. Such an approach provides a compact background model of a scene. Also, we consider the influence of the time/season/meteorological attributes o f a scene with respect to its restoration ability. The method was tested using a rich dataset of natural images obtained on the territory of Ergaki wildlife sanctuary in Krasnoyarsk Krai, Russia. Practical relevance: The application of the modified Gaussian mixture model provides an accuracy of object detection as high as 79-83% in the daytime and 60-69% at night, under acceptable meteorological conditions. When the meteorological conditions are bad, the accuracy is 5-8% lower.

[1]  Larry S. Davis,et al.  Real-time foreground-background segmentation using codebook model , 2005, Real Time Imaging.

[2]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.

[3]  Zhi Zhang,et al.  Animal Detection From Highly Cluttered Natural Scenes Using Spatiotemporal Object Region Proposals and Patch Verification , 2016, IEEE Transactions on Multimedia.

[4]  Weiguo Gong,et al.  Nonparametric on-line background generation for surveillance video , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[5]  Tieniu Tan,et al.  Foreground Object Detection Using Top-Down Information Based on EM Framework , 2012, IEEE Transactions on Image Processing.

[6]  Margarita N. Favorskaya,et al.  Image Inpainting based on Self-organizing Maps by Using Multi-agent Implementation , 2014, KES.

[7]  Chih-Hsien Hsia,et al.  Fast Background Subtraction Based on a Multilayer Codebook Model for Moving Object Detection , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Marc Van Droogenbroeck,et al.  ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.

[9]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Álvaro Sánchez Miralles,et al.  Mixture of Merged Gaussian Algorithm using RTDENN , 2013, Machine Vision and Applications.

[11]  Zheng Yi,et al.  Moving object detection based on running average background and temporal difference , 2010, 2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering.

[12]  Venkatesh Saligrama,et al.  Foreground-Adaptive Background Subtraction , 2009, IEEE Signal Processing Letters.

[13]  Thuong Le-Tien,et al.  NIC: A Robust Background Extraction Algorithm for Foreground Detection in Dynamic Scenes , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Jitendra Malik,et al.  Learning to segment moving objects in videos , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Margarita N. Favorskaya,et al.  Intelligent method of texture reconstruction in video sequences based on neural networks , 2013, Int. J. Reason. based Intell. Syst..

[16]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Thierry Bouwmans,et al.  Recent Advanced Statistical Background Modeling for Foreground Detection - A Systematic Survey , 2011 .

[18]  Thierry Bouwmans,et al.  Modeling of Dynamic Backgrounds by Type-2 Fuzzy Gaussians Mixture Models , 2010 .

[19]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Vladlen Koltun,et al.  Geodesic Object Proposals , 2014, ECCV.

[21]  Zize Liang,et al.  An adaptive mixture Gaussian background model with online background reconstruction and adjustable foreground mergence time for motion segmentation , 2005, 2005 IEEE International Conference on Industrial Technology.

[22]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[23]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[24]  Marko Heikkilä,et al.  A texture-based method for modeling the background and detecting moving objects , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Nizar Bouguila,et al.  Background subtraction using finite mixtures of asymmetric Gaussian distributions and shadow detection , 2013, Machine Vision and Applications.

[26]  Jinkuan Wang,et al.  Improved visual background extractor using an adaptive distance threshold , 2014, J. Electronic Imaging.

[27]  Thierry Bouwmans,et al.  Traditional and recent approaches in background modeling for foreground detection: An overview , 2014, Comput. Sci. Rev..

[28]  Jeng-Shyang Pan,et al.  Speed Up Temporal Median Filter for Background Subtraction , 2010, 2010 First International Conference on Pervasive Computing, Signal Processing and Applications.

[29]  Zhenhua Guo,et al.  Dynamic background estimation and complementary learning for pixel-wise foreground/background segmentation , 2016, Pattern Recognit..

[30]  Brendon J. Woodford,et al.  Illumination Invariant Background Model Using Mixture of Gaussians and SURF Features , 2012, ACCV Workshops.

[31]  Markus H. Gross,et al.  Fully Connected Object Proposals for Video Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[32]  Itaru Kitahara,et al.  Robust Foreground Extraction Technique Using Gaussian Family Model and Multiple Thresholds , 2007, ACCV.