Maize Insects Classification Through Endoscopic Video Analysis

An early identification of insects in grains is of paramount importance to avoid losses. Instead of sampling and visual/laboratory analysis of grains, we propose carrying out the insect identification task automatically, using endoscopic video analysis methods. As the classification process of moving objects in video relies heavily on precise segmentation of moving objects, we propose a new background subtraction method and comparing their results with the main methods of the literature according to a comprehensive review. The background subtraction method relies on a binarization process that uses two thresholds: a global and a local threshold. The binarized results are combined by adding details of the object obtained by the local threshold in the result to the global threshold. Experimental results performed through visual analysis of the segmentation results and using a SVM classifier suggest that the proposed segmentation method produces more accurate results than the state-of-art background subtraction methods.

[1]  Thierry Bouwmans,et al.  Fuzzy statistical modeling of dynamic backgrounds for moving object detection in infrared videos , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[2]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[3]  Lucia Maddalena,et al.  A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications , 2008, IEEE Transactions on Image Processing.

[4]  P. KaewTrakulPong,et al.  An Improved Adaptive Background Mixture Model for Real-time Tracking with Shadow Detection , 2002 .

[5]  Thierry Bouwmans,et al.  Fuzzy integral for moving object detection , 2008, 2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence).

[6]  Jean-Marc Odobez,et al.  Multi-Layer Background Subtraction Based on Color and Texture , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Antoine Vacavant,et al.  A comprehensive review of background subtraction algorithms evaluated with synthetic and real videos , 2014, Comput. Vis. Image Underst..

[8]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[10]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Thierry Bouwmans,et al.  Modeling of Dynamic Backgrounds by Type-2 Fuzzy Gaussians Mixture Models , 2010 .

[12]  Thierry Bouwmans,et al.  Type-2 Fuzzy Mixture of Gaussians Model: Application to Background Modeling , 2008, ISVC.

[13]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[14]  Simone Calderara,et al.  Reliable background suppression for complex scenes , 2006, VSSN '06.

[15]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Christoph Meinel,et al.  Exploring multimodal video representation for action recognition , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[17]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Charles R. Giardina,et al.  Elliptic Fourier features of a closed contour , 1982, Comput. Graph. Image Process..

[19]  Christoph Meinel,et al.  Action Recognition in Surveillance Video Using ConvNets and Motion History Image , 2016, ICANN.

[20]  Marc'Aurelio Ranzato,et al.  Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[21]  Patrick Bouthemy,et al.  Optical flow modeling and computation: A survey , 2015, Comput. Vis. Image Underst..

[22]  Christoph Meinel,et al.  Real-Time Action Recognition in Surveillance Videos Using ConvNets , 2016, ICONIP.