Contextual background modeling using deep convolutional neural network

Moving object detection is a crucial problem in computer vision. This affects the performance of the overall system in surveillance applications. In this paper, a Deep-Convolutional Neural Network with fully convolutional approach is proposed. Convolutional networks are powerful models to extract hierarchies of non-handcrafted features. The primary objective of the paper is to build an accurate foreground segmentation system with limited user interventions. The presented work focuses to build a fully convolutional network with skip architecture to identify moving objects in complex scenarios. The network is modeled as an end-to-end fully convolutional network, and the method contains a new hierarchical pooling layer to make use of global contextual information. The presented model utilizes a pre-trained VGG-19 Net model for the construction of Deep-Convolutional Neural Network (Deep-CNN) model. The fine and coarse features are fused using skip architecture to improve the feature representation. The qualitative and quantitative performance of the Deep-CNN architecture is tested on ChangeDetection.net-2014 dataset. The results produced by the Deep-CNN method were compared with the techniques in the recent literature. The Deep-CNN method outperforms the state-of-the-art methods without relying on any post-processing techniques.

[1]  Xiaobo Lu,et al.  WeSamBE: A Weight-Sample-Based Method for Background Subtraction , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  Ronan Collobert,et al.  Recurrent Convolutional Neural Networks for Scene Labeling , 2014, ICML.

[4]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[5]  Eduardo A. B. da Silva,et al.  Foreground Segmentation for Anomaly Detection in Surveillance Videos Using Deep Residual Networks , 2017 .

[6]  V. Rajinikanth,et al.  Deep neural network assisted diagnosis of time-frequency transformed electromyograms , 2018, Multimedia Tools and Applications.

[7]  Hanqing Lu,et al.  Learning sharable models for robust background subtraction , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).

[8]  Larry S. Davis,et al.  Real-time foreground-background segmentation using codebook model , 2005, Real Time Imaging.

[9]  Heng Tao Shen,et al.  Beyond Frame-level CNN: Saliency-Aware 3-D CNN With LSTM for Video Action Recognition , 2017, IEEE Signal Processing Letters.

[10]  Guillaume-Alexandre Bilodeau,et al.  SuBSENSE: A Universal Change Detection Method With Local Adaptive Sensitivity , 2015, IEEE Transactions on Image Processing.

[11]  Mohan Ramasundaram,et al.  A fast DGPSO-motion saliency map based moving object detection , 2018, Multimedia Tools and Applications.

[12]  Marc Van Droogenbroeck,et al.  ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.

[13]  Kang-Hyun Jo,et al.  Evaluation of background subtraction algorithms for video surveillance , 2015, 2015 21st Korea-Japan Joint Workshop on Frontiers of Computer Vision (FCV).

[14]  Joseph L. Mundy,et al.  Duration Dependent Codebooks for Change Detection , 2014, BMVC.

[15]  Marcel van Gerven,et al.  Convolutional neural network-based encoding and decoding of visual object recognition in space and time , 2017, NeuroImage.

[16]  Zezhi Chen,et al.  A self-adaptive Gaussian mixture model , 2014, Comput. Vis. Image Underst..

[17]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[18]  Yacov Hel-Or,et al.  Foreground detection using spatiotemporal projection kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Sambit Bakshi,et al.  An Evaluation of Background Subtraction for Object Detection Vis-a-Vis Mitigating Challenging Scenarios , 2016, IEEE Access.

[20]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Mohan Ramasundaram,et al.  Moving object detection using vector image model , 2018, Optik.

[22]  Guillaume-Alexandre Bilodeau,et al.  Universal Background Subtraction Using Word Consensus Models , 2016, IEEE Transactions on Image Processing.

[23]  Giorgos Tolias,et al.  Fine-Tuning CNN Image Retrieval with No Human Annotation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[25]  Fatih Murat Porikli,et al.  CDnet 2014: An Expanded Change Detection Benchmark Dataset , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[26]  Muhammad Sharif,et al.  Appearance based pedestrians' gender recognition by employing stacked auto encoders in deep learning , 2018, Future Gener. Comput. Syst..

[27]  Muhammad Sharif,et al.  Big data analysis for brain tumor detection: Deep convolutional neural networks , 2018, Future Gener. Comput. Syst..

[28]  Marko Heikkilä,et al.  A texture-based method for modeling the background and detecting moving objects , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  KimKyungnam,et al.  Real-time foreground-background segmentation using codebook model , 2005 .

[30]  Lucia Maddalena,et al.  A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications , 2008, IEEE Transactions on Image Processing.

[31]  Michael Harville,et al.  Foreground segmentation using adaptive mixture models in color and depth , 2001, Proceedings IEEE Workshop on Detection and Recognition of Events in Video.

[32]  Tao Xiang,et al.  Background Subtraction with DirichletProcess Mixture Models , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Fatih Murat Porikli,et al.  Changedetection.net: A new change detection benchmark dataset , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[34]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Yongxin Yang,et al.  Frankenstein: Learning Deep Face Representations Using Small Data , 2016, IEEE Transactions on Image Processing.

[36]  Saeid Nahavandi,et al.  A sequential search-space shrinking using CNN transfer learning and a Radon projection pool for medical image retrieval , 2018, Expert Syst. Appl..

[37]  Javed Imran,et al.  Combining CNN streams of RGB-D and skeletal data for human activity recognition , 2018, Pattern Recognit. Lett..

[38]  A. Vadivel,et al.  Multiple human object tracking using background subtraction and shadow removal techniques , 2010, 2010 International Conference on Signal and Image Processing.

[39]  Gerhard Rigoll,et al.  Background segmentation with feedback: The Pixel-Based Adaptive Segmenter , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[40]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[41]  Larry S. Davis,et al.  ReMotENet: Efficient Relevant Motion Event Detection for Large-Scale Home Surveillance Videos , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[42]  Lei Wang,et al.  A Novel Recursive Bayesian Learning-Based Method for the Efficient and Accurate Segmentation of Video With Dynamic Background , 2012, IEEE Transactions on Image Processing.

[43]  K. Chitra,et al.  An efficient low cost background subtraction method to extract foreground object during human tracking , 2014, 2014 International Conference on Circuits, Power and Computing Technologies [ICCPCT-2014].

[44]  Chih-Hsien Hsia,et al.  Fast Background Subtraction Based on a Multilayer Codebook Model for Moving Object Detection , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[45]  Zhiming Luo,et al.  Interactive deep learning method for segmenting moving objects , 2017, Pattern Recognit. Lett..

[46]  Yuansheng Luo,et al.  Deep Background Modeling Using Fully Convolutional Network , 2018, IEEE Transactions on Intelligent Transportation Systems.

[47]  Bakiya Ambikapathy,et al.  Assessment of electromyograms using genetic algorithm and artificial neural networks , 2018, Evol. Intell..

[48]  Jing Li,et al.  SD-CNN: a Shallow-Deep CNN for Improved Breast Cancer Diagnosis , 2018, Comput. Medical Imaging Graph..

[49]  Gerhard Rigoll,et al.  A deep convolutional neural network for video sequence background subtraction , 2018, Pattern Recognit..

[50]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[51]  Marc Van Droogenbroeck,et al.  Deep background subtraction with scene-specific convolutional neural networks , 2016, 2016 International Conference on Systems, Signals and Image Processing (IWSSIP).