Symmetric pyramid attention convolutional neural network for moving object detection

Moving object detection (MOD) is a crucial research topic in the field of computer vision, but it faces some challenges such as shadows, illumination, and dynamic background in practical application. In the past few years, the rise of deep learning (DL) has provided fresh ideas to conquer these issues. Inspired by the existing successful deep learning framework, we design a novel pyramid attention-based architecture for MOD. On the one hand, we propose a pyramid attention module to get pivotal target information, and link different layers of knowledge through skip connections. On the other hand, the dilated convolution block (DCB) is dedicated to obtain multi-scale features, which provides sufficient semantic information and geometric details for the network. In this way, contextual resources are closely linked and get more valuable clues. It helps to obtain a precise foreground in the end. Compared with the existing conventional techniques and DL approaches on the benchmark dataset (CDnet2014), the experiments indicate that the performance of our algorithm is better than previous methods.

[1]  Hasan Sajid,et al.  Universal Multimode Background Subtraction , 2017, IEEE Transactions on Image Processing.

[2]  Simone Bianco,et al.  Combination of Video Change Detection Algorithms by Genetic Programming , 2017, IEEE Transactions on Evolutionary Computation.

[3]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[4]  Mohamed S. Shehata,et al.  Local null space pursuit for real-time moving object detection in aerial surveillance , 2020, Signal Image Video Process..

[5]  Dewang Chen,et al.  Multi-Dimensional Traffic Congestion Detection Based on Fusion of Visual Features and Convolutional Neural Network , 2019, IEEE Transactions on Intelligent Transportation Systems.

[6]  Yuansheng Luo,et al.  Deep Background Modeling Using Fully Convolutional Network , 2018, IEEE Transactions on Intelligent Transportation Systems.

[7]  Jinqing Qi,et al.  Multi-attention guided feature fusion network for salient object detection , 2020, Neurocomputing.

[8]  Peng Li,et al.  An Improved ViBe Algorithm Based on Visual saliency , 2018, 2018 International Computers, Signals and Systems Conference (ICOMSSC).

[9]  Sambit Bakshi,et al.  An Evaluation of Background Subtraction for Object Detection Vis-a-Vis Mitigating Challenging Scenarios , 2016, IEEE Access.

[10]  Hanqing Lu,et al.  Pixelwise Deep Sequence Learning for Moving Object Detection , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Z. Zivkovic Improved adaptive Gaussian mixture model for background subtraction , 2004, ICPR 2004.

[12]  Chung-Cheng Chiu,et al.  A Robust Object Segmentation System Using a Probability-Based Background Extraction Algorithm , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Young-Jun Son,et al.  Effective and Efficient Detection of Moving Targets From a UAV’s Camera , 2018, IEEE Transactions on Intelligent Transportation Systems.

[14]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.

[15]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[16]  Santosh Kumar Vipparthi,et al.  3DFR: A Swift 3D Feature Reductionist Framework for Scene Independent Change Detection , 2019, IEEE Signal Processing Letters.

[17]  Nicu Sebe,et al.  Multi-View Spatial Attention Embedding for Vehicle Re-Identification , 2021, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Guillaume-Alexandre Bilodeau,et al.  A Self-Adjusting Approach to Change Detection Based on Background Word Consensus , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[19]  Marc Van Droogenbroeck,et al.  Deep background subtraction with scene-specific convolutional neural networks , 2016, 2016 International Conference on Systems, Signals and Image Processing (IWSSIP).

[20]  Wenjun Xu,et al.  A Moving Shadow Elimination Method Based on Fusion of Multi-Feature , 2020, IEEE Access.

[21]  Guillaume-Alexandre Bilodeau,et al.  SuBSENSE: A Universal Change Detection Method With Local Adaptive Sensitivity , 2015, IEEE Transactions on Image Processing.

[22]  Ling Shao,et al.  End-to-end video background subtraction with 3d convolutional neural networks , 2017, Multimedia Tools and Applications.

[23]  Marcos Ortega,et al.  An end-to-end deep learning approach for simultaneous background modeling and subtraction , 2019, BMVC.

[24]  Matti Pietikäinen,et al.  Modeling pixel process with scale invariant local patterns for background subtraction in complex scenes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Kang-Hyun Jo,et al.  Moving Object Detection for a Moving Camera Based on Global Motion Compensation and Adaptive Background Model , 2019, International Journal of Control, Automation and Systems.

[26]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[27]  Gerhard Rigoll,et al.  A deep convolutional neural network for video sequence background subtraction , 2018, Pattern Recognit..

[28]  Zhiming Luo,et al.  Interactive deep learning method for segmenting moving objects , 2017, Pattern Recognit. Lett..

[29]  Gerhard Rigoll,et al.  Background segmentation with feedback: The Pixel-Based Adaptive Segmenter , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[30]  Subrahmanyam Murala,et al.  MSFgNet: A Novel Compact End-to-End Deep Network for Moving Object Detection , 2019, IEEE Transactions on Intelligent Transportation Systems.

[31]  Marc Van Droogenbroeck,et al.  ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.

[32]  Rui Wang,et al.  Static and Moving Object Detection Using Flux Tensor with Split Gaussian Models , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[33]  Fatih Murat Porikli,et al.  CDnet 2014: An Expanded Change Detection Benchmark Dataset , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[34]  Yimin Yang,et al.  A 3D CNN-LSTM-Based Image-to-Image Foreground Segmentation , 2020, IEEE Transactions on Intelligent Transportation Systems.

[35]  Yantao Wei,et al.  An infrared small target detection method based on multiscale local homogeneity measure , 2018 .

[36]  Zhan-Li Sun,et al.  An Effective Subsuperpixel-Based Approach for Background Subtraction , 2020, IEEE Transactions on Industrial Electronics.

[37]  Zhang Xinsong,et al.  An optimized Vibe target detection algorithm based on gray distribution and Minkowski distance , 2017, 2017 32nd Youth Academic Annual Conference of Chinese Association of Automation (YAC).

[38]  Yaobin Zou,et al.  Change detection with various combinations of fluid pyramid integration networks , 2021, Neurocomputing.