Deep Learning based Moving Object Detection for Video Surveillance

This paper proposes a new two-stream neural network which combines the traditional background modeling method with a deep learning network to detect moving objects. The input for the deep neural network is the original image and its corresponding foreground image, while the output is the bounding boxes of the moving objects in the image. Traditional CNN methods cannot distinguish moving objects from static objects, but the method in this paper successfully solves this problem.

[1]  Marc Van Droogenbroeck,et al.  ViBE: A powerful random technique to estimate the background in video sequences , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Guillaume-Alexandre Bilodeau,et al.  Urban Tracker: Multiple object tracking in urban mixed traffic , 2014, IEEE Winter Conference on Applications of Computer Vision.

[3]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[4]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Hanqing Lu,et al.  Pixelwise Deep Sequence Learning for Moving Object Detection , 2019, IEEE Transactions on Circuits and Systems for Video Technology.