Implementation of Multi-object Recognition Algorithm Using Enhanced R-CNN

Background/Objectives: Multi-object recognition is emerging as a technology that can be applied to various real worlds such as image security, gesture recognition, robot vision, and human robot interaction, and it is difficult to recognize public objects in a complex background. Methods/Statistical analysis: Most multi-object tracking methods suffer from performance degradation due to the number of objects changing each frame, and this phenomenon is more pronounced in complex backgrounds. Therefore, in this paper, we propose an improved R-CNN-based multi-object recognition method that can detect multiple objects quickly while being robust to geometric distortions, lighting changes, and noise of environmental elements and objects in the image. Findings: Experiments were compared the detection rate and detection rate of the CNN method, R-CNN method and the proposed method. The proposed method takes more time to recognize multiple objects, but the object recognition rate shows a high result. Improvements/Applications: Through the proposed deep learning based multi-object recognition, it can contribute to the research that monitors and tracks several objects at the same time in the surveillance system.

[1]  Jason Yosinski,et al.  Deep neural networks are easily fooled: High confidence predictions for unrecognizable images , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Sergio A. Velastin,et al.  Intelligent distributed surveillance systems: a review , 2005 .

[4]  Huiyu Zhou,et al.  Object tracking using SIFT features and mean shift , 2009, Comput. Vis. Image Underst..

[5]  Yong-Hwan Lee,et al.  Performance analysis of object recognition and tracking for the use of surveillance system , 2016, J. Ambient Intell. Humaniz. Comput..

[6]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[7]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  King-Sun Fu,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Bineng Zhong,et al.  CNNTracker: Online discriminative object tracking via deep convolutional neural network , 2016, Appl. Soft Comput..

[10]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Ming-Hsuan Yang,et al.  Visual tracking with online Multiple Instance Learning , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Takeo Kanade,et al.  A System for Video Surveillance and Monitoring , 2000 .

[13]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Hironobu Fujiyoshi,et al.  A System for Video Surveillance and Monitoring CMU VSAM Final Report , 1999 .

[15]  Philippe C. Cattin,et al.  Tracking the invisible: Learning where the object might be , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[19]  Yi Yang,et al.  A discriminative CNN video representation for event detection , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).