Traffic Police Gesture Recognition using RGB-D and Faster R-CNN

How to make self-driving cars understand the traffic gestures of traffic police is crucial for driverless, especially in China there are many police to help the traffic move smoothly and quickly at different intersection in rush hours. Faster R-CNN in deep learning is a mainstream method, however, has a low recognition rate in the case of complex backgrounds. In order to improve the recognition accuracy under complex environment, a two-stream Faster R-CNN based on color and depth data is proposed in this paper. Depth channel information is used to combine with RGB channel information at the feature level. RGB channel information is integrated with Depth channel information based on Faster R-CNN and RGB-D. Experimental results show that this method is more advantageous than the Faster R-CNN using only RGB data.

[1]  Qingmin Liao,et al.  Robust RGB-D SLAM in dynamic environment using faster R-CNN , 2017, 2017 3rd IEEE International Conference on Computer and Communications (ICCC).

[2]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[3]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[4]  Sergio Escalera,et al.  ChaLearn Looking at People RGB-D Isolated and Continuous Datasets for Gesture Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[5]  Xilin Chen,et al.  Continuous Gesture Recognition with Hand-Oriented Spatiotemporal Feature , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[6]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[8]  Takeo Kanade,et al.  Background Subtraction for Freely Moving Cameras , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[9]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Nikolaos G. Bourbakis,et al.  A survey of skin-color modeling and detection methods , 2007, Pattern Recognit..

[11]  Andrew Blake,et al.  Contour-based learning for object detection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.