A Saliency Guided Shallow Convolutional Neural Network for Traffic Signs Retrieval

As one of the important parts of road infrastructure, traffic signs provide vital information for road users. Achieving efficient traffic signs retrieval greatly contributes to the intelligent analysis on big traffic data. In this paper, we propose a saliency guided shallow convolutional neural network (CNN) for traffic signs accurate and fast retrieval. Firstly, by unifying deep saliency and hashing learning in a single architecture, the proposed CNN model performs joint learning in a point-wise manner, which is scalable on large-scale datasets. Then, deep saliency features and hashing-like outputs are extracted from traffic sign images with the saliency guided shallow CNN. The binarized hashing-like outputs together with saliency features are used to construct features database. Finally, a coarse to fine similarity measurement is performed by Euclidean distance and Hamming distance to return retrieval results. Experimental results demonstrate the retrieval accuracy of our method outperforms five state-of-the-art methods on GTSRB dataset.

[1]  Reinhard Klette,et al.  General traffic sign recognition by feature matching , 2009, 2009 24th International Conference Image and Vision Computing New Zealand.

[2]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[3]  Honglin Wu,et al.  A Shallow Network with Combined Pooling for Fast Traffic Sign Recognition , 2017, Inf..

[4]  Ji Wan,et al.  Deep Learning for Content-Based Image Retrieval: A Comprehensive Study , 2014, ACM Multimedia.

[5]  Cheng-Lin Liu,et al.  Traffic Sign Detection Using a Cascade Method With Fast Feature Extraction and Saliency Test , 2017, IEEE Transactions on Intelligent Transportation Systems.

[6]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[7]  Luc Van Gool,et al.  Traffic sign recognition — How far are we from the solution? , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[8]  Victor S. Lempitsky,et al.  Neural Codes for Image Retrieval , 2014, ECCV.

[9]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[10]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[11]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[12]  Bin Fan,et al.  Traffic Sign Recognition Using a Multi-Task Convolutional Neural Network , 2018, IEEE Transactions on Intelligent Transportation Systems.

[13]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[16]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[17]  Lin-Lin Huang,et al.  Traffic Sign Recognition Using Complementary Features , 2013, 2013 2nd IAPR Asian Conference on Pattern Recognition.

[18]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[19]  Yizhou Yu,et al.  Visual Saliency Detection Based on Multiscale Deep CNN Features , 2016, IEEE Transactions on Image Processing.

[20]  Johannes Stallkamp,et al.  Man vs. computer: Benchmarking machine learning algorithms for traffic sign recognition , 2012, Neural Networks.

[21]  Chu-Song Chen,et al.  Supervised Learning of Semantics-Preserving Hash via Deep Convolutional Neural Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[23]  Nozomu Hamada,et al.  Traffic sign recognition by Bags of features , 2009, TENCON 2009 - 2009 IEEE Region 10 Conference.

[24]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[25]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[26]  Jen-Hao Hsiao,et al.  Deep learning of binary hash codes for fast image retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[27]  Anjan Gudigar,et al.  Local texture patterns for traffic sign recognition using higher order spectra , 2017, Pattern Recognit. Lett..

[28]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[29]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Yann LeCun,et al.  Traffic sign recognition with multi-scale Convolutional Networks , 2011, The 2011 International Joint Conference on Neural Networks.