Transfer learning based hybrid 2D-3D CNN for traffic sign recognition and semantic road detection applied in advanced driver assistance systems

Annually, deep learning algorithms have proven their effectiveness in many vision-based applications, such as autonomous driving, traffic, and congestion monitoring, and so on. In computer vision, accurate traffic sign recognition and semantic road detection are vital challenges for increased safety, which are becoming a major research topic for intelligent transport systems community. In this paper, a deep learning-based driving assistance system has been proposed. To this end, we present hybrid 2D-3D CNN models based on the transfer learning paradigm to achieve better performance on benchmark real-world datasets. The primary goal of transfer learning is to improve the learning process in the target domain while transferring relevant knowledge from the source domain. We combine a pre-trained deep 2D CNN and a shallow 3D CNN to significantly reduce complexity and speed-up the training algorithm. The first model, called Hybrid-TSR, is intended to effectively address the task of traffic sign recognition. Hybrid-SRD is the second architecture that allows the semantic detection of road space through a combination of up-sampling and deconvolutional operations. The experimental results show that the proposed methods have considerable relevance in terms of efficiency and accuracy.

[1]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[2]  Abdellatif Mtibaa,et al.  Robust road lanes and traffic signs recognition for driver assistance system , 2015, Int. J. Comput. Sci. Eng..

[3]  Juan Antonio Álvarez,et al.  Deep neural network for traffic sign recognition systems: An analysis of spatial transformers and stochastic optimisation methods , 2018, Neural Networks.

[4]  Neethu John,et al.  A Reliable Method for Detecting Road Regions from a Single Image Based on Color Distribution and Vanishing Point Location , 2015 .

[5]  CireşAnDan,et al.  2012 Special Issue , 2012 .

[6]  Fei Wang,et al.  Siamese Attentional Keypoint Network for High Performance Visual Tracking , 2019, Knowl. Based Syst..

[7]  Daniel Thalmann,et al.  3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation from Single Depth Images , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Yann LeCun,et al.  Traffic sign recognition with multi-scale Convolutional Networks , 2011, The 2011 International Joint Conference on Neural Networks.

[10]  Danijel Skocaj,et al.  Deep Learning for Large-Scale Traffic-Sign Detection and Recognition , 2019, IEEE Transactions on Intelligent Transportation Systems.

[11]  Xiaoping Luo,et al.  Efficient convNets for fast traffic sign recognition , 2019, IET Intelligent Transport Systems.

[12]  Ali Behloul,et al.  Speed limit sign detection and recognition system using SVM and MNIST datasets , 2019, Neural Computing and Applications.

[13]  Liang Xiao,et al.  Hybrid conditional random field based camera-LIDAR fusion for road detection , 2017, Inf. Sci..

[14]  Seunghoon Hong,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[15]  Zhang Shichao,et al.  One For All: A Mutual Enhancement Method for Object Detection and Semantic Segmentation , 2019 .

[16]  Ping Jiang,et al.  Learning multi-temporal-scale deep information for action recognition , 2018, Applied Intelligence.

[17]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[18]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Johannes Stallkamp,et al.  The German Traffic Sign Recognition Benchmark: A multi-class classification competition , 2011, The 2011 International Joint Conference on Neural Networks.

[20]  Kelvin C. P. Wang,et al.  Transfer Learning Based Traffic Sign Recognition Using Inception-v3 Model , 2018, Periodica Polytechnica Transportation Engineering.

[21]  Majid Mirmehdi,et al.  Real-Time Detection and Recognition of Road Traffic Signs , 2012, IEEE Transactions on Intelligent Transportation Systems.

[22]  Jiujun Cheng,et al.  Deep learning–based traffic sign recognition for unmanned autonomous vehicles , 2018, J. Syst. Control. Eng..

[23]  Kang-Hyun Jo,et al.  Traffic sign recognition and classification with modified residual networks , 2017, 2017 IEEE/SICE International Symposium on System Integration (SII).

[24]  George Azzopardi,et al.  Color-blob-based COSFIRE filters for object recognition , 2017, Image Vis. Comput..

[25]  Mohamed Atri,et al.  Traffic Signs Detection for Real-World Application of an Advanced Driving Assisting System Using Deep Learning , 2019, Neural Processing Letters.

[26]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[27]  Peng Gao,et al.  Learning Reinforced Attentional Representation for End-to-End Visual Tracking , 2019, Inf. Sci..

[28]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[30]  Kaijun Zhou,et al.  Multi-channel biomimetic visual transformation for object feature extraction and recognition of complex scenes , 2019, Applied Intelligence.

[31]  Shuzhi Sam Ge,et al.  Small traffic sign detection from large image , 2019, Applied Intelligence.

[32]  Zhe Chen,et al.  RBNet: A Deep Neural Network for Unified Road and Road Boundary Detection , 2017, ICONIP.

[33]  Johannes Stallkamp,et al.  Man vs. computer: Benchmarking machine learning algorithms for traffic sign recognition , 2012, Neural Networks.

[34]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[35]  Byung-Gyu Kim,et al.  A Robust Real-Time Road Detection Algorithm Using Color and Edge Information , 2015, ISVC.

[36]  Jun Liu,et al.  Focused random walk with probability distribution for SAT with long clauses , 2020, Applied Intelligence.

[37]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  StallkampJ.,et al.  2012 Special Issue , 2012 .

[39]  Wolfram Burgard,et al.  Efficient and robust deep networks for semantic segmentation , 2018, Int. J. Robotics Res..

[40]  Alexander Wong,et al.  MicronNet: A Highly Compact Deep Convolutional Neural Network Architecture for Real-Time Embedded Traffic Sign Classification , 2018, IEEE Access.

[41]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Roberto Cipolla,et al.  MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving , 2016, 2018 IEEE Intelligent Vehicles Symposium (IV).

[43]  Quoc V. Le,et al.  EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.

[44]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Carsten Rother,et al.  Conditional Random Fields Meet Deep Neural Networks for Semantic Segmentation: Combining Probabilistic Graphical Models with Deep Learning for Structured Prediction , 2018, IEEE Signal Processing Magazine.

[46]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[47]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[48]  Philip H. S. Torr,et al.  Conditional Random Fields Meet Deep Neural Networks for Semantic Segmentation , 2017 .

[49]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[50]  Jürgen Schmidhuber,et al.  Multi-column deep neural network for traffic sign classification , 2012, Neural Networks.

[51]  Abdellatif Mtibaa,et al.  Hardware implementation and validation of a traffic road sign detection and identification system , 2016, Journal of Real-Time Image Processing.

[52]  P. Yakimov,et al.  CNN Design for Real-Time Traffic Sign Recognition , 2017 .

[53]  Yi Lu Murphey,et al.  Traffic sign recognition with transfer learning , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[54]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[55]  Fatin Zaklouta,et al.  Traffic sign classification using K-d trees and Random Forests , 2011, The 2011 International Joint Conference on Neural Networks.