Road segmentation via iterative deep analysis

Nowadays, people are increasingly concerned about the safety of traffic systems. Road segmentation and recognition is a fundamental problem in perceiving traffic environments and serve as the basis for self-driving cars. In this paper, inspired by an iterative deep analysis thinking, we propose a novel method which is able to learning powerful features step by step, and solve the optimal precision by balancing local and global information to conduct pixel-level classification for road segmentation. Firstly, we introduce an iterative deep analysis thinking which shows that how to design a strong and robustness deep model from failure experience. Secondly, we choose a powerful global features learning network as basis to create a novel framework for our task. Meanwhile, we employ the patch and multi-scale pyramid as input to enhance local features learning. We conduct experiments on three datasets from KITTI Vision Benchmark, namely UU, UM, UMM. The experimental results demonstrate that our proposed method obtains comparable performance with state-of-the-art methods on these datasets.

[1]  Rob Fergus,et al.  Depth Map Prediction from a Single Image using a Multi-Scale Deep Network , 2014, NIPS.

[2]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Jannik Fritsch,et al.  A new performance measure and evaluation benchmark for road detection algorithms , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[4]  A. López,et al.  Novel Index for Objective Evaluation of Road Detection Algorithms , 2008, 2008 11th International IEEE Conference on Intelligent Transportation Systems.

[5]  Xiaogang Wang,et al.  Hierarchical face parsing via deep learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Ronan Collobert,et al.  Recurrent Convolutional Neural Networks for Scene Parsing , 2013, ArXiv.

[7]  Yann LeCun,et al.  Road Scene Segmentation from a Single Image , 2012, ECCV.

[8]  Yann LeCun,et al.  Convolutional networks and applications in vision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[9]  Gang Wang,et al.  Multi-modal Unsupervised Feature Learning for RGB-D Scene Labeling , 2014, ECCV.

[10]  Ronan Collobert,et al.  Recurrent Convolutional Neural Networks for Scene Labeling , 2014, ICML.

[11]  한보형,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015 .

[12]  Theo Gevers,et al.  3D Scene priors for road detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Anthony Cowley,et al.  Parsing Indoor Scenes Using RGB-D Imagery , 2012, Robotics: Science and Systems.

[14]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[15]  Yann LeCun,et al.  Indoor Semantic Segmentation using depth information , 2013, ICLR.

[16]  Xiaogang Wang,et al.  Deep Learning Face Representation from Predicting 10,000 Classes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Rahul Mohan,et al.  Deep Deconvolutional Networks for Scene Parsing , 2014, ArXiv.

[18]  Graham W. Taylor,et al.  Adaptive deconvolutional networks for mid and high level feature learning , 2011, 2011 International Conference on Computer Vision.

[19]  Rob Fergus,et al.  Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[20]  Jitendra Malik,et al.  Learning Rich Features from RGB-D Images for Object Detection and Segmentation , 2014, ECCV.

[21]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[22]  Takashi Naito,et al.  Multiband Image Segmentation and Object Recognition for Understanding Road Scenes , 2011, IEEE Transactions on Intelligent Transportation Systems.

[23]  Liang Lin,et al.  Deep Joint Task Learning for Generic Object Extraction , 2014, NIPS.