Map-supervised road detection

We propose an approach to detect drivable road area in monocular images. It is a self-supervised approach which doesn't require any human road annotations on images to train the road detection algorithm. Our approach reduces human labeling effort and makes training scalable. We combine the best of both supervised and unsupervised methods in our approach. First, we automatically generate training road annotations for images using OpenStreetMap1, vehicle pose estimation sensors, and camera parameters. Next, we train a Convolutional Neural Network (CNN) for road detection using these annotations. We show that we are able to generate reasonably accurate training annotations in KITTI data-set [1]. We achieve state-of-the-art performance among the methods which do not require human annotation effort.

[1]  Yann LeCun,et al.  Road Scene Segmentation from a Single Image , 2012, ECCV.

[2]  Martial Hebert,et al.  Stacked Hierarchical Labeling , 2010, ECCV.

[3]  Geoffrey E. Hinton,et al.  Learning to Label Aerial Images from Noisy Data , 2012, ICML.

[4]  Philip Chan,et al.  Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[5]  Ethan Fetaya,et al.  StixelNet: A Deep Convolutional Network for Obstacle Detection and Road Segmentation , 2015, BMVC.

[6]  Paul Newman,et al.  A variational approach to online road and path segmentation with monocular vision , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Iasonas Kokkinos,et al.  Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.

[8]  C. Lawrence Zitnick,et al.  Fast Edge Detection Using Structured Forests , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Alexei A. Efros,et al.  Geometric context from a single image , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[10]  Sebastian Thrun,et al.  Adaptive Road Following using Self-Supervised Learning and Reverse Optical Flow , 2005, Robotics: Science and Systems.

[11]  Hsu-Yung Cheng,et al.  Lane Detection With Moving Vehicles in the Traffic Scenes , 2006, IEEE Transactions on Intelligent Transportation Systems.

[12]  Raquel Urtasun,et al.  Estimating Drivable Collision-Free Space from Monocular Video , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[13]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[14]  Vladlen Koltun,et al.  Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.

[15]  Tommy Chang,et al.  Color model-based real-time learning for road following , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[16]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[17]  Denis Fernando Wolf,et al.  Road terrain detection: Avoiding common obstacle detection assumptions using sensor fusion , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[18]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[19]  Jannik Fritsch,et al.  A new performance measure and evaluation benchmark for road detection algorithms , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[20]  Liang Xiao,et al.  CRF based road detection with multi-sensor fusion , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[21]  Nick Barnes,et al.  Large-scale semantic co-labeling of image sets , 2014, IEEE Winter Conference on Applications of Computer Vision.

[22]  Rahul Mohan,et al.  Deep Deconvolutional Networks for Scene Parsing , 2014, ArXiv.

[23]  Jean Ponce,et al.  General Road Detection From a Single Image , 2010, IEEE Transactions on Image Processing.

[24]  Kiyoshi Irie,et al.  Road recognition from a single image using prior information , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Antonio M. López,et al.  Road Detection Based on Illuminant Invariance , 2011, IEEE Transactions on Intelligent Transportation Systems.

[26]  Theo Gevers,et al.  Combining Priors, Appearance, and Context for Road Detection , 2014, IEEE Transactions on Intelligent Transportation Systems.

[27]  Sanja Fidler,et al.  Holistic 3D scene understanding from a single geo-tagged image , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).