Hybrid conditional random field based camera-LIDAR fusion for road detection

Abstract Road detection is one of the key challenges for autonomous vehicles. Two kinds of sensors are commonly used for road detection: cameras and LIDARs. However, each of them suffers from some inherent drawbacks. Thus, sensor fusion is commonly used to combine the merits of these two kinds of sensors. Nevertheless, current sensor fusion methods are dominated by either cameras or LIDARs rather than making the best of both. In this paper, we extend the conditional random field (CRF) model and propose a novel hybrid CRF model to fuse the information from camera and LIDAR. After aligning the LIDAR points and pixels, we take the labels (either road or background) of the pixels and LIDAR points as random variables and infer the labels via minimization of a hybrid energy function. Boosted decision tree classifiers are learned to predict the unary potentials of both the pixels and LIDAR points. The pairwise potentials in the hybrid model encode (i) the contextual consistency in the image, (ii) the contextual consistency in the point cloud, and (iii) the cross-modal consistency between the aligned pixels and LIDAR points. This model integrates the information from the two sensors in a probabilistic way and makes good use of both sensors. The hybrid CRF model can be optimized efficiently with graph cuts to get road areas. Extensive experiments have been conducted on the KITTI-ROAD benchmark dataset and the experimental results show that the proposed method outperforms the current methods.

[1]  Antonio M. López,et al.  Road Detection Based on Illuminant Invariance , 2011, IEEE Transactions on Intelligent Transportation Systems.

[2]  Fang Yuqiang,et al.  Unstructured road segmentation method based on dictionary learning and sparse representation , 2013 .

[3]  Luis Miguel Bergasa,et al.  Fast pixelwise road inference based on Uniformly Reweighted Belief Propagation , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[4]  Paulo Peixoto,et al.  3D Lidar-based static and moving obstacle detection in driving environments: An approach based on voxels and multi-region ground planes , 2016, Robotics Auton. Syst..

[5]  Patrick Y. Shinzato,et al.  Fast visual road recognition and horizon detection using multiple artificial neural networks , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[6]  Ankit Laddha,et al.  Map-supervised road detection , 2016, 2016 IEEE Intelligent Vehicles Symposium (IV).

[7]  Vladimir Kolmogorov,et al.  An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Di Guo,et al.  Extreme Kernel Sparse Learning for Tactile Object Recognition , 2017, IEEE Transactions on Cybernetics.

[9]  Vladimir Kolmogorov,et al.  What energy functions can be minimized via graph cuts? , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Fuchun Sun,et al.  Fusion tracking in color and infrared images using joint sparse representation , 2012, Science China Information Sciences.

[11]  Seiichi Mita,et al.  Graph-based 2D road representation of 3D point clouds for intelligent vehicles , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[12]  Myungchan Roh,et al.  Drivable Road Detection with 3D Point Clouds Based on the MRF for Intelligent Vehicle , 2013, FSR.

[13]  Ronen Lerner,et al.  Recent progress in road and lane detection: a survey , 2012, Machine Vision and Applications.

[14]  Yann LeCun,et al.  Semantic Road Segmentation via Multi-scale Ensembles of Learned Features , 2012, ECCV Workshops.

[15]  Dawei Zhao,et al.  Monocular Road Detection Using Structured Random Forest , 2016 .

[16]  Vincent Frémont,et al.  Color-based road detection and its evaluation on the KITTI road benchmark , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[17]  Franz Kummert,et al.  Monocular Road Terrain Detection by Combining Visual and Spatial Information , 2014, IEEE Transactions on Intelligent Transportation Systems.

[18]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[19]  Giovani Bernardes Vitor,et al.  A probabilistic distribution approach for the classification of urban roads in complex environments , 2014 .

[20]  Martial Hebert,et al.  Directional Associative Markov Network for 3-D Point Cloud Classification , 2008 .

[21]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[22]  Huimin Ma,et al.  3D Object Proposals for Accurate Object Class Detection , 2015, NIPS.

[23]  Di Guo,et al.  Structured Output-Associated Dictionary Learning for Haptic Understanding , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[24]  Hong Liu,et al.  Likelihood confidence rating based multi-modal information fusion for robot fine operation , 2014, 2014 13th International Conference on Control Automation Robotics & Vision (ICARCV).

[25]  Yunhui Liu,et al.  Robust Exemplar Extraction Using Structured Sparse Coding , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[26]  Vincent Frémont,et al.  Vision-Based Road Detection using Contextual Blocks , 2015, ArXiv.

[27]  Tao Wu,et al.  Robust road detection from a single image using road shape prior , 2013, 2013 IEEE International Conference on Image Processing.

[28]  Bin Dai,et al.  Gaussian-Process-Based Real-Time Ground Segmentation for Autonomous Land Vehicles , 2013, Journal of Intelligent & Robotic Systems.

[29]  Antonio Criminisi,et al.  TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation , 2006, ECCV.

[30]  Ernst D. Dickmanns,et al.  Recursive 3-D Road and Relative Ego-State Recognition , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Bertrand Douillard,et al.  On the segmentation of 3D LIDAR point clouds , 2011, 2011 IEEE International Conference on Robotics and Automation.

[32]  Wenbing Tao,et al.  Spatial adjacent bag of features with multiple superpixels for object segmentation and classification , 2014, Inf. Sci..

[33]  Fuchun Sun,et al.  Visual–Tactile Fusion for Object Recognition , 2017, IEEE Transactions on Automation Science and Engineering.

[34]  Ethan Fetaya,et al.  StixelNet: A Deep Convolutional Network for Obstacle Detection and Road Segmentation , 2015, BMVC.

[35]  Xiaojin Gong,et al.  Road scene segmentation via fusing camera and lidar data , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[36]  Jannik Fritsch,et al.  A new performance measure and evaluation benchmark for road detection algorithms , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[37]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[38]  Di Guo,et al.  Object Recognition Using Tactile Measurements: Kernel Sparse Coding Methods , 2016, IEEE Transactions on Instrumentation and Measurement.

[39]  Sebastian Thrun,et al.  Self-supervised Monocular Road Detection in Desert Terrain , 2006, Robotics: Science and Systems.

[40]  Franz Kummert,et al.  Spatial ray features for real-time ego-lane extraction , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.

[41]  Martial Hebert,et al.  Stacked Hierarchical Labeling , 2010, ECCV.

[42]  Alessandro Corrêa Victorino,et al.  Comprehensive performance analysis of road detection algorithms using the common urban Kitti-road benchmark , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[43]  Martial Hebert,et al.  Natural terrain classification using three‐dimensional ladar data for ground robot mobility , 2006, J. Field Robotics.

[44]  Sebastian Thrun,et al.  Stanley: The robot that won the DARPA Grand Challenge , 2006, J. Field Robotics.

[45]  Rahul Mohan,et al.  Deep Deconvolutional Networks for Scene Parsing , 2014, ArXiv.

[46]  Avideh Zakhor,et al.  Sensor fusion for semantic segmentation of urban scenes , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[47]  Denis Fernando Wolf,et al.  Road terrain detection: Avoiding common obstacle detection assumptions using sensor fusion , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[48]  Jun Miao,et al.  Vehicle detection in driving simulation using extreme learning machine , 2014, Neurocomputing.

[49]  Christoph Stiller,et al.  Segmentation of 3D lidar data in non-flat urban environments using a local convexity criterion , 2009, 2009 IEEE Intelligent Vehicles Symposium.

[50]  Vincent Frémont,et al.  Exploiting fully convolutional neural networks for fast road detection , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[51]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52]  Amnon Shashua,et al.  Off-road Path Following using Region Classification and Geometric Projection Constraints , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[53]  F. SergioA.Rodriguez,et al.  A multi-modal system for road detection and segmentation , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[54]  Liang Xiao,et al.  CRF based road detection with multi-sensor fusion , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).