论文信息 - DLT-Net: Joint Detection of Drivable Areas, Lane Lines, and Traffic Objects

DLT-Net: Joint Detection of Drivable Areas, Lane Lines, and Traffic Objects

Perception is an essential task for self-driving cars, but most perception tasks are usually handled independently. We propose a unified neural network named DLT-Net to detect drivable areas, lane lines, and traffic objects simultaneously. These three tasks are most important for autonomous driving, especially when a high-definition map and accurate localization are unavailable. Instead of separating tasks in the decoder, we construct context tensors between sub-task decoders to share designate influence among tasks. Therefore, each task can benefit from others during multi-task learning. Experiments show that our model outperforms the conventional multi-task network in terms of the task-wise accuracy and the overall computational efficiency, in the challenging BDD dataset.

[1] S. Kolski,et al. Detection, prediction, and avoidance of dynamic obstacles in urban environments , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[2] Sebastian Ramos,et al. The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Ali Farhadi,et al. YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Zhe Chen,et al. RBNet: A Deep Neural Network for Unified Road and Road Boundary Detection , 2017, ICONIP.

[5] Ming Yang,et al. Pedestrian Feature Generation in Fish-Eye Images via Adversary , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[6] Jean-Claude Latombe,et al. Robot Motion Planning: A Distributed Representation Approach , 1991, Int. J. Robotics Res..

[7] Ignacio Parra,et al. Deep fully convolutional networks with random data augmentation for enhanced generalization in road detection , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).

[8] Klaus C. J. Dietmayer,et al. A random finite set approach to multiple lane detection , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.

[9] Guoyan Xu,et al. Computer vision-based multiple-lane detection on straight road and in a curve , 2010, 2010 International Conference on Image Analysis and Signal Processing.

[10] Kaiming He,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11] Weiqiang Ren,et al. LaneNet: Real-Time Lane Detection Networks for Autonomous Driving , 2018, ArXiv.

[12] Roberto Cipolla,et al. MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving , 2016, 2018 IEEE Intelligent Vehicles Symposium (IV).

[13] Xiaogang Wang,et al. Spatial As Deep: Spatial CNN for Traffic Scene Understanding , 2017, AAAI.

[14] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Sergiu Nedevschi,et al. Real-time object detection using a sparse 4-layer LIDAR , 2017, 2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP).

[16] Yi Li,et al. R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[17] A J McKnight,et al. The effect of lane line width and contrast upon lanekeeping. , 1998, Accident; analysis and prevention.

[18] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.

[20] Luc Van Gool,et al. Towards End-to-End Lane Detection: an Instance Segmentation Approach , 2018, 2018 IEEE Intelligent Vehicles Symposium (IV).

[21] Rong Chen,et al. Roadside Magnetic Sensor System for Vehicle Detection in Urban Environments , 2018, IEEE Transactions on Intelligent Transportation Systems.

[22] Lennart Svensson,et al. Fast LIDAR-based road detection using fully convolutional neural networks , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[23] Mohamed Aly,et al. Real time detection of lane markers in urban streets , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[24] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Wei-Hua Chieng,et al. Estimating Speed Using a Side-Looking Single-Radar Vehicle Detector , 2014, IEEE Transactions on Intelligent Transportation Systems.

[26] Ali Farhadi,et al. YOLOv3: An Incremental Improvement , 2018, ArXiv.

[27] Julien Mairal,et al. BlitzNet: A Real-Time Deep Network for Scene Understanding , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[28] Ross B. Girshick,et al. Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29] Andreas Geiger,et al. Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[30] Eduardo Romera,et al. ERFNet: Efficient Residual Factorized ConvNet for Real-Time Semantic Segmentation , 2018, IEEE Transactions on Intelligent Transportation Systems.

[31] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[32] Luis Miguel Bergasa,et al. Efficient ConvNet for real-time semantic segmentation , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[33] Ronan Collobert,et al. Learning to Refine Object Segments , 2016, ECCV.

[34] Qingquan Li,et al. A Sensor-Fusion Drivable-Region and Lane-Detection System for Autonomous Vehicle Navigation in Challenging Road Scenarios , 2014, IEEE Transactions on Vehicular Technology.

[35] Fuqiang Zhou,et al. FSSD: Feature Fusion Single Shot Multibox Detector , 2017, ArXiv.

[36] Wolfram Burgard,et al. Efficient deep models for monocular road segmentation , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[37] Zhuowen Tu,et al. Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Hongdong Li,et al. Semisupervised and Weakly Supervised Road Detection Based on Generative Adversarial Networks , 2018, IEEE Signal Processing Letters.

[39] Trevor Darrell,et al. Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40] Chen Fu,et al. Camera-Based Semantic Enhanced Vehicle Segmentation for Planar LIDAR , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[41] Ross B. Girshick,et al. Mask R-CNN , 2017, 1703.06870.

[42] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[43] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[44] Rogério Schmidt Feris,et al. A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection , 2016, ECCV.

[45] Jaehoon Jung,et al. Object Recognition, Segmentation, and Classification of Mobile Laser Scanning Point Clouds: A State of the Art Review , 2019, Sensors.