论文信息 - End-to-end Lane Detection through Differentiable Least-Squares Fitting

End-to-end Lane Detection through Differentiable Least-Squares Fitting

Lane detection is typically tackled with a two-step pipeline in which a segmentation mask of the lane markings is predicted first, and a lane line model like a parabola or spline is fitted to the post-processed mask next. The problem with such a two-step approach is that the parameters of the network are not optimized for the true task of interest (estimating the lane curvature parameters) but for a proxy task (segmenting the lane markings), resulting in suboptimal performance. In this work, we propose a method to train a lane detector in an end-to-end manner, directly regressing the lane parameters. The architecture consists of two components: a deep network that predicts a segmentation-ike weight map for each lane line, and a differentiable least-squares fitting module that returns for each map the parameters of the best-fitting curve in the weighted least-squares sense. These parameters can subsequently be supervised with a loss function of choice. Our method relies on the observation that it is possible to backpropagate through a least-squares fitting procedure. This leads to an end-to-end method where the features are optimized for the true task of interest: the network implicitly learns to generate features that prevent instabilities during the model fitting step, as opposed to two-step pipelines that need to handle outliers with heuristics. Additionally, the system is not just a black box but offers a degree of interpretability because the intermediately generated segmentation-like weight maps can be inspected and visualized. Code and a video is available at github.com/wvangansbeke/LaneDetection_End2End.

[1] Justin Domke,et al. Generic Methods for Optimization-Based Modeling , 2012, AISTATS.

[2] Honglak Lee,et al. Perspective Transformer Nets: Learning Single-View 3D Object Reconstruction without 3D Supervision , 2016, NIPS.

[3] Yunhe Pan,et al. Computer-Aided Industrial Design & Conceptual Design , 2009 .

[4] Luis Miguel Bergasa,et al. Efficient ConvNet for real-time semantic segmentation , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[5] O. Faugeras. Three-dimensional computer vision: a geometric viewpoint , 1993 .

[6] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[7] Luc Van Gool,et al. Towards End-to-End Lane Detection: an Instance Segmentation Approach , 2018, 2018 IEEE Intelligent Vehicles Symposium (IV).

[8] Paul J. Besl,et al. A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[9] M. Giles. An extended collection of matrix derivative results for forward and reverse mode algorithmic dieren tiation , 2008 .

[10] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .

[11] Guangqian Lu,et al. A Lane Detection, Tracking and Recognition System for Smart Vehicles , 2015 .

[12] Viorica Patraucean,et al. gvnn: Neural Network Library for Geometric Computer Vision , 2016, ECCV Workshops.

[13] Roberto Cipolla,et al. PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[14] J. Zico Kolter,et al. OptNet: Differentiable Optimization as a Layer in Neural Networks , 2017, ICML.

[15] Mohan M. Trivedi,et al. Video-based lane estimation and tracking for driver assistance: survey, system, and evaluation , 2006, IEEE Transactions on Intelligent Transportation Systems.

[16] Stephen Mann,et al. Geometric algebra for computer science - an object-oriented approach to geometry , 2007, The Morgan Kaufmann series in computer graphics.

[17] Vidya N. Murali,et al. DeepLanes: End-To-End Lane Position Estimation Using Deep Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[18] Paul J. Besl,et al. Method for registration of 3-D shapes , 1992, Other Conferences.

[19] Keshou Wu,et al. A fast and stable lane detection method based on B-spline curve , 2009, 2009 IEEE 10th International Conference on Computer-Aided Industrial Design & Conceptual Design.

[20] Xiaogang Wang,et al. Spatial As Deep: Spatial CNN for Traffic Scene Understanding , 2017, AAAI.

[21] Pierre Vandergheynst,et al. Geometric Deep Learning: Going beyond Euclidean data , 2016, IEEE Signal Process. Mag..

[22] David Pfau,et al. Unrolled Generative Adversarial Networks , 2016, ICLR.

[23] Jonathan Masci,et al. Learning shape correspondence with anisotropic convolutional neural networks , 2016, NIPS.

[24] Bastian Goldlücke,et al. Variational Analysis , 2014, Computer Vision, A Reference Guide.

[25] Luc Van Gool,et al. SURF: Speeded Up Robust Features , 2006, ECCV.

[26] Roberto Cipolla,et al. Geometric Loss Functions for Camera Pose Regression with Deep Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27] P. Holland,et al. Robust regression using iteratively reweighted least-squares , 1977 .

[28] Ersin Yumer,et al. Perspective Transformer Nets: Learning Single-View 3D Object Reconstruction without 3D Supervision , 2016, NIPS.

[29] Andrew Zisserman,et al. Spatial Transformer Networks , 2015, NIPS.

[30] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[31] Alex Kendall,et al. End-to-End Learning of Geometry and Context for Deep Stereo Regression , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[32] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[33] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[34] E. Teoh,et al. LANE DETECTION USING CATMULL-ROM SPLINE , 1998 .