Fast and Accurate Lane Detection via Frequency Domain Learning

It is desirable to maintain both high accuracy and runtime efficiency in lane detection. State-of-the-art methods mainly address the efficiency problem by direct compression of high-dimensional features. These methods usually suffer from information loss and cannot achieve satisfactory accuracy performance. To ensure the diversity of features and subsequently maintain information as much as possible, we introduce multi-frequency analysis into lane detection. Specifically, we propose a multi-spectral feature compressor (MSFC) based on two-dimensional (2D) discrete cosine transform (DCT) to compress features while preserving diversity information. We group features and associate each group with an individual frequency component, which incurs only 1/7 overhead of one-dimensional convolution operation but preserves more information. Moreover, to further enhance the discriminability of features, we design a multi-spectral lane feature aggregator (MSFA) based on one-dimensional (1D) DCT to aggregate features from each lane according to their corresponding frequency components. The proposed method outperforms the state-of-the-art methods (including LaneATT and UFLD) on TuSimple, CULane, and LLAMAS benchmarks. For example, our method achieves 76.32% F1 at 237 FPS and 76.98% F1 at 164 FPS on CULane, which is 1.23% and 0.30% higher than LaneATT. Our code and models are available at https://github.com/harrylin-hyl/MSLD.

[1]  Huanyu Wang,et al.  Ultra Fast Structure-aware Deep Lane Detection , 2020, ECCV.

[2]  Jonah Philion,et al.  FastDraw: Addressing the Long Tail of Lane Detection by Adapting a Sequential Prediction Network , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Chunxiao Liu,et al.  Inter-Region Affinity Distillation for Road Marking Segmentation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Xiaogang Wang,et al.  Spatial As Deep: Spatial CNN for Traffic Scene Understanding , 2017, AAAI.

[5]  Michael Carbin,et al.  The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks , 2018, ICLR.

[6]  Pavlo Molchanov,et al.  Importance Estimation for Neural Network Pruning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[8]  Hiren M. Mandalia,et al.  Using Support Vector Machines for Lane-Change Detection , 2005 .

[9]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[10]  Zhijian Liu,et al.  HAQ: Hardware-Aware Automated Quantization With Mixed Precision , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Yi Yang,et al.  Lane Detection in Low-light Conditions Using an Efficient Data Enhancement: Light Conditions Style Transfer , 2020, 2020 IEEE Intelligent Vehicles Symposium (IV).

[12]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Zejian Yuan,et al.  End-to-end Lane Shape Prediction with Transformers , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[14]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[15]  Preeti R. Bajaj,et al.  Road Lane Detection with Improved Canny Edges Using Ant Colony Optimization , 2010, 2010 3rd International Conference on Emerging Trends in Engineering and Technology.

[16]  Wei Zhang,et al.  CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending , 2020, ECCV.

[17]  Jian Yang,et al.  Line-CNN: End-to-End Traffic Line Detection With Line Proposal Unit , 2020, IEEE Transactions on Intelligent Transportation Systems.

[18]  Alberto Ferreira de Souza,et al.  PolyLaneNet: Lane Estimation via Deep Polynomial Regression , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[19]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[21]  Jason Yosinski,et al.  Faster Neural Networks Straight from JPEG , 2018, NeurIPS.

[22]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[23]  Deng Cai,et al.  RESA: Recurrent Feature-Shift Aggregator for Lane Detection , 2021, AAAI.

[24]  Zhenpeng Chen,et al.  PointLaneNet: Efficient end-to-end CNNs for Accurate Real-Time Lane Detection , 2019, 2019 IEEE Intelligent Vehicles Symposium (IV).

[25]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Fei Wu,et al.  FcaNet: Frequency Channel Attention Networks , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[27]  Seungwoo Yoo,et al.  End-to-End Lane Marker Detection via Row-wise Classification , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[28]  Larry S. Davis,et al.  Deep Residual Learning in the JPEG Transform Domain , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[29]  Chen Change Loy,et al.  Learning Lightweight Lane Detection CNNs by Self Attention Distillation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[30]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).