SOLD2: Self-supervised Occlusion-aware Line Description and Detection

Compared to feature point detection and description, detecting and matching line segments offer additional challenges. Yet, line features represent a promising complement to points for multi-view tasks. Lines are indeed well-defined by the image gradient, frequently appear even in poorly textured areas and offer robust structural cues. We thus hereby introduce the first joint detection and description of line segments in a single deep network. Thanks to a self-supervised training, our method does not require any annotated line labels and can therefore generalize to any dataset. Our detector offers repeatable and accurate localization of line segments in images, departing from the wireframe parsing approach. Leveraging the recent progresses in descriptor learning, our proposed line descriptor is highly discriminative, while remaining robust to viewpoint changes and occlusions. We evaluate our approach against previous line detection and description methods on several multi-view datasets created with homographic warps as well as real-world viewpoint changes. Our full pipeline yields higher repeatability, localization accuracy and matching metrics, and thus represents a first step to bridge the gap with learned feature points methods. Code and trained weights are available at https://github.com/cvg/SOLD2.

[1]  Ning Ding,et al.  TP-LSD: Tri-Points Based Line Segment Detector , 2020, ECCV.

[2]  Jan-Michael Frahm,et al.  Reducing Drift in Structure From Motion Using Extended Features , 2020, 2020 International Conference on 3D Vision (3DV).

[3]  Kyungdon Joo,et al.  Non-Local Spatial Propagation Network for Depth Completion , 2020, ECCV.

[4]  Silvia L. Pintea,et al.  Deep Hough-Transform Line Priors , 2020, ECCV.

[5]  Mingyang Li,et al.  SEKD: Self-Evolving Keypoint Detection and Description , 2020, ArXiv.

[6]  Yiannis Demiris,et al.  D2D: Keypoint Extraction with Describe to Detect Approach , 2020, ACCV.

[7]  Long Quan,et al.  ASLFeat: Learning Local Features of Accurate Shape and Localization , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Gui-Song Xia,et al.  Holistically-Attracted Wireframe Parsing , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Tsun-Yi Yang,et al.  UR2KiD: Unifying Retrieval, Keypoint Detection, and Keypoint Description without Local Correspondence Supervision , 2020, ArXiv.

[10]  Manuel Lange,et al.  DLD: A Deep Learning Based Line Descriptor for Line Feature Matching , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11]  T. Pajdla,et al.  Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization? , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Henrik Karstoft,et al.  UnsuperPoint: End-to-end Unsupervised Interest Point Detector and Descriptor , 2019, ArXiv.

[13]  Gabriela Csurka,et al.  R2D2: Repeatable and Reliable Detector and Descriptor , 2019, ArXiv.

[14]  Li-Yi Wei,et al.  Learning to Reconstruct 3D Manhattan Wireframes From a Single Image , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[15]  Shenghua Gao,et al.  PPGNet: Learning Point-Pair Graph for Line Segment Detection , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Yi Ma,et al.  End-to-End Wireframe Parsing , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[17]  Torsten Sattler,et al.  Understanding the Limitations of CNN-Based Absolute Camera Pose Regression , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Victor Lempitsky,et al.  Learnable Line Segment Descriptor for Visual SLAM , 2019, IEEE Access.

[19]  Roland Siegwart,et al.  From Coarse to Fine: Robust Hierarchical Localization at Large Scale , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Gui-Song Xia,et al.  Learning Attraction Field Representation for Robust Line Segment Detection , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Kun Huang,et al.  Learning to Parse Wireframes in Images of Man-Made Environments , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22]  Tomasz Malisiewicz,et al.  SuperPoint: Self-Supervised Interest Point Detection and Description , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[23]  Torsten Sattler,et al.  Semantic Visual Localization , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Torsten Sattler,et al.  A Multi-view Stereo Benchmark with High-Resolution Images and Multi-camera Videos , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  James H. Elder,et al.  MCMLSD: A Dynamic Programming Approach to Line Segment Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Jiri Matas,et al.  Working hard to know your neighbor's margins: Local descriptor learning loss , 2017, NIPS.

[27]  Roberto Cipolla,et al.  Multi-task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[28]  Roberto Cipolla,et al.  Geometric Loss Functions for Camera Pose Regression with Deep Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Jan-Michael Frahm,et al.  Structure-from-Motion Revisited , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Daniel Rueckert,et al.  Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Li Li,et al.  Hierarchical line matching based on Line-Junction-Line structure descriptor and local homography estimation , 2016, Neurocomputing.

[32]  Jia Deng,et al.  Stacked Hourglass Networks for Human Pose Estimation , 2016, ECCV.

[33]  Kai Li,et al.  Line segment matching: A benchmark , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[34]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Daniel Snow,et al.  Line-sweep: Cross-ratio for wide-baseline matching and 3D reconstruction , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  J. M. M. Montiel,et al.  ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.

[37]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[38]  Horst Wildenauer,et al.  Structure from Motion with Line Segments Under Relaxed Endpoint Constraints , 2014, 2014 2nd International Conference on 3D Vision.

[39]  Kai Li,et al.  Robust Line Matching Based on Ray-Point-Ray Structure Descriptor , 2014, ACCV Workshops.

[40]  Luc Van Gool,et al.  Scale-invariant line descriptors for wide baseline matching , 2014, IEEE Winter Conference on Applications of Computer Vision.

[41]  Reinhard Koch,et al.  An efficient and robust line segment matching approach based on LBD descriptor and pairwise geometric consistency , 2013, J. Vis. Commun. Image Represent..

[42]  Zhanyi Hu,et al.  Robust line matching through line-point invariants , 2012, Pattern Recognit..

[43]  Cuneyt Akinlar,et al.  Edlines: Real-time line segment detection by Edge Drawing (ed) , 2011, 2011 18th IEEE International Conference on Image Processing.

[44]  Zhiheng Wang,et al.  Extend point descriptors for line, curve and region matching , 2010, 2010 International Conference on Machine Learning and Cybernetics.

[45]  Zhanyi Hu,et al.  Line matching leveraged by point correspondences , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[46]  Sukhan Lee,et al.  A novel line matching method based on intersection context , 2010, 2010 IEEE International Conference on Robotics and Automation.

[47]  Rafael Grompone von Gioi,et al.  LSD: A Fast Line Segment Detector with a False Detection Control , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Zhiheng Wang,et al.  HLD: A robust descriptor for line matching , 2009, 2009 11th IEEE International Conference on Computer-Aided Design and Computer Graphics.

[49]  Lu Wang,et al.  Wide-baseline image matching using Line Signatures , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[50]  Zhanyi Hu,et al.  MSLD: A robust descriptor for line matching , 2009, Pattern Recognit..

[51]  James H. Elder,et al.  Efficient Edge-Based Methods for Estimating Manhattan Frames in Urban Imagery , 2008, ECCV.

[52]  Luc Van Gool,et al.  Wide-baseline stereo matching with line segments , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[53]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[54]  Manolis I. A. Lourakis,et al.  Matching disparate views of planar surfaces using projective invariants , 2000, Image Vis. Comput..

[55]  Cordelia Schmid,et al.  Automatic line matching across views , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[56]  David J. Kriegman,et al.  Structure and motion from line segments in multiple images , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[57]  Narendra Ahuja,et al.  Motion and Structure from Line Correspondences; Closed-Form Solution, Uniqueness, and Optimization , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[58]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[59]  Manuel Lange,et al.  WLD: A Wavelet and Learning based Line Descriptor for Line Feature Matching , 2020, VMV.

[60]  Alexei A. Efros,et al.  Feature Matching , 2017, Encyclopedia of GIS.

[61]  Krystian Mikolajczyk,et al.  Learning local feature descriptors with triplets and shallow convolutional neural networks , 2016, BMVC.

[62]  Jan-Michael Frahm,et al.  Reconstructing the World* in Six Days *(As Captured by the Yahoo 100 Million Image Dataset) , 2015, CVPR 2015.

[63]  Hideo Saito,et al.  Fast Line Description for Line-based SLAM , 2012, BMVC.

[64]  Jiri Matas,et al.  Fixing the Locally Optimized RANSAC , 2012, BMVC.

[65]  Jean-Christophe Nebel,et al.  Bioinformatics Inspired Algorithm for Stereo Correspondence , 2011, VISAPP.

[66]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .