Model-based keypoint detection in Images with Radial Distortion

Keypoint detection and matching is of fundamental importance for many applications in computer and robot vision. The association of points across different views is problematic because image features can undergo significant changes in appearance. Unfortunately, state-of-the-art methods, like the Scale-Invariant Feature Transform (SIFT), are not resilient to the radial distortion that often arises in images acquired by cameras with micro-lenses and/or wide field-of-view. This article proposes modifications to the SIFT algorithm that substantially improve the repeatability of detection and effectiveness of matching under radial distortion, while preserving the original invariance to scale and rotation. The scale-space representation of the image is obtained using adaptive filtering that compensates the local distortion, and the keypoint description is carried after implicit image gradient correction. Unlike competing methods, our approach avoids image re-sampling (the processing is carried in the original image plane), it does not require accurate camera calibration (an approximate modeling of the distortion is sufficient), and it adds minimum computational overhead. Extensive experiments using both synthetic and real imagery show the superiority of our method in establishing point correspondence across images with radial distortion. 1

[1]  A. Makadia,et al.  Image processing in catadioptric planes: spatiotemporal derivatives and optical flow computation , 2002, Proceedings of the IEEE Workshop on Omnidirectional Vision 2002. Held in conjunction with ECCV'02.

[2]  Andrew W. Fitzgibbon,et al.  Simultaneous linear estimation of multiple view geometry and lens distortion , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[3]  Paulo Menezes,et al.  Ray-based Calibration of Rigid Medical Endoscopes , 2008 .

[4]  Darius Burschka,et al.  Scale-Invariant Registration of Monocular Endoscopic Images to CT-Scans for Sinus Surgery , 2004, MICCAI.

[5]  Luiz Velho,et al.  Image Processing for Computer Graphics and Vision, Second Edition , 2009, Texts in Computer Science.

[6]  João Pedro Barreto,et al.  A unifying geometric representation for central projection systems , 2006, Comput. Vis. Image Underst..

[7]  Andrew Zisserman,et al.  Multiple View Geometry in Computer Vision (2nd ed) , 2003 .

[8]  Binoy Pinto,et al.  Speeded Up Robust Features , 2011 .

[9]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[10]  Thomas Bülow Multiscale Image Processing on the Sphere , 2002, DAGM-Symposium.

[11]  Helder Araújo,et al.  Geometric properties of central catadioptric line images and their application in calibration , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Matthew A. Brown,et al.  Invariant Features from Interest Point Groups , 2002, BMVC.

[13]  Tony Lindeberg,et al.  Scale-Space Theory in Computer Vision , 1993, Lecture Notes in Computer Science.

[14]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  S. Shankar Sastry,et al.  An Invitation to 3-D Vision: From Images to Geometric Models , 2003 .

[16]  James J. Little,et al.  Mobile Robot Localization and Mapping with Uncertainty using Scale-Invariant Visual Landmarks , 2002, Int. J. Robotics Res..

[17]  Abed Malti,et al.  Feature detection and matching in images with radial distortion , 2010, 2010 IEEE International Conference on Robotics and Automation.

[18]  Peter F. Sturm,et al.  Automatic Camera Calibration Applied to Medical Endoscopy , 2009, BMVC.

[19]  David W. Murray,et al.  Towards simultaneous recognition, localization and mapping for hand-held and wearable cameras , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[20]  Wolfgang Förstner,et al.  Detecting interpretable and accurate scale-invariant keypoints , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[21]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[22]  Frank Hoffmann,et al.  Optimal Large View Visual Servoing with Sets of SIFT Features , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[23]  Shree K. Nayar,et al.  Ego-motion and omnidirectional cameras , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[24]  Thomas Bülow,et al.  Spherical Diffusion for 3D Surface Smoothing , 2004, 3DPVT.

[25]  Peter I. Corke,et al.  Wide-angle Visual Feature Matching for Outdoor Localization , 2010, Int. J. Robotics Res..

[26]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[27]  Robert Pless,et al.  A spherical eye from multiple cameras (makes better models of the world) , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[28]  Peter I. Corke,et al.  Scale invariant feature matching with wide angle images , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[29]  Bernd Jähne,et al.  Localization accuracy of region detectors , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[31]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[32]  Krystian Mikolajczyk,et al.  Detection of local features invariant to affines transformations , 2002 .

[33]  Tony Lindeberg,et al.  Feature Detection with Automatic Scale Selection , 1998, International Journal of Computer Vision.

[34]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..