Model-Free Distortion Rectification Framework Bridged by Distortion Distribution Map

Recently, learning-based distortion rectification schemes have shown high efficiency. However, most of these methods only focus on a specific camera model with fixed parameters, thus failing to be extended to other models. To avoid such a disadvantage, we propose a model-free distortion rectification framework for the single-shot case, bridged by the distortion distribution map (DDM). Our framework is based on an observation that the pixel-wise distortion information is explicitly regular in a distorted image, despite different models having different types and numbers of distortion parameters. Motivated by this observation, instead of estimating the heterogeneous distortion parameters, we construct a proposed distortion distribution map that intuitively indicates the global distortion features of a distorted image. In addition, we develop a dual-stream feature learning module, benefitting from both the advantages of traditional methods that leverage the local handcrafted feature and learning-based methods that focus on the global semantic feature perception. Due to the sparsity of handcrafted features, we discrete the features into a 2D point map and learn the structure inspired by PointNet. Finally, a multimodal attention fusion module is designed to attentively fuse the local structural and global semantic features, providing the hybrid features for the more reasonable scene recovery. The experimental results demonstrate the excellent generalization ability and more significant performance of our method in both quantitative and qualitative evaluations, compared with the state-of-the-art methods.

[1]  Moncef Gabbouj,et al.  DR-GAN: Automatic Radial Distortion Rectification Using Conditional GAN in Real-Time , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Zhengyou Zhang,et al.  Flexible camera calibration by viewing a plane from unknown orientations , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  Daniel Santana-Cedrés,et al.  An Iterative Optimization Algorithm for Lens Distortion Correction Using Two-Parameter Models , 2016, Image Process. Line.

[4]  Maneesh Kumar Singh,et al.  DRIT++: Diverse Image-to-Image Translation via Disentangled Representations , 2019, International Journal of Computer Vision.

[5]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[6]  Andrew W. Fitzgibbon,et al.  A rational function lens distortion model for general cameras , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Roland Siegwart,et al.  A Toolbox for Easily Calibrating Omnidirectional Cameras , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  Juho Kannala,et al.  A generic camera model and calibration method for conventional, wide-angle, and fish-eye lenses , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Christian Ledig,et al.  Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Shi-Min Hu,et al.  BiggerSelfie: Selfie Video Expansion With Hand-Held Camera , 2018, IEEE Transactions on Image Processing.

[12]  Jun Yu,et al.  FishEyeRecNet: A Multi-Context Collaborative Deep Network for Fisheye Image Rectification , 2018, ECCV.

[13]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[14]  Olivier D. Faugeras,et al.  Automatic calibration and removal of distortion from scenes of structured environments , 1995, Optics & Photonics.

[15]  Jian Yang,et al.  Progressive Hard-Mining Network for Monocular Depth Estimation , 2018, IEEE Transactions on Image Processing.

[16]  Yizhou Yu,et al.  Deep Contrast Learning for Salient Object Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Andrew W. Fitzgibbon,et al.  Simultaneous linear estimation of multiple view geometry and lens distortion , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[18]  Xianghua Ying,et al.  Radial Lens Distortion Correction Using Convolutional Neural Networks Trained with Synthesized Images , 2016, ACCV.

[19]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[21]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Matthew N. Dailey,et al.  Automatic Radial Distortion Estimation from a Single Image , 2012, Journal of Mathematical Imaging and Vision.

[23]  Qi Huang,et al.  3-D Surround View for Advanced Driver Assistance Systems , 2018, IEEE Transactions on Intelligent Transportation Systems.

[24]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[25]  Jia Li,et al.  Deep3DSaliency: Deep Stereoscopic Video Saliency Detection Model by 3D Convolutional Networks , 2019, IEEE Transactions on Image Processing.

[26]  Daniel Santana-Cedrés,et al.  Automatic Lens Distortion Correction Using One-Parameter Division Models , 2014, Image Process. Line.

[27]  Daniel Thalmann,et al.  Robust 3D Hand Pose Estimation From Single Depth Images Using Multi-View CNNs , 2018, IEEE Transactions on Image Processing.

[28]  Jake K. Aggarwal,et al.  Intrinsic parameter calibration procedure for a (high-distortion) fish-eye lens camera with distortion model and accuracy estimation , 1996, Pattern Recognit..

[29]  Helder Araújo,et al.  Geometric properties of central catadioptric line images and their application in calibration , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Michel Antunes,et al.  Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach , 2013, 2013 IEEE International Conference on Computer Vision.

[31]  Maneesh Agrawala,et al.  Optimizing content-preserving projections for wide-angle images , 2009, SIGGRAPH '09.

[32]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[33]  Alexei A. Efros,et al.  Toward Multimodal Image-to-Image Translation , 2017, NIPS.

[34]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[35]  Léon Bottou,et al.  Wasserstein GAN , 2017, ArXiv.

[36]  Yao Zhao,et al.  3D saliency detection based on background detection , 2017, J. Vis. Commun. Image Represent..

[37]  Leonidas J. Guibas,et al.  Volumetric and Multi-view CNNs for Object Classification on 3D Data , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Songhwai Oh,et al.  Procrustean Normal Distribution for Non-Rigid Structure from Motion , 2017, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Adrien Bartoli,et al.  Planar Structure-from-Motion with Affine Camera Models: Closed-Form Solutions, Ambiguities and Degeneracy Analysis , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Yida Wang,et al.  Generative Model With Coordinate Metric Learning for Object Recognition Based on 3D Models , 2018, IEEE Transactions on Image Processing.

[41]  José L. Herrera,et al.  Automatic Depth Extraction from 2D Images Using a Cluster-Based Learning Framework , 2018, IEEE Transactions on Image Processing.

[42]  Sing Bing Kang,et al.  Parameter-Free Radial Distortion Correction with Center of Distortion Estimation , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Hao Guan,et al.  Structure-From-Motion in Spherical Video Using the von Mises-Fisher Distribution , 2017, IEEE Trans. Image Process..

[44]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[45]  Kyoung Mu Lee,et al.  Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Xilin Chen,et al.  Calibration of a hybrid camera network , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[47]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.