Deep homography convolutional neural network for fast and accurate image registration

In this paper, a deep convolution neural network for image registration using homography transformation is proposed to improve the speed and accuracy of image registration. The four-corner homography parameterization is carried out by randomly clipping and perturbing the image block, and then the mapping from one image to another is completed to form the homography image registration dataset. In this network architecture, the homography matrix is obtained by returning the mean square error to the corner variables of the local region. In the preprocessing stage, the image is equalized by the histogram and the feature is magnified. The trained homography matrix is used for the affine transformation of the registered image to verify the effectiveness of the model. We test the dataset of homography image registration and experiment on various noises and various image enhancement effects. We also compare several traditional algorithms. The results prove that the accuracy of the proposed model is state-of-art. The processing speed of a single image is only 0.28 seconds, which has strong noise adaptability and the best performance.

[1]  Lei Zhang,et al.  Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising , 2016, IEEE Transactions on Image Processing.

[2]  Mengjie Zhang,et al.  Evolving Deep Convolutional Neural Networks for Image Classification , 2017, IEEE Transactions on Evolutionary Computation.

[3]  Vincent Lepetit,et al.  Learning Image Descriptors with Boosting , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[5]  Shuai Yang,et al.  Structure-Guided Image Inpainting Using Homography Transformation , 2018, IEEE Transactions on Multimedia.

[6]  Richard Dosselmann,et al.  A comprehensive assessment of the structural similarity index , 2011, Signal Image Video Process..

[7]  Lu Tian,et al.  OSRI: A Rotationally Invariant Binary Descriptor , 2014, IEEE Transactions on Image Processing.

[8]  Matti Pietikäinen,et al.  IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, TPAMI-2008-09-0620 1 WLD: A Robust Local Image Descriptor , 2022 .

[9]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[10]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[11]  Vincent Lepetit,et al.  BRIEF: Computing a Local Binary Descriptor Very Fast , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Jun Wu,et al.  Pornographic image detection utilizing deep convolutional neural networks , 2016, Neurocomputing.

[13]  Yang Yang,et al.  Multi-Temporal Remote Sensing Image Registration Using Deep Convolutional Features , 2018, IEEE Access.

[14]  Junjun Jiang,et al.  Image Matching from Handcrafted to Deep Features: A Survey , 2020, International Journal of Computer Vision.

[15]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Lisa M. Brown,et al.  A survey of image registration techniques , 1992, CSUR.

[17]  Jan Kautz,et al.  Geometry-Aware Learning of Maps for Camera Localization , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[18]  Vijay Kumar,et al.  Unsupervised Deep Homography: A Fast and Robust Homography Estimation Model , 2017, IEEE Robotics and Automation Letters.

[19]  Michaël Sdika,et al.  A Fast Nonrigid Image Registration With Constraints on the Jacobian Using Large Scale Constrained Optimization , 2008, IEEE Transactions on Medical Imaging.

[20]  Jian Yang,et al.  Convolution Neural Networks With Two Pathways for Image Style Recognition , 2017, IEEE Transactions on Image Processing.

[21]  Shiliang Zhang,et al.  USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval , 2014, IEEE Transactions on Image Processing.

[22]  J. Goeman,et al.  Mean squared error of ridge estimators in logistic regression , 2020, Statistica neerlandica (Print).

[23]  Bo Li,et al.  Local Feature Descriptor for Image Matching: A Survey , 2019, IEEE Access.

[24]  Adrien Bartoli,et al.  Fast Explicit Diffusion for Accelerated Features in Nonlinear Scale Spaces , 2013, BMVC.

[25]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[26]  Giovanni Ramponi,et al.  Image enhancement via adaptive unsharp masking , 2000, IEEE Trans. Image Process..

[27]  Adrien Bartoli,et al.  KAZE Features , 2012, ECCV.

[28]  Mingtao Ding,et al.  Registration of remote-sensing images using robust weighted kernel principal component analysis , 2013 .

[29]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[30]  Roland Siegwart,et al.  BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[31]  Stephen M. Smith,et al.  SUSAN—A New Approach to Low Level Image Processing , 1997, International Journal of Computer Vision.

[32]  Jan Flusser,et al.  Image registration methods: a survey , 2003, Image Vis. Comput..

[33]  J. Thimmiaraja,et al.  Histogram Equalization for Image Enhancement Using MRI Brain Images , 2014, 2014 World Congress on Computing and Communication Technologies.

[34]  Luigi di Stefano,et al.  Keypoints from symmetries by wave propagation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[36]  Mingtao Ding,et al.  A novel adaptive weights proximity matrix for image registration based on R-SIFT , 2011 .

[37]  Andriy Myronenko,et al.  Intensity-Based Image Registration by Minimizing Residual Complexity , 2010, IEEE Transactions on Medical Imaging.

[38]  Hans P. Moravec Rover Visual Obstacle Avoidance , 1981, IJCAI.

[39]  Pascal Fua,et al.  LDAHash: Improved Matching with Smaller Descriptors , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.