Evaluation of Distance Measures for Feature based Image Registration using AlexNet

Image registration is a classic problem in computer vision, with applications in areas such as defence, remote sensing and medicine. Feature-based image registration methods have traditionally relied on hand-crafted feature extraction algorithms, which detect key points in an image and describe each one using a region around the point. Such features are matched by applying a threshold either to the distances computed between feature descriptors or to the ratio of those distances. The evolution of deep learning, in particular convolutional neural networks (CNNs), has enabled researchers to address several vision problems such as recognition, tracking and localization. The outputs of the convolutional or fully connected layers of a CNN trained for an application such as visual recognition have proved effective when used as features in other applications such as retrieval. In this work, a deep CNN, AlexNet, is used in place of hand-crafted features for feature extraction in the first stage of image registration. This, however, requires identifying a suitable distance measure and matching method to obtain effective results. Several distance metrics are evaluated within the framework of the nearest neighbour and nearest neighbour ratio matching methods on a benchmark dataset. Evaluation is done by comparing matching and registration performance using metrics computed from ground truth.
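To make the two matching schemes concrete, the following is a minimal Python sketch of nearest neighbour matching (a threshold on the smallest distance) and nearest neighbour ratio matching (a threshold on the ratio of the smallest to the second-smallest distance). It is an illustration under stated assumptions, not the implementation evaluated in this work: the abstract does not list the distance metrics compared, so Euclidean, Manhattan and cosine distance are used only as common examples, and the function names, the threshold values and the toy descriptors are all hypothetical. In practice the descriptors would be feature vectors taken from AlexNet layers.

```python
import math


def euclidean(a, b):
    """L2 distance between two equal-length descriptors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """L1 distance between two equal-length descriptors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def cosine_distance(a, b):
    """1 - cosine similarity; returns 1.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def match_nn(desc_a, desc_b, dist, max_dist):
    """Nearest neighbour matching: keep (i, j) when the closest
    descriptor j in desc_b lies within max_dist of descriptor i."""
    matches = []
    for i, a in enumerate(desc_a):
        dists = [dist(a, b) for b in desc_b]
        if not dists:
            continue
        j = min(range(len(dists)), key=dists.__getitem__)
        if dists[j] <= max_dist:
            matches.append((i, j))
    return matches


def match_nn_ratio(desc_a, desc_b, dist, max_ratio=0.8):
    """Nearest neighbour ratio matching: keep (i, j) when the closest
    distance is at most max_ratio times the second-closest distance.
    The default of 0.8 is an assumed, illustrative value."""
    matches = []
    for i, a in enumerate(desc_a):
        ranked = sorted((dist(a, b), j) for j, b in enumerate(desc_b))
        if len(ranked) < 2:
            continue  # the ratio needs two candidates
        (d1, j1), (d2, _) = ranked[0], ranked[1]
        if d2 > 0 and d1 / d2 <= max_ratio:
            matches.append((i, j1))
    return matches


if __name__ == "__main__":
    # Toy 2-D descriptors standing in for CNN feature vectors.
    reference = [[1.0, 0.0], [0.0, 1.0]]
    sensed = [[0.9, 0.1], [0.0, 1.1], [5.0, 5.0]]
    for name, dist in [("euclidean", euclidean),
                       ("manhattan", manhattan),
                       ("cosine", cosine_distance)]:
        print(name, match_nn_ratio(reference, sensed, dist))
```

The ratio test rejects ambiguous correspondences that a plain distance threshold would accept: when two candidates are almost equally close, the ratio approaches 1 and the match is discarded. Because the metrics differ in scale, a distance threshold suited to one metric generally does not transfer to another, which is one reason the choice of metric and of matching method has to be evaluated together.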
