Cross-Domain Image Matching with Deep Feature Maps

We investigate the problem of automatically determining what type of shoe left an impression found at a crime scene. This recognition problem is made difficult by the variability in types of crime scene evidence (ranging from traces of dust or oil on hard surfaces to impressions made in soil) and the lack of comprehensive databases of shoe outsole tread patterns. We find that mid-level features extracted by pre-trained convolutional neural nets are surprisingly effective descriptors for this specialized domains. However, the choice of similarity measure for matching exemplars to a query image is essential to good performance. For matching multi-channel deep features, we propose the use of multi-channel normalized cross-correlation and analyze its effectiveness. Our proposed metric significantly improves performance in matching crime scene shoeprints to laboratory test impressions. We also show its effectiveness in other cross-domain image retrieval problems: matching facade images to segmentation labels and aerial photos to map images. Finally, we introduce a discriminatively trained variant and fine-tune our system through our proposed metric, obtaining state-of-the-art performance.

[1]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[2]  Juliet Popper Shaffer,et al.  A Multivariate Extension of the Correlation Ratio , 1974 .

[3]  J. Einax,et al.  Multivariate correlation analysis and its application in environmental analysis , 1991 .

[4]  Henry C. Lee,et al.  Advances in Fingerprint Technology , 1991 .

[5]  Robert B. Fisher,et al.  Multi-Variate Cross-Correlation and Image Matching , 1995, BMVC.

[6]  William J. Bodziak,et al.  Footwear Impression Evidence: Detection, Recovery and Examination , 1999 .

[7]  John Flynn,et al.  Automated processing of shoeprint images based on the Fourier transform for use in forensic science , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Nigel M. Allinson,et al.  Automatic Extraction and Classification of Footwear Patterns , 2006, IDEAL.

[10]  A. Bouridane,et al.  Automatic Recognition of Partial Shoeprints Using a Correlation Filter Classifier , 2008, 2008 International Machine Vision and Image Processing Conference.

[11]  Pradeep M. Patil,et al.  Rotation and intensity invariant shoeprint matching using Gabor transform with application to forensic science , 2009, Pattern Recognit..

[12]  Sergio Carrato,et al.  A Texture Based Shoe Retrieval System for Shoe Marks of Real Crime Scenes , 2009, ICIAP.

[13]  Shi-Min Hu,et al.  Sketch2Photo: internet image montage , 2009, ACM Trans. Graph..

[14]  Sargur N. Srihari,et al.  Footwear Print Retrieval System for Real Crime Scene Marks , 2010, ICWF.

[15]  Alexei A. Efros,et al.  Data-driven visual similarity for cross-domain image matching , 2011, ACM Trans. Graph..

[16]  Alexei A. Efros,et al.  Data-driven visual similarity for cross-domain image matching , 2011, ACM Trans. Graph..

[17]  Jean Ponce,et al.  Automatic alignment of paintings and photographs depicting a 3D scene , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[18]  Alexei A. Efros,et al.  Ensemble of exemplar-SVMs for object detection and beyond , 2011, 2011 International Conference on Computer Vision.

[19]  Jitendra Malik,et al.  Discriminative Decorrelation for Clustering and Classification , 2012, ECCV.

[20]  Radim Sára,et al.  Spatial Pattern Templates for Recognition of Objects with Regular Structure , 2013, GCPR.

[21]  Thomas Vetter,et al.  Unsupervised Footwear Impression Analysis and Retrieval from Crime Scene Data , 2014, ACCV Workshops.

[22]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[23]  Chia-Hung Wei,et al.  Alignment of core point for shoeprint analysis and retrieval , 2014, 2014 International Conference on Information Science, Electronics and Electrical Engineering.

[24]  Ahmed M. Elgammal,et al.  Hierarchical Semantic Hashing: Visual Localization from Buildings on Maps , 2014, 2014 22nd International Conference on Pattern Recognition.

[25]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[26]  Yann LeCun,et al.  Computing the stereo matching cost with a convolutional neural network , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Nikos Komodakis,et al.  Learning to compare image patches via convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Thomas Vetter,et al.  Probabilistic Compositional Active Basis Models for Robust Pattern Recognition , 2016, BMVC.

[29]  Xiaogang Wang,et al.  Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Shawn D. Newsam,et al.  Large-scale geolocalization of overhead imagery , 2016, SIGSPATIAL/GIS.

[31]  Marius Leordeanu,et al.  Aerial image geolocalization from recognition and matching of roads and intersections , 2016, BMVC.

[32]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Jacqueline A Speir,et al.  Classification of footwear outsole patterns using Fourier transform and local interest points. , 2017, Forensic science international.

[34]  Adam Kortylewski,et al.  Model-based image analysis for forensic shoe print recognition , 2017 .

[35]  Charless C. Fowlkes,et al.  Cross-Domain Forensic Shoeprint Matching , 2017 .