Deep Multilabel CNN for Forensic Footwear Impression Descriptor Identification

In recent years, deep neural networks have become the workhorse of computer vision. In this paper, we employ a deep learning approach to classify features of footwear impressions, known as \emph{descriptors}, for forensic use cases. As part of this work, we develop and evaluate an effective technique for feeding downsampled greyscale impressions to a neural network pre-trained on data from a different domain. Our approach relies on a learnable preprocessing layer paired with multiple interpolation methods used in parallel. We show empirically that this technique outperforms using a single type of interpolated image without learnable preprocessing, and that it can help avoid the computational penalty of high-resolution inputs by making more efficient use of low-resolution ones. We also investigate the effect of preserving the aspect ratio of the inputs, which leads to a considerable boost in accuracy without increasing the computational budget relative to squished rectangular images. Finally, we formulate a set of best practices for transfer learning with greyscale inputs, potentially applicable to a wide range of computer vision tasks, from footwear impression classification to medical imaging.
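The idea of a learnable preprocessing layer fed by several interpolation methods in parallel can be illustrated with a minimal PyTorch sketch. This is not the authors' implementation. The class name `MultiInterpPreprocess`, the choice of nearest, bilinear and bicubic interpolation, the 1x1 convolution, and the averaging initialisation are all assumptions made for illustration. The sketch downsamples one greyscale image in three ways, stacks the results as channels, and lets a trainable layer map them to the three channels that an ImageNet-style pre-trained backbone would expect.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiInterpPreprocess(nn.Module):
    """Hypothetical sketch: downsample a greyscale image with several
    interpolation methods in parallel, then mix the resulting views with a
    learnable 1x1 convolution."""

    def __init__(self, out_size, modes=("nearest", "bilinear", "bicubic")):
        super().__init__()
        self.out_size = out_size  # (height, width) of the downsampled input
        self.modes = modes
        # Learnable preprocessing: maps len(modes) interpolated views to the
        # 3 channels a backbone pre-trained on RGB images expects.
        self.mix = nn.Conv2d(len(modes), 3, kernel_size=1)
        # Assumed initialisation: start as a plain average of the views.
        nn.init.constant_(self.mix.weight, 1.0 / len(modes))
        nn.init.zeros_(self.mix.bias)

    def forward(self, x):
        # x: (batch, 1, H, W) greyscale impressions
        views = []
        for mode in self.modes:
            if mode in ("bilinear", "bicubic"):
                v = F.interpolate(x, size=self.out_size, mode=mode,
                                  align_corners=False)
            else:
                v = F.interpolate(x, size=self.out_size, mode=mode)
            views.append(v)
        stacked = torch.cat(views, dim=1)  # (batch, len(modes), h, w)
        return self.mix(stacked)           # (batch, 3, h, w)


if __name__ == "__main__":
    batch = torch.rand(2, 1, 512, 256)  # tall inputs with a 2:1 aspect ratio
    layer = MultiInterpPreprocess(out_size=(256, 128))  # keeps the 2:1 ratio
    print(layer(batch).shape)  # torch.Size([2, 3, 256, 128])
```

In this sketch the output size keeps the 2:1 aspect ratio of the input instead of squashing it into a square, in the spirit of the aspect-ratio experiment described above. The output of the layer would then be passed to the pre-trained network, with the mixing weights trained jointly with it.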
