Tattoo Image Search at Scale: Joint Detection and Compact Representation Learning

The explosive growth of digital images in video surveillance and social media has led to the significant need for efficient search of persons of interest in law enforcement and forensic applications. Despite tremendous progress in primary biometric traits (e.g., face and fingerprint) based person identification, a single biometric trait alone can not meet the desired recognition accuracy in forensic scenarios. Tattoos, as one of the important soft biometric traits, have been found to be valuable for assisting in person identification. However, tattoo search in a large collection of unconstrained images remains a difficult problem, and existing tattoo search methods mainly focus on matching cropped tattoos, which is different from real application scenarios. To close the gap, we propose an efficient tattoo search approach that is able to learn tattoo detection and compact representation jointly in a single convolutional neural network (CNN) via multi-task learning. While the features in the backbone network are shared by both tattoo detection and compact representation learning, individual latent layers of each sub-network optimize the shared features toward the detection and feature learning tasks, respectively. We resolve the small batch size issue inside the joint tattoo detection and compact representation learning network via random image stitch and preceding feature buffering. We evaluate the proposed tattoo search system using multiple public-domain tattoo benchmarks, and a gallery set with about 300K distracter tattoo images compiled from these datasets and images from the Internet. In addition, we also introduce a tattoo sketch dataset containing 300 tattoos for sketch-based tattoo search. Experimental results show that the proposed approach has superior performance in tattoo detection and tattoo search at scale compared to several state-of-the-art tattoo retrieval algorithms.

[1]  Jiwen Lu,et al.  Deep hashing for compact binary codes learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  Xiaogang Wang,et al.  Joint Detection and Identification Feature Learning for Person Search , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[5]  Wei Liu,et al.  Supervised Discrete Hashing , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Rong Jin,et al.  Tattoo-ID: Automatic Tattoo Image Retrieval for Suspect and Victim Identification , 2007, PCM.

[7]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Vishal M. Patel,et al.  Deep Tattoo Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[10]  George W. Quinn,et al.  Tattoo Recognition Technology - Challenge (Tatt-C): Outcomes and Recommendations , 2015 .

[11]  Qi Tian,et al.  Scalar quantization for large scale image search , 2012, ACM Multimedia.

[12]  Anil K. Jain,et al.  Matching Composite Sketches to Face Photos: A Component-Based Approach , 2013, IEEE Transactions on Information Forensics and Security.

[13]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Jinhui Tang,et al.  Supervised Quantization for Similarity Search , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Victor Lempitsky,et al.  Additive Quantization for Extreme Vector Compression , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[17]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[18]  Qi Tian,et al.  Image Classification and Retrieval are ONE , 2015, ICMR.

[19]  Alphonse Bertillon,et al.  Signaletic instructions including the theory and practice of anthropometrical identification , 2022 .

[20]  Rongrong Ji,et al.  Supervised hashing with kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Jingdong Wang,et al.  Composite Quantization for Approximate Nearest Neighbor Search , 2014, ICML.

[22]  Geoffrey E. Hinton,et al.  Learning a Nonlinear Embedding by Preserving Class Neighbourhood Structure , 2007, AISTATS.

[23]  Kristen Grauman,et al.  Kernelized locality-sensitive hashing for scalable image search , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[24]  R. McCabe,et al.  Information Technology: American National Standard for Information Systems: Data Format for the Interchange of Fingerprint, Facial, & Scar Mark & Tattoo (SMT) Information , 2000 .

[25]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Michael Isard,et al.  Lost in quantization: Improving particular object retrieval in large scale image databases , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Terrance E. Boult,et al.  Detecting and classifying scars, marks, and tattoos found in the wild , 2012, 2012 IEEE Fifth International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[28]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[29]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[31]  Matti Pietikäinen,et al.  From BoW to CNN: Two Decades of Texture Representation for Texture Classification , 2018, International Journal of Computer Vision.

[32]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[33]  Qi Tian,et al.  Max-SIFT: Flipping invariant descriptors for Web logo search , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[34]  Trevor Darrell,et al.  Learning to Hash with Binary Reconstructive Embeddings , 2009, NIPS.

[35]  Rong Jin,et al.  Rank-based distance metric learning: An application to image retrieval , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Anil K. Jain,et al.  Face Matching and Retrieval in Forensics Applications , 2012, IEEE MultiMedia.

[37]  David J. Fleet,et al.  Minimal Loss Hashing for Compact Binary Codes , 2011, ICML.

[38]  Liqing Zhang,et al.  MindFinder: interactive sketch-based image search on millions of images , 2010, ACM Multimedia.

[39]  Abhinav Gupta,et al.  Training Region-Based Object Detectors with Online Hard Example Mining , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Shuo Yang,et al.  WIDER FACE: A Face Detection Benchmark , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Rong Jin,et al.  An efficient key point quantization algorithm for large scale image retrieval , 2009, LS-MMRM '09.

[42]  Daniel Manger,et al.  Large-Scale Tattoo Image Retrieval , 2012, 2012 Ninth Conference on Computer and Robot Vision.

[43]  Xingquan Zhu,et al.  Hashing Techniques , 2017 .

[44]  Patrick Pérez,et al.  SuBiC: A Supervised, Structured Binary Code for Image Search , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[45]  Adams Wai-Kin Kong,et al.  A geometric-based tattoo retrieval system , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[46]  Jian Sun,et al.  Collaborative Index Embedding for Image Retrieval , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  S. Acton,et al.  Matching and Retrieval of Tattoo Images: Active Contour CBIR and Glocal Image Features , 2008, 2008 IEEE Southwest Symposium on Image Analysis and Interpretation.

[48]  Rong Jin,et al.  Image Retrieval in Forensics: Tattoo Image Database Application , 2012, IEEE MultiMedia.

[49]  Shie Mannor,et al.  A Tutorial on the Cross-Entropy Method , 2005, Ann. Oper. Res..

[50]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[51]  Wu-Jun Li,et al.  Isotropic Hashing , 2012, NIPS.

[52]  Amit R.Sharma,et al.  Face Photo-Sketch Synthesis and Recognition , 2012 .

[53]  Junsong Yuan,et al.  Compressive Quantization for Fast Object Instance Search in Videos , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[54]  Anthony Hoogs,et al.  Tattoo detection and localization using region-based deep learning , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[55]  Edward J. Delp,et al.  Automatic and manual tattoo localization , 2016, 2016 IEEE Symposium on Technologies for Homeland Security (HST).

[56]  Karla Brkić,et al.  Tattoo Detection for Soft Biometric De-identification Based on Convolutional Neural Networks , 2016 .

[57]  Wei Liu,et al.  Learning to Hash for Indexing Big Data—A Survey , 2015, Proceedings of the IEEE.

[58]  Qi Tian,et al.  SIFT Meets CNN: A Decade Survey of Instance Retrieval , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[59]  Prateek Jain,et al.  Fast Similarity Search for Learned Metrics , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[61]  Shin'ichi Satoh,et al.  Faster R-CNN Features for Instance Search , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[62]  Chu-Song Chen,et al.  Supervised Learning of Semantics-Preserving Hash via Deep Convolutional Neural Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[63]  Wu-Jun Li,et al.  Scalable Graph Hashing with Feature Transformation , 2015, IJCAI.

[64]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[65]  Zhenan Sun,et al.  Fast Supervised Discrete Hashing , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[66]  Hongxin Zhang,et al.  End-to-End Spatial Transform Face Detection and Recognition , 2020, Virtual Real. Intell. Hardw..

[67]  Anil K. Jain,et al.  Tattoo based identification: Sketch to image matching , 2013, 2013 International Conference on Biometrics (ICB).

[68]  Wei Liu,et al.  Hashing with Graphs , 2011, ICML.

[69]  Wu-Jun Li,et al.  Feature Learning Based Deep Supervised Hashing with Pairwise Labels , 2015, IJCAI.

[70]  Rong Jin,et al.  Content-based image retrieval: An application to tattoo images , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[71]  Adams Wai-Kin Kong,et al.  Tattoo detection based on CNN and remarks on the NIST database , 2016, 2016 International Conference on Biometrics (ICB).

[72]  Jiwen Lu,et al.  Deep Hashing for Scalable Image Search , 2017, IEEE Transactions on Image Processing.

[73]  Paul W. Fieguth,et al.  Texture Classification from Random Features , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[74]  Liqing Zhang,et al.  Edgel index for large-scale sketch-based image search , 2011, CVPR 2011.

[75]  Terrance E. Boult,et al.  Exemplar codes for facial attributes and tattoo recognition , 2014, IEEE Winter Conference on Applications of Computer Vision.

[76]  Xuelong Li,et al.  Compressed Hashing , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[77]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[78]  Shengcai Liao,et al.  Learning Face Representation from Scratch , 2014, ArXiv.