Ensembles of Deep Learning Models and Transfer Learning for Ear Recognition

The recognition performance of visual recognition systems is highly dependent on extracting and representing the discriminative characteristics of image data. Convolutional neural networks (CNNs) have shown unprecedented success in a variety of visual recognition tasks due to their capability to provide in-depth representations exploiting visual image features of appearance, color, and texture. This paper presents a novel system for ear recognition based on ensembles of deep CNN-based models and more specifically the Visual Geometry Group (VGG)-like network architectures for extracting discriminative deep features from ear images. We began by training different networks of increasing depth on ear images with random weight initialization. Then, we examined pretrained models as feature extractors as well as fine-tuning them on ear images. After that, we built ensembles of the best models to further improve the recognition performance. We evaluated the proposed ensembles through identification experiments using ear images acquired under controlled and uncontrolled conditions from mathematical analysis of images (AMI), AMI cropped (AMIC) (introduced here), and West Pomeranian University of Technology (WPUT) ear datasets. The experimental results indicate that our ensembles of models yield the best performance with significant improvements over the recently published results. Moreover, we provide visual explanations of the learned features by highlighting the relevant image regions utilized by the models for making decisions or predictions.

[1]  Farid Melgani,et al.  Ensemble of Deep Models for Event Recognition , 2018, ACM Trans. Multim. Comput. Commun. Appl..

[2]  Mohamed Abdel-Mottaleb,et al.  Exploiting color SIFT features for 2D ear recognition , 2011, 2011 18th IEEE International Conference on Image Processing.

[3]  Stefanos Zafeiriou,et al.  The unconstrained ear recognition challenge , 2017, 2017 IEEE International Joint Conference on Biometrics (IJCB).

[4]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[5]  Yizhou Yu,et al.  Borrowing Treasures from the Wealthy: Deep Transfer Learning through Selective Joint Fine-Tuning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Guodong Guo,et al.  On Applicability of Tunable Filter Bank Based Feature for Ear Biometrics: A Study from Constrained to Unconstrained , 2017, Journal of Medical Systems.

[8]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[9]  Toyotaro Suzumura,et al.  An Out-of-the-box Full-Network Embedding for Convolutional Neural Networks , 2017, 2018 IEEE International Conference on Big Knowledge (ICBK).

[10]  Jürgen Schmidhuber,et al.  Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Maurício Pamplona Segundo,et al.  Employing Fusion of Learned and Handcrafted Features for Unconstrained Ear Recognition , 2017, IET Biom..

[12]  Jürgen Schmidhuber,et al.  Multi-column deep neural network for traffic sign classification , 2012, Neural Networks.

[13]  Abdenour Hadid,et al.  Ear biometric recognition using local texture descriptors , 2014, J. Electronic Imaging.

[14]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[17]  Yi Zhang,et al.  Ear verification under uncontrolled conditions with convolutional neural networks , 2018, IET Biom..

[18]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[19]  Peter Peer,et al.  Training Convolutional Neural Networks with Limited Training Data for Ear Recognition in the Wild , 2017, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[20]  Christoph Busch,et al.  Ear biometrics: a survey of detection, feature extraction and recognition methods , 2012, IET Biom..

[21]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[22]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[23]  Maurício Pamplona Segundo,et al.  The Unconstrained Ear Recognition Challenge 2019 , 2019, 2019 International Conference on Biometrics (ICB).

[24]  Christoph Busch,et al.  A comparative study on texture and surface descriptors for ear biometrics , 2014, 2014 International Carnahan Conference on Security Technology (ICCST).

[25]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Loris Nanni,et al.  Fusion of color spaces for ear authentication , 2009, Pattern Recognit..

[27]  Marina L. Gavrilova,et al.  Occlusion Detection and Index-based Ear Recognition , 2015, J. WSCG.

[28]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[29]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[30]  Andrew Zisserman,et al.  Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[31]  Mark S. Nixon,et al.  Toward Unconstrained Ear Recognition From Two-Dimensional Images , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[32]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[33]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[34]  Yong Du,et al.  Learning pairwise SVM on hierarchical deep features for ear recognition , 2018, IET Biom..

[35]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[36]  Arun Ross,et al.  An introduction to biometric recognition , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[37]  Andrew G. Howard,et al.  Some Improvements on Deep Convolutional Neural Network Based Image Classification , 2013, ICLR.

[38]  Lina J. Karam,et al.  Unconstrained ear recognition using deep neural networks , 2018, IET Biom..

[39]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Kaushik Roy,et al.  Data augmentation in CNN-based periocular authentication , 2016, 2016 6th International Conference on Information Communication and Management (ICICM).

[41]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Thomas Brox,et al.  Striving for Simplicity: The All Convolutional Net , 2014, ICLR.

[43]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[44]  Liang Tian,et al.  Ear recognition based on deep convolutional network , 2016, 2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI).

[45]  Dariusz Frejlichowski,et al.  The West Pomeranian University of Technology Ear Database - A Tool for Testing Biometric Algorithms , 2010, ICIAR.

[46]  Kiran B. Raja,et al.  Ear recognition after ear lobe surgery: A preliminary study , 2016, 2016 IEEE International Conference on Identity, Security and Behavior Analysis (ISBA).

[47]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[48]  Yang Li,et al.  Non-negative dictionary based sparse representation classification for ear recognition with occlusion , 2016, Neurocomputing.

[49]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[50]  Ali Abd Almisreb,et al.  Utilizing AlexNet Deep Transfer Learning for Ear Recognition , 2018, 2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP).

[51]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[52]  Jian Guo,et al.  Deep CNN Ensemble with Data Augmentation for Object Detection , 2015, ArXiv.

[53]  Yu Qiao,et al.  A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[54]  Thomas Martinetz,et al.  Deep convolutional neural networks as generic feature extractors , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[55]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[56]  Quoc V. Le,et al.  Do Better ImageNet Models Transfer Better? , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[57]  Abdelmgeid A. Ali,et al.  Ear recognition using local binary patterns: A comparative experimental study , 2019, Expert Syst. Appl..

[58]  Hod Lipson,et al.  Understanding Neural Networks Through Deep Visualization , 2015, ArXiv.

[59]  Temple F. Smith Occam's razor , 1980, Nature.

[60]  Peter Peer,et al.  Ear recognition: More than a survey , 2016, Neurocomputing.

[61]  Blaz Meden,et al.  Covariate analysis of descriptor-based ear recognition techniques , 2017, 2017 International Conference and Workshop on Bioinspired Intelligence (IWOBI).

[62]  Hazim Kemal Ekenel,et al.  Domain Adaptation for Ear Recognition Using Deep Convolutional Neural Networks , 2017, IET Biom..

[63]  Umit Kacar,et al.  ScoreNet: deep cascade score level fusion for unconstrained ear recognition , 2018, IET Biom..

[64]  Tian Ying,et al.  Human ear recognition based on deep convolutional neural network , 2018, 2018 Chinese Control And Decision Conference (CCDC).

[65]  Abdelmgeid A. Ali,et al.  Ear Biometric Recognition Using Gradient-Based Feature Descriptors , 2018, AISI.

[66]  Yi Zhang,et al.  USTB-Helloear: A Large Database of Ear Images Photographed Under Uncontrolled Conditions , 2017, ICIG.

[67]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[68]  Arun Ross,et al.  A survey on ear biometrics , 2013, CSUR.

[69]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[70]  Peter Peer,et al.  Evaluation and analysis of ear recognition models: performance, complexity and resource requirements , 2020, Neural Computing and Applications.

[71]  Angélica González Arrieta,et al.  A brief review of the ear recognition process using deep neural networks , 2017, J. Appl. Log..