Paper information - Average biased ReLU based CNN descriptor for improved face retrieval


Convolutional neural networks (CNNs) such as AlexNet, GoogLeNet and VGGNet extract highly discriminative features for many computer vision problems. A CNN model trained on one dataset performs reasonably well on that dataset, whereas on another dataset of a similar type a hand-designed feature descriptor can outperform the same trained model. The Rectified Linear Unit (ReLU) layer discards some values in order to introduce non-linearity. This paper proposes that the discriminative ability of a deep image representation computed with a trained model can be improved by using the Average Biased ReLU (AB-ReLU) in the last few layers. AB-ReLU improves the discriminative ability in two ways: 1) it exploits some of the discriminative negative information that ReLU discards, and 2) it neglects some of the irrelevant positive information that ReLU retains. The VGGFace model, trained in MatConvNet on the VGG-Face dataset, is used as the feature descriptor for face retrieval on other face datasets. The proposed approach is tested in a retrieval framework on six challenging, unconstrained and robust face datasets (PubFig, LFW, PaSC, AR, FERET and ExtYale) and on a large-scale face dataset (PolyUNIR). AB-ReLU is observed to outperform ReLU when used with the pre-trained VGGFace model on these face datasets. The validation error obtained when the network is trained after replacing all ReLUs with AB-ReLUs is also observed to be favorable on each dataset. AB-ReLU even outperforms state-of-the-art activation functions such as Sigmoid, ReLU, Leaky ReLU and Flexible ReLU on all seven face datasets.
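The two effects described in the abstract can be illustrated with a small sketch. The abstract does not give the formula, so the code below rests on an assumption: the bias is taken to be the average of all activations in the input volume (flattened here into a list), and each activation is shifted by that average before rectification. The function names `relu` and `ab_relu` are illustrative, not taken from the paper or from MatConvNet.

```python
from typing import List


def relu(values: List[float]) -> List[float]:
    """Standard ReLU: every negative input is set to zero."""
    return [max(0.0, v) for v in values]


def ab_relu(values: List[float]) -> List[float]:
    """Hypothetical AB-ReLU sketch: subtract the input mean, then rectify.

    Assumption (not stated in the abstract): the bias is the average of
    all activations in the input volume.
    """
    if not values:
        return []
    beta = sum(values) / len(values)  # average bias of the input volume
    return [max(0.0, v - beta) for v in values]


# Case 1: negative mean (-1.5). Some negative inputs survive AB-ReLU,
# whereas plain ReLU zeroes all of them.
negative_mean = [-5.0, -1.0, -0.5, 0.5]
relu_neg = relu(negative_mean)
ab_neg = ab_relu(negative_mean)

# Case 2: positive mean (3.0). Small positive inputs are suppressed by
# AB-ReLU, whereas plain ReLU passes them through unchanged.
positive_mean = [0.5, 1.0, 4.0, 6.5]
relu_pos = relu(positive_mean)
ab_pos = ab_relu(positive_mean)

print(relu_neg, ab_neg)
print(relu_pos, ab_pos)
```

Under this assumption, a negative average lets activations between the average and zero pass through (the first effect), while a positive average removes activations below the average (the second effect).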

Shiv Ram Dubey | Soumendu Chakraborty
