论文信息 - Discriminative Autoencoder for Feature Extraction: Application to Character Recognition

Discriminative Autoencoder for Feature Extraction: Application to Character Recognition

Conventionally, autoencoders are unsupervised representation learning tools. In this work, we propose a novel discriminative autoencoder. Use of supervised discriminative learning ensures that the learned representation is robust to variations commonly encountered in image datasets. Using the basic discriminating autoencoder as a unit, we build a stacked architecture aimed at extracting relevant representation from the training data. The efficiency of our feature extraction algorithm ensures a high classification accuracy with even simple classification schemes like KNN (K-nearest neighbor). We demonstrate the superiority of our model for representation learning by conducting experiments on standard datasets for character/image recognition and subsequent comparison with existing supervised deep architectures like class sparse stacked autoencoder and discriminative deep belief network.

Angshul Majumdar | Anupriya Gogna | A. Majumdar | Anupriya Gogna

[1] H. M. Abbas. Analysis and pruning of nonlinear auto-association networks , 2004 .

[2] Luís A. Alexandre,et al. Stacked Autoencoders Using Low-Power Accelerated Architectures for Object Recognition in Autonomous Systems , 2016, Neural Processing Letters.

[3] Yücel Altunbasak,et al. Eigenface-domain super-resolution for face recognition , 2003, IEEE Trans. Image Process..

[4] Ghassan Hamarneh,et al. N-Sift: N-Dimensional Scale Invariant Feature Transform for Matching Medical Images , 2007, ISBI.

[5] Yan Liu,et al. Discriminative deep belief networks for visual data classification , 2011, Pattern Recognit..

[6] David J. Field,et al. Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[7] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[8] H. Bourlard,et al. Auto-association by multilayer perceptrons and singular value decomposition , 1988, Biological Cybernetics.

[9] Baoxin Li,et al. Discriminative K-SVD for dictionary learning in face recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10] Yoshua Bengio,et al. Marginalized Denoising Auto-encoders for Nonlinear Representations , 2014, ICML.

[11] Dong Yu,et al. Deep Learning and Its Applications to Signal and Information Processing [Exploratory DSP] , 2011, IEEE Signal Processing Magazine.

[12] Yoshua. Bengio,et al. Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[13] Amy Loutfi,et al. Learning Representations with a Dynamic Objective Sparse Autoencoder , 2012, NIPS 2012.

[14] Boaz Lerner,et al. Accurate and Fast Off and Online Fuzzy ARTMAP-Based Image Classification With Application to Genetic Abnormality Diagnosis , 2006, IEEE Transactions on Neural Networks.

[15] Frédéric Jurie,et al. Discriminative Autoencoders for Small Targets Detection , 2014, 2014 22nd International Conference on Pattern Recognition.

[16] Matti Pietikäinen,et al. Rotation Invariant Image Description with Local Binary Pattern Histogram Fourier Features , 2009, SCIA.

[17] Jochen J. Steil,et al. Online learning and generalization of parts-based image representations by non-negative sparse autoencoders , 2012, Neural Networks.

[18] Feiping Nie,et al. Effective Discriminative Feature Selection With Nontrivial Solution , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[19] Charles L. Lawson,et al. Solving least squares problems , 1976, Classics in applied mathematics.

[20] Geoffrey E. Hinton,et al. Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[21] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[22] Jim Jing-Yan Wang,et al. Max-min distance nonnegative matrix factorization , 2013, Neural Networks.

[23] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[24] Xiaolong Wang,et al. Convolutional Deep Networks for Visual Data Classification , 2012, Neural Processing Letters.

[25] Yoshua Bengio,et al. Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[26] Richa Singh,et al. Face Verification via Class Sparsity Based Supervised Encoding , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27] Hau-San Wong,et al. Face recognition based on 2D Fisherface approach , 2006, Pattern Recognit..

[28] Michael J. Watts,et al. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS Publication Information , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[29] Jun Zhou,et al. Hyperspectral Image Classification Based on Structured Sparse Logistic Regression and Three-Dimensional Wavelet Texture Features , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[30] Yoshua Bengio,et al. Classification using discriminative restricted Boltzmann machines , 2008, ICML '08.

[31] Hong Yan,et al. Handwritten Digit Recognition by a Mixture of Local Principal Component Analysis , 1998, Neural Processing Letters.

[32] Larry S. Davis,et al. Label Consistent K-SVD: Learning a Discriminative Dictionary for Recognition , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33] Tom Goldstein,et al. The Split Bregman Method for L1-Regularized Problems , 2009, SIAM J. Imaging Sci..

[34] Pascal Vincent,et al. Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..