Sparse recognition via intra-class dictionary learning using visual saliency information

In recent years, sparse recognition (SR) has increasingly become an emerging pattern recognition method. Because of its excellent recognition performance for some traditionally difficult problems (such as occluded or corrupted face recognition), several classical SR ideas (such as sparse representation-based classification (SRC) or dictionary-based sparse recognition (DSR)) have been the focus of research in the intelligent information field. However, for image recognition against actual backgrounds, there are still problems with these mainstream SR methods. Hence, this paper presents a new SR method which combines the advantages of both SRC and DSR. In the pre-processing, visual saliency information (VSI) for images with complex scenes is extracted by introducing the saliency map as a tool. Then, DSR is used to develop intra-class dictionary learning for the VSI data. The last step is to solve a l1-norm optimization problem to give the SR result by generating a global recognition matrix with the SRC mechanism. Experimental results show that the proposed method for 'real world' image recognition provides advantages over mainstream SR methods, in recognition rate and computation time cost.

[1]  Xiangjian He,et al.  Bayesian salient object detection based on saliency driven clustering , 2014, Signal Process. Image Commun..

[2]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[3]  Dong Xu,et al.  Human Gait Recognition Using Patch Distribution Feature and Locality-Constrained Group Sparse Representation , 2012, IEEE Transactions on Image Processing.

[4]  Antonio Torralba,et al.  Recognizing indoor scenes , 2009, CVPR.

[5]  Gang Wang,et al.  Image-to-Set Face Recognition Using Locality Repulsion Projections and Sparse Reconstruction-Based Similarity Measure , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[7]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8]  N. Vasconcelos,et al.  Biologically plausible saliency mechanisms improve feedforward object recognition , 2010, Vision Research.

[9]  Dao-Qing Dai,et al.  Structured Sparse Error Coding for Face Recognition With Occlusion , 2013, IEEE Transactions on Image Processing.

[10]  Pascale Fung,et al.  Efficient Sparse Banded Acoustic Models for Speech Recognition , 2014, IEEE Signal Processing Letters.

[11]  Shengcai Liao,et al.  Kernel sparse representation with pixel-level and region-level local feature kernels for face recognition , 2014, Neurocomputing.

[12]  Liang-Tien Chia,et al.  Region-Based Saliency Detection and Its Application in Object Recognition , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Thomas Martinetz,et al.  Simple Method for High-Performance Digit Recognition Based on Sparse Coding , 2008, IEEE Transactions on Neural Networks.

[14]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[15]  Ran He,et al.  Two-Stage Nonnegative Sparse Representation for Large-Scale Face Recognition , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[16]  Ling Chen,et al.  Saliency-guided improvement for hand posture detection and recognition , 2014, Neurocomputing.

[17]  Nuno Vasconcelos,et al.  Discriminant Saliency, the Detection of Suspicious Coincidences, and Applications to Visual Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Lihi Zelnik-Manor,et al.  Context-Aware Saliency Detection , 2012, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  D. L. Donoho,et al.  Compressed sensing , 2006, IEEE Trans. Inf. Theory.

[20]  Sudeep Sarkar,et al.  Saliency in images and video: a brief survey , 2012 .

[21]  Wilfried Philips,et al.  Sparse representation and position prior based face hallucination upon classified over-complete dictionaries , 2012, Signal Process..

[22]  Larry S. Davis,et al.  Label Consistent K-SVD: Learning a Discriminative Dictionary for Recognition , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Changyin Sun,et al.  Supervised class-specific dictionary learning for sparse modeling in action recognition , 2012, Pattern Recognit..

[25]  Mohammad Reza Mohammadi,et al.  PCA-based dictionary building for accurate facial expression recognition via sparse representation , 2014, J. Vis. Commun. Image Represent..

[26]  Emmanuel J. Candès,et al.  Near-Optimal Signal Recovery From Random Projections: Universal Encoding Strategies? , 2004, IEEE Transactions on Information Theory.

[27]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Allen R. Hanson,et al.  Scene Text Recognition Using Similarity and a Lexicon with Sparse Belief Propagation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Kjersti Engan,et al.  Method of optimal directions for frame design , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[30]  Lu Yang,et al.  Sparse representation and learning in visual recognition: Theory and applications , 2013, Signal Process..

[31]  Mi Zhang,et al.  Human Daily Activity Recognition With Sparse Representation Using Wearable Sensors , 2013, IEEE Journal of Biomedical and Health Informatics.

[32]  Thomas S. Huang,et al.  Multi-View Automatic Target Recognition using Joint Sparse Representation , 2012, IEEE Transactions on Aerospace and Electronic Systems.

[33]  Shrikanth Narayanan,et al.  Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[34]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[35]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[36]  Wei Zhao,et al.  Dempster–Shafer Fusion of Multiple Sparse Representation and Statistical Property for SAR Target Configuration Recognition , 2014, IEEE Geoscience and Remote Sensing Letters.

[37]  Tuomas Virtanen,et al.  Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[38]  Arnold W. M. Smeulders,et al.  Sparse representation for coarse and fine object recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Rama Chellappa,et al.  Dictionary-Based Face Recognition Under Variable Lighting and Pose , 2012, IEEE Transactions on Information Forensics and Security.

[40]  Shrikanth S. Narayanan,et al.  Novel Variations of Group Sparse Regularization Techniques With Applications to Noise Robust Automatic Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[41]  Jillian H. Fecteau,et al.  Salience, relevance, and firing: a priority map for target selection , 2006, Trends in Cognitive Sciences.

[42]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[43]  Shuyuan Yang,et al.  Compressive feature and kernel sparse coding-based radar target recognition , 2013 .

[44]  Chanho Jung,et al.  A Unified Spectral-Domain Approach for Saliency Detection and Its Application to Automatic Object Segmentation , 2012, IEEE Transactions on Image Processing.

[45]  Joel A. Tropp,et al.  Greed is good: algorithmic results for sparse approximation , 2004, IEEE Transactions on Information Theory.

[46]  Wenjian Wang,et al.  Saliency-SVM: An automatic approach for image segmentation , 2014, Neurocomputing.