论文信息 - Correlation and Class Based Block Formation for Improved Structured Dictionary Learning

Correlation and Class Based Block Formation for Improved Structured Dictionary Learning

In recent years, the creation of block-structured dictionary has attracted a lot of interest. Learning such dictionaries involve two step process: block formation and dictionary update. Both these steps are important in producing an effective dictionary. The existing works mostly assume that the block structure is known a priori while learning the dictionary. For finding the unknown block structure given a dictionary commonly sparse agglomerative clustering (SAC) is used. It groups atoms based on their consistency in sparse coding with respect to the unstructured dictionary. This paper explores two innovations towards improving the reconstruction as well as the classification ability achieved with the block-structured dictionary. First, we propose a novel block structuring approach that makes use of the correlation among dictionary atoms. Unlike the SAC approach, which groups diverse atoms, in the proposed approach the blocks are formed by grouping the top most correlated atoms in the dictionary. The proposed block clustering approach is noted to yield significant reductions in redundancy as well as provides a direct control on the block size when compared with the existing SAC-based block structuring. Later, motivated by works using supervised \emph{a priori} known block structure, we also explore the incorporation of class information in the proposed block formation approach to further enhance the classification ability of the block dictionary. For assessment of the reconstruction ability with proposed innovations is done on synthetic data while the classification ability has been evaluated in large variability speaker verification task.

Nagendra Kumar | Rohit Sinha | R. Sinha | Nagendra Kumar

[1] Michael Elad,et al. Image Denoising Via Learned Dictionaries and Sparse representation , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[2] Allen Y. Yang,et al. Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3] Rohit Sinha,et al. Language identification using sparse representation: A comparison between GMM supervector and i-vector based approaches , 2013, 2013 Annual IEEE India Conference (INDICON).

[4] Rama Chellappa,et al. Dictionary-Based Face Recognition Under Variable Lighting and Pose , 2012, IEEE Transactions on Information Forensics and Security.

[5] A. Bruckstein,et al. K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[6] Yonina C. Eldar,et al. Dictionary Optimization for Block-Sparse Representations , 2010, IEEE Transactions on Signal Processing.

[7] M. Elad,et al. $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[8] Babak Hassibi,et al. On the Reconstruction of Block-Sparse Signals With an Optimal Number of Measurements , 2008, IEEE Transactions on Signal Processing.

[9] Yonina C. Eldar,et al. Average Case Analysis of Multichannel Sparse Recovery Using Convex Relaxation , 2009, IEEE Transactions on Information Theory.

[10] Rohit Sinha,et al. Robust Speaker Verification With Joint Sparse Coding Over Learned Dictionaries , 2015, IEEE Transactions on Information Forensics and Security.

[11] R. Tibshirani,et al. Least angle regression , 2004, math/0406456.

[12] Michael Elad,et al. Efficient Implementation of the K-SVD Algorithm using Batch Orthogonal Matching Pursuit , 2008 .

[13] E. Ambikairajah,et al. Speaker verification using sparse representation classification , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[15] Angshul Majumdar,et al. Accelerating multi-echo T2 weighted MR imaging: analysis prior group-sparse optimization. , 2011, Journal of magnetic resonance.

[16] Florin Curelaru,et al. Front-End Factor Analysis For Speaker Verification , 2018, 2018 International Conference on Communications (COMM).

[17] Patrick Kenny,et al. Speaker and Session Variability in GMM-Based Speaker Verification , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[18] Yonina C. Eldar,et al. Block-sparsity: Coherence and efficient recovery , 2008, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19] Guillermo Sapiro,et al. Sparse representations for image classification: learning discriminative and reconstructive non-parametric dictionaries , 2008 .

[20] Yunde Jia,et al. Orthonormal dictionary learning and its application to face recognition , 2016, Image Vis. Comput..

[21] Daniel Garcia-Romero,et al. Analysis of i-vector Length Normalization in Speaker Recognition Systems , 2011, INTERSPEECH.

[22] Ajit Rajwade,et al. Block and Group Regularized Sparse Modeling for Dictionary Learning , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[23] Larry S. Davis,et al. Label Consistent K-SVD: Learning a Discriminative Dictionary for Recognition , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24] Rohit Sinha,et al. Class specificity and commonality based discriminative dictionary for speaker verification , 2016, 2016 Twenty Second National Conference on Communication (NCC).

[25] Manhua Liu,et al. Latent Fingerprint Enhancement via Multi-Scale Patch Based Sparse Representation , 2015, IEEE Transactions on Information Forensics and Security.

[26] Yonina C. Eldar,et al. Robust Recovery of Signals From a Structured Union of Subspaces , 2008, IEEE Transactions on Information Theory.

[27] Mohammed Bennamoun,et al. Sparse Representation for Speaker Identification , 2010, 2010 20th International Conference on Pattern Recognition.

[28] Yi Ma,et al. Learning Category-Specific Dictionary and Shared Dictionary for Fine-Grained Image Categorization , 2014, IEEE Transactions on Image Processing.

[29] Rohit Sinha,et al. Improved speaker verification using block sparse coding over joint speaker-channel learned dictionary , 2015, TENCON 2015 - 2015 IEEE Region 10 Conference.

[30] Yonina C. Eldar,et al. Block-Sparse Signals: Uncertainty Relations and Efficient Recovery , 2009, IEEE Transactions on Signal Processing.