An analysis of sparseness and regularization in exemplar-based methods for speech classification

The use of exemplar-based techniques for both speech classification and recognition tasks has become increasingly popular in recent years. However, the notion of why sparseness is important for exemplar-based speech processing has been relatively unexplored. In addition, little analysis has been done in speech processing on the appropriateness of different types of sparsity regularization constraints. The goal of this paper is to answer the above two questions, both through mathematically analyzing different sparseness methods and also comparing these approaches for phonetic classification in TIMIT.

[1]  Georg Heigold,et al.  Speech recognition with state-based nearest neighbour classifiers , 2007, INTERSPEECH.

[2]  D. Nahamoo,et al.  Sparse Representation Phone Identification Features for Speech Recognition , 2010 .

[3]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Tara N. Sainath,et al.  Bayesian compressive sensing for phonetic classification , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Lawrence Carin,et al.  Bayesian Compressive Sensing , 2008, IEEE Transactions on Signal Processing.

[6]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[7]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[8]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[9]  Pini Gurfil,et al.  Convex Feasibility Programming for Compressed Sensing , 2010, ArXiv.

[10]  Tuomas Virtanen,et al.  Noise robust exemplar-based connected digit recognition , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.