Evaluation of sparsifying algorithms for speech signals

Sparse representations of signals have been used in many areas of signal and image processing. It has also played an important role in compressive sensing algorithms since it performs well in sparse signals. A sparse representation is one in which small number of coefficients contain large proportion of the energy. Sparsity is important also in speech compression and coding, where the signal can be compressed in pre-processing stages. It leads to efficient and robust methods for compression, detection denoising and signal separation. The objective of this paper is to evaluate several transforms which is used to sparsify the speech signals. Fast Fourier Transform (FFT), Discrete Cosine Transform (DCT) and Discrete Wavelet Transform (DWT) will be compared and evaluated based on Gini Index. Sparsity properties and measures will be reviewed in this paper. Finally, sparse applications in speech compression and compressive sensing will be discussed.

[1]  Ping Chen,et al.  Generalized Discrete Cosine Transform , 2009, 2009 Pacific-Asia Conference on Circuits, Communications and Systems.

[2]  M. Vetterli,et al.  Sparse Sampling of Signal Innovations , 2008, IEEE Signal Processing Magazine.

[3]  Scott Rickard,et al.  The Gini index of speech , 2004 .

[4]  Rémi Gribonval,et al.  Sparse representations in unions of bases , 2003, IEEE Trans. Inf. Theory.

[5]  Michael Elad,et al.  Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.

[6]  Michael Elad,et al.  Sparse and Redundant Modeling of Image Content Using an Image-Signature-Dictionary , 2008, SIAM J. Imaging Sci..

[7]  Scott Rickard,et al.  Parameterized Lifting for Sparse Signal Representations Using the Gini Index , 2005 .

[8]  Joel A. Tropp,et al.  Just relax: convex programming methods for identifying sparse signals in noise , 2006, IEEE Transactions on Information Theory.

[9]  X. Qu,et al.  Combined sparsifying transforms for compressed sensing MRI , 2010 .

[10]  Scott T. Rickard,et al.  Maximizing Sparsity of Wavelet Representations via Parameterized Lifting , 2007, 2007 15th International Conference on Digital Signal Processing.

[11]  H. Dalton The Measurement of the Inequality of Incomes , 1920 .

[12]  Vahid Tarokh,et al.  A Frame Construction and a Universal Distortion Bound for Sparse Representations , 2008, IEEE Transactions on Signal Processing.

[13]  Scott T. Rickard,et al.  Sparse sources are separated sources , 2006, 2006 14th European Signal Processing Conference.

[14]  G. Teschke Multi-frame representations in linear inverse problems with mixed multi-constraints , 2007 .

[15]  B. Arnold Majorization and the Lorenz Order: A Brief Introduction , 1987 .

[16]  E.J. Candes,et al.  An Introduction To Compressive Sampling , 2008, IEEE Signal Processing Magazine.

[17]  Scott T. Rickard,et al.  Comparing Measures of Sparsity , 2008, IEEE Transactions on Information Theory.