Supervised Sparse Coding Strategy in Cochlear Implants

In this paper we explore how to improve a sparse coding (SC) strategy that was successfully used to improve subjective speech perception in noisy environment in cochlear implants. On the basis of the existing unsupervised algorithm, we developed an enhanced supervised SC strategy, using the SC shrinkage (SCS) principle. The new algorithm is implemented at the stage of the spectral envelopes after the signal separation in a 22-channel filter bank. SCS can extract and transmit the most important information from noisy speech. The new algorithm is compared with the unsupervised algorithm using objective evaluation for speech in babble and white noise (signal-to-noise ratios, SNR = 10dB, 5dB, 0dB) using objective measures in a cochlea implant simulation. Results show that the supervised SC strategy performs better in white noise, but not significantly better with babble noise. Copyright © 2011 ISCA.

[1]  Guillermo Sapiro,et al.  Supervised Dictionary Learning , 2008, NIPS.

[2]  Liang Chen,et al.  Enhanced sparse speech processing strategy for cochlear implants , 2011, 2011 19th European Signal Processing Conference.

[3]  Xiaodong Li,et al.  Speech enhancement based on the structure of noise power spectral density , 2010, 2010 18th European Signal Processing Conference.

[4]  Xiaodong Li,et al.  Two-channel post-filtering based on adaptive smoothing and noise properties , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[6]  J Bamford,et al.  The BKB (Bamford-Kowal-Bench) sentence lists for partially-hearing children. , 1979, British journal of audiology.

[7]  B. Moore,et al.  A Model of Loudness Applicable to Time-Varying Sounds , 2002 .

[8]  Lawrence Carin,et al.  Bayesian Compressive Sensing , 2008, IEEE Transactions on Signal Processing.

[9]  D. Pisoni,et al.  Talker-specific learning in speech perception , 1998, Perception & psychophysics.

[10]  Kah-Chye Tan,et al.  Postprocessing method for suppressing musical noise generated by spectral subtraction , 1998, IEEE Trans. Speech Audio Process..

[11]  Ju Liu,et al.  Speech Signal Enhancement Based on MAP Algorithm in the ICA Space , 2008, IEEE Transactions on Signal Processing.

[12]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[13]  Guoping Li,et al.  Sparse stimuli for cochlear implants , 2008, 2008 16th European Signal Processing Conference.

[14]  Mingyuan Zhou,et al.  Nonparametric learning of dictionaries for sparse representation of sensor signals , 2009, 2009 3rd IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[15]  Pascal Scalart,et al.  Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[16]  Guoping Li Speech perception in a sparse domain , 2008 .

[17]  Aapo Hyvärinen,et al.  Sparse Code Shrinkage: Denoising of Nongaussian Data by Maximum Likelihood Estimation , 1999, Neural Computation.

[18]  E. Oja,et al.  Sparse code shrinkage for image denoising , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[19]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.