Cepstral weighting for speech dereverberation without musical noise

We present an effective way to reduce musical noise in binaural speech dereverberation algorithms based on an instantaneous weighting of the cepstrum. We propose this instantaneous technique, as temporal smoothing techniques result in a smearing of the signal over time and are thus expected to reduce the dereverberation performance. For the instantaneous weighting function we compute the a posteriori probability that a cepstral coefficient represents the speech spectral structure. The proposed algorithm incorporates a priori knowledge about the speech spectral structure by training the parameters of the respective likelihood function offline using a speech database. The proposed algorithm employs neither a voiced/unvoiced detection nor a fundamental period estimator and is shown to outperform an algorithm without cepstral processing in terms of a higher signal-to-interference ratio, a lower bark spectral distortion, and a lower log kurtosis ratio, indicating a reduction of musical noise.

[1]  Peter Vary,et al.  Digital Speech Transmission: Enhancement, Coding and Error Concealment , 2006 .

[2]  Jont B. Allen,et al.  Multimicrophone signal‐processing technique to remove room reverberation from speech signals , 1977 .

[3]  Peter Vary,et al.  A blind speech enhancement algorithm for the suppression of late reverberation and noise , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[5]  Kiyohiro Shikano,et al.  Automatic optimization scheme of spectral subtraction based on musical noise assessment via higher-order statistics , 2008 .

[6]  Rainer Martin,et al.  Cepstral Smoothing of Spectral Filter Gains for Speech Enhancement Without Musical Noise , 2007, IEEE Signal Processing Letters.

[7]  Petre Stoica,et al.  Total-Variance Reduction Via Thresholding: Application to Cepstral Analysis , 2007, IEEE Transactions on Signal Processing.

[8]  J.-M. Boucher,et al.  A New Method Based on Spectral Subtraction for Speech Dereverberation , 2001 .

[9]  Rainer Martin,et al.  On the Statistics of Spectral Amplitudes After Variance Reduction by Temporal Cepstrum Smoothing and Cepstral Nulling , 2009, IEEE Transactions on Signal Processing.

[10]  Rainer Martin,et al.  An Improved Parametric Model for Perception-Based Design of Virtual Acoustics , 2009 .

[11]  Thomas Esch,et al.  Model-Based Dereverberation Preserving Binaural Cues , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Eap Emanuël Habets Single- and multi-microphone speech dereverberation using spectral enhancement , 2007 .

[13]  Andrew Sekey,et al.  An Objective Measure for Predicting Subjective Quality of Speech Coders , 1992, IEEE J. Sel. Areas Commun..