Least squares signal declipping for robust speech recognition

This paper introduces a novel declipping algorithm based on constrained least-squares minimization. Digital speech signals are often sampled at 16 kHz and classic declipping algorithms fail to accurately reconstruct the signal at this sampling rate due to the scarcity of reliable samples after clipping. The Constrained Blind Amplitude Reconstruction algorithm interpolates missing data points such that the resulting function is smooth while ensuring the inferred data fall in a legitimate range. The inclusion of explicit constraints helps to guide an accurate interpolation. Evaluation of declipping performance is based on automatic speech recognition word error rate and Constrained Blind Amplitude Reconstruction is shown to outperform the current state-of-the-art declipping technology under a variety of conditions. Declipping performance in additive noise is also considered.

[1]  Simon J. Godsill,et al.  Monte Carlo smoothing for non-linearly distorted signals , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[2]  Michael Elad,et al.  Audio Inpainting , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Richard M. Stern,et al.  The 1996 Hub-4 Sphinx-3 System , 1997 .

[4]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[5]  Kazuhiro Nakadai,et al.  Restoration of clipped audio signal using recursive vector projection , 2011, TENCON 2011 - 2011 IEEE Region 10 Conference.

[6]  Laurent Jacques,et al.  Consistent iterative hard thresholding for signal declipping , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Patti Price,et al.  The DARPA 1000-word resource management database for continuous speech recognition , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[8]  Ivan W. Selesnick,et al.  Least Squares with Examples in Signal Processing 1 , 2013 .

[9]  Raymond N. J. Veldhuis,et al.  Adaptive interpolation of discrete-time signals that can be modeled as autoregressive processes , 1986, IEEE Trans. Acoust. Speech Signal Process..