A robust variable-rate speech coder

The goal of this study is to develop a robust and high-quality speech coder for wireless communication. The proposed coder is a perceptually-based variable-rate subband coder. The perceptual metric ensures that encoding is optimized to the human listener and is based on calculating the signal-to-mask ratio in short-time frames of the input signal. An adaptive bit allocation scheme is employed and the subband energies are then quantized using a Max-Lloyd quantizer. The coder is fully scalable-increasing the bit rates, improves the quality of encoded speech. Subjective listening tests, using quiet and noisy input signals, indicate that the proposed coder produces high-quality speech when operating at 12 kbps or higher. In error-free conditions, our coder has comparable performance to that of QCELP or GSM coders. For speech in background noise, however, our coder, at 12 kbps, outperforms QCELP significantly, and for music, it outperforms both QCELP and GSM.

[1]  Sadaoki Furui,et al.  Digital Speech Processing, Synthesis, and Recognition , 1989 .

[2]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[3]  A. Gersho Advances in speech and audio compression : Data compression , 1994 .

[4]  Abeer Alwan,et al.  Spectral analysis of subband filtered signals , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[5]  A.N. Willson,et al.  High-performance IIR QMF banks for speech subband coding , 1994, Proceedings of IEEE International Symposium on Circuits and Systems - ISCAS '94.

[6]  Karl Hellwig,et al.  Speech codec for the European mobile radio system , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[7]  Raymond N. J. Veldhuis,et al.  Bit Rates in Audio Source Coding , 1992, IEEE J. Sel. Areas Commun..

[8]  古井 貞煕,et al.  Digital speech processing, synthesis, and recognition , 1989 .

[9]  Yair Shoham,et al.  New directions in subband coding , 1988, IEEE J. Sel. Areas Commun..

[10]  N. Jayant,et al.  Digital Coding of Waveforms: Principles and Applications to Speech and Video , 1990 .

[11]  Karl Hellwig,et al.  Speech codec for the European mobile radio system , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[12]  Jerry D. Gibson,et al.  Digital coding of waveforms: Principles and applications to speech and video , 1985, Proceedings of the IEEE.

[13]  Nikil Jayant,et al.  Signal Compression: Technology Targets and Research Directions , 1992, IEEE J. Sel. Areas Commun..

[14]  Karlheinz Brandenburg,et al.  The iso/mpeg-audio codec: A generic standard for coding of high quality digital audio , 1992 .

[15]  Thomas Sporer,et al.  -NMR- and -Masking Flag-: Evaluation of Quality Using Perceptual Criteria , 1992 .