High-quality coding of telephone speech and wideband audio

Digital speech technology is reviewed, with the emphasis on applications demanding high-quality reproduction of the speech signal. Examples of such applications are network telephony, ISDN terminals for audio teleconferencing, and systems for the storage of audio signals, which include the important subclass of wideband speech. Depending on the application, the bandwidth of input speech can vary from about 3 kHz to nearly 20 kHz. Coding for digital telephony at 4 and 8 kb/s, network quality coding at 16 kb/s, and coding for audio at 7 and 20 kHz are examined. Future directions in the field are discussed with respect to anticipated technology applications and the algorithms needed to support these technologies.<<ETX>>

[1]  D. Krahe Ein Verfahren zur Datenreduktion bei digitalen Audiosignalen unter Ausnutzung psychoakustischer Phänomene , 1986 .

[2]  Masato Miyoshi,et al.  Inverse filtering of room acoustics , 1988, IEEE Trans. Acoust. Speech Signal Process..

[3]  J. Flanagan,et al.  Computer‐steered microphone arrays for sound transduction in large rooms , 1985 .

[4]  P. Mermelstein G.722: a new CCITT coding standard for digital transmission of wideband audio signals , 1988, IEEE Communications Magazine.

[5]  Man Mohan Sondhi,et al.  Adaptive optimization of microphone arrays under a nonlinear constraint , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  P. Kabal,et al.  A low delay 16 kbits/sec speech coder , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[7]  J. Russell,et al.  Cellular access digital network (CADN): Wireless access to networks of the future , 1987, IEEE Communications Magazine.

[8]  Ira Alan Gerson,et al.  Vector Sum Excited Linear Prediction (VSELP) , 1991 .

[9]  D.C. Cox,et al.  Portable digital radio communications-an approach to tetherless access , 1989, IEEE Communications Magazine.

[10]  Günther Theile,et al.  Low-Bit Rate Coding of High Quality Audio Signals , 1987 .

[11]  Karl Hellwig,et al.  Speech codec for the European mobile radio system , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[12]  W. Voiers,et al.  Diagnostic acceptability measure for speech communication systems , 1977 .

[13]  Juro Ohga,et al.  Adaptive microphone-array system for noise reduction , 1986, IEEE Trans. Acoust. Speech Signal Process..

[14]  Willem Bastiaan Kleijn,et al.  Robust CELP coders for noisy backgrounds and noisy channels , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[15]  Karlheinz Brandenburg OCF--A new coding algorithm for high quality sound signals , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[16]  James D. Johnston,et al.  Transform coding of audio signals using perceptual noise criteria , 1988, IEEE J. Sel. Areas Commun..

[17]  Peter Kabal,et al.  A low delay 16 kb/s speech coder , 1991, IEEE Trans. Signal Process..

[18]  P. Mabilleau,et al.  A comparative study of the proposed high quality coding schemes for digital music , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[19]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[20]  Man Mohan Sondhi,et al.  Enhancement of ADPCM speech coding with backward-adaptive algorithms for postfiltering and noise feedback , 1988, IEEE J. Sel. Areas Commun..

[21]  J. D. Johnston Perceptual transform coding of wideband stereo signals , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[22]  B. Atal,et al.  Quantization procedures for the excitation in CELP coders , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[23]  M. Taka,et al.  CCITT Standardizing activities on speech coding , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[24]  Nuggehally Sampath Jayant,et al.  Speech coding with time-varying bit allocations to excitation and LPC parameters , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[25]  Thomas E. Tremain,et al.  AN EVALUATION OF 4800BPS VOICE CODERS , 1989 .

[26]  W. Daumer Subjective Evaluation of Several Efficient Speech Coders , 1982, IEEE Trans. Commun..

[27]  R. Steele The cellular environment of lightweight handheld portables , 1989, IEEE Communications Magazine.

[28]  N. Kitawaki,et al.  Quality assessment of speech coding and speech synthesis systems , 1988, IEEE Communications Magazine.

[29]  D.C. Cox,et al.  Universal digital portable radio communications , 1987, Proceedings of the IEEE.

[30]  Ed F. Deprettere,et al.  Regular-pulse excitation-A novel approach to effective and efficient multipulse coding of speech , 1986, IEEE Trans. Acoust. Speech Signal Process..

[31]  B. Atal High-quality speech at low bit rates: Multi-pulse and stochastically excited linear predictive coders , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[32]  R. Steele,et al.  High-user-density digital cellular mobile radio systems , 1985 .