State of the art and trends in speech coding

This paper opens with an introductory review of basic speech coding techniques: the most important properties of speech production and hearing, the ubiquitous techniques of quantization and linear prediction, and the principal measures of coding performance. The survey that follows discusses several standardized speech coding systems that reflect the state of the art, in terms of coding method, bit rate, performance, complexity, and typical application areas. Major future trends are indicated on the basis of expected standards. The paper deals primarily with narrowband speech coding systems and concludes with a review of the current state of wideband speech coding and an outline of future trends in that area.
