Speech coding standards

ABSTRACT This chapter surveys speech coding algorithms, with emphasis on methods that are part of voice communication standards. The first section introduces speech coding algorithms and standards. The section "Speech Analysis-Synthesis and Linear Prediction" discusses short- and long-term linear prediction, and the section "Linear Prediction and Speech Coding Standards" presents standards based on open- and closed-loop linear prediction. The section "Standards Based on Subband and Transform Coders" covers standards built on subband and transform coders. The chapter concludes with a summary.
