Advances in speech and audio compression

Speech and audio compression has advanced rapidly in recent years spurred on by cost-effective digital technology and diverse commercial applications. Recent activity in speech compression is dominated by research and development of a family of techniques commonly described as code-excited linear prediction (CELP) coding. These algorithms exploit models of speech production and auditory perception and offer a quality versus bit rate tradeoff that significantly exceeds most prior compression techniques for rates in the range of 4 to 16 kb/s. Techniques have also been emerging in recent years that offer enhanced quality in the neighborhood of 2.4 kb/s over traditional vocoder methods. Wideband audio compression is generally aimed at a quality that is nearly indistinguishable from consumer compact-disc audio. Subband and transform coding methods combined with sophisticated perceptual coding techniques dominate in this arena with nearly transparent quality achieved at bit rates in the neighborhood of 128 kb/s per channel. >

[1]  Allen Gersho,et al.  Vector quantization of speech LSF parameters with generalized product codes , 1992, ICSLP.

[2]  W. Bastiaan Kleijn,et al.  Methods for waveform interpolation in speech coding , 1991, Digit. Signal Process..

[3]  Allen Gersho,et al.  Variable rate speech coding with phonetic segmentation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Herbert Reininger,et al.  Improved CELP coding using adaptive excitation codebooks , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[5]  Shoji Tominaga,et al.  ADPCM with a multiquantizer for speech coding , 1988, IEEE J. Sel. Areas Commun..

[6]  Masami Akamine,et al.  CELP coding with an adaptive density pulse excitation model , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[7]  R. Crochiere,et al.  Speech Coding , 1979, IEEE Transactions on Communications.

[8]  Yoshinori Tanaka,et al.  Tree-structured delta codebook for an efficient implementation of CELP , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Willem Bastiaan Kleijn,et al.  Robust CELP coders for noisy backgrounds and noisy channels , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[10]  Claude Lamblin,et al.  Variable rate speech coding with online segmentation and fast algebraic codes , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[11]  Masaaki Honda Speech coding using waveform matching based on LPC residual phase equalization , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[12]  Redwan Salami,et al.  Real-time implementation of a 9.6 kbit/s ACELP wideband speech coder , 1992, [Conference Record] GLOBECOM '92 - Communications for Global Users: IEEE.

[13]  R. Gray,et al.  Speech coding based upon vector quantization , 1980, ICASSP.

[14]  Allen Gersho,et al.  Complexity reduction methods for vector excitation coding , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  Rajiv Laroia,et al.  Robust and efficient quantization of speech LSP parameters using structured vector quantizers , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[16]  Willem Bastiaan Kleijn,et al.  Continuous representations in linear predictive coding , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[17]  Willem Bastiaan Kleijn,et al.  Improved speech quality and efficient vector quantization in SELP , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[18]  J. D. Johnston,et al.  Sum-difference stereo transform coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[19]  Yen-Chun Lin,et al.  A Low-Delay CELP Coder for the CCITT 16 kb/s Speech Coding Standard , 1992, IEEE J. Sel. Areas Commun..

[20]  Luís B. Almeida,et al.  Harmonic coding at 4.8 kb/s , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[21]  Kazunori Ozawa,et al.  M-lcelp Speech Coding at Bit-rates Below 4kbps , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[22]  Przemyslaw Dymarski,et al.  Mixed excitation CELP coder , 1989, EUROSPEECH.

[23]  Allen Gersho,et al.  Phonetically-based vector excitation coding of speech at 3.6 kbps , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[24]  Ernst Eberlein,et al.  Combined Stereo Coding , 1992 .

[25]  Masami Akamine,et al.  Efficient excitation model for low bit rate speech coding , 1991, 1991., IEEE International Sympoisum on Circuits and Systems.

[26]  P. Mermelstein G.722: a new CCITT coding standard for digital transmission of wideband audio signals , 1988, IEEE Communications Magazine.

[27]  G.C.P. Lokhoff Precision adaptive subband coding (PASC) for the digital compact cassette (DCC) , 1992 .

[28]  Robert M. Gray,et al.  Multimode coding: application to CELP , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[29]  I. A. Gerson,et al.  Vector sum excited linear prediction (VSELP) speech coding at 8 kbps , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[30]  Costas Xydeas,et al.  On improving vector excitation coders through the use of spherical lattice codebooks (SLCs) , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[31]  P. Kroon,et al.  Generalized analysis-by-synthesis coding and its application to pitch prediction , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[32]  Ira Alan Gerson,et al.  Vector Sum Excited Linear Prediction (VSELP) , 1991 .

[33]  Ernst Eberlein,et al.  Advanced Audio Measurement System Using Psychoacoustic Properties , 1992 .

[34]  W. Bastiaan Kleijn,et al.  An efficient stochastically excited linear predictive coding algorithm for high quality low bit rate transmission of speech , 1988, Speech Commun..

[35]  Ken-ichi Sato,et al.  Variable rate speech coding for asynchronous transfer mode , 1990, IEEE Trans. Commun..

[36]  Takehiro Moriya,et al.  4.8 kbit/s delayed decision CELP coder using tree coding , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[37]  José M. Tribolet,et al.  Improved pitch prediction with fractional delays in CELP coding , 1989, International Conference on Acoustics, Speech, and Signal Processing.

[38]  C.-F. Chan,et al.  New multistage scheme for vector quantisation of PARCOR coefficients , 1992 .

[39]  M. Johnson,et al.  Pitch-orthogonal code-excited LPC , 1990, [Proceedings] GLOBECOM '90: IEEE Global Telecommunications Conference and Exhibition.

[40]  Bishnu S. Atal,et al.  A new model of LPC excitation for producing natural-sounding speech at low bit rates , 1982, ICASSP.

[41]  Nobuhiko Kitawaki Research of objective speech quality assessment , 1991 .

[42]  Gernot Kubin,et al.  Performance of noise excitation for unvoiced speech , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[43]  Ulrich Heute,et al.  A new approach to objective quality-measures based on attribute-matching , 1992, Speech Commun..

[44]  P. C. Meuse A 2400 bps multi-band excitation vocoder , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[45]  C. K. Un,et al.  Multistage self-excited linear predictive speech coder , 1989 .

[46]  Saeed Vaseghi Finite state CELP for variable rate speech coding , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[47]  S. Hayashi,et al.  Standardization activities on 8-kbit/s speech coding in CCITT SGXV , 1992, 1992 IEEE International Conference on Selected Topics in Wireless Communications.

[48]  Allen Gersho,et al.  Adaptive postfiltering for quality enhancement of coded speech , 1995, IEEE Trans. Speech Audio Process..

[49]  M. Johnson,et al.  Improving the performance of CELP-based speech coding at low bit rates , 1991, 1991., IEEE International Sympoisum on Circuits and Systems.

[50]  Thomas P. Barnwell,et al.  Recursive windowing for generating autocorrelation coefficients for LPC analysis , 1981 .

[51]  A. Gersho,et al.  Multiple-stage vector excitation coding of speech waveforms , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[52]  Allen Gersho,et al.  Vector adaptive predictive coding of speech at 9.6 kb/s , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[53]  Ahmet M. Kondoz,et al.  Improved quality CELP base-band coding of speech at low-bit rates , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[54]  Allen Gersho,et al.  On the structure of vector quantizers , 1982, IEEE Trans. Inf. Theory.

[55]  Luis A. Hernández Gómez,et al.  Vector quantized multipulse-LPC , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[56]  John Makhoul,et al.  Adaptive noise spectral shaping and entropy coding in predictive coding of speech , 1979 .

[57]  Kuldip K. Paliwal,et al.  Efficient vector quantization of LPC parameters at 24 bits/frame , 1993, IEEE Trans. Speech Audio Process..

[58]  Robert J. Safranek,et al.  Signal compression based on models of human perception , 1993, Proc. IEEE.

[59]  Christiane Antweiler,et al.  High Quality Coding of Wideband Speech at 24 Kbit/s , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[60]  Allen Gersho,et al.  Real-time vector excitation coding of speech at 4800 bps , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[61]  Luís B. Almeida,et al.  Harmonic coding: A low bit-rate, good-quality speech coding technique , 1982, ICASSP.

[62]  Jae S. Lim,et al.  Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[63]  Biing-Hwang Juang,et al.  Multiple stage vector quantization for speech coding , 1982, ICASSP.

[64]  Andrew Sekey,et al.  An Objective Measure for Predicting Subjective Quality of Speech Coders , 1992, IEEE J. Sel. Areas Commun..

[65]  Nuggehally Sampath Jayant,et al.  Adaptive postfiltering of 16 kb/s-ADPCM speech , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[66]  W. H. Holmes,et al.  Use of an auditory model to improve speech coders , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[67]  Peter Kabal,et al.  A low delay 16 kb/s speech coder , 1991, IEEE Trans. Signal Process..

[68]  W. Bastiaan Kleijn,et al.  Efficient channel coding for CELP using source information , 1992, Speech Commun..

[69]  D. Lin Speech coding using efficient pseudo-stochastic block codes , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[70]  Samir Saoudi,et al.  A new efficient algorithm to compute the LSP parameters for speech coding , 1992, Signal Process..

[71]  Allen Gersho,et al.  Vector Predictive Coding of Speech at 16 kbits/s , 1985, IEEE Trans. Commun..

[72]  Takehiro Moriya,et al.  Pitch synchronous innovation CELP (PSI-CELP) , 1993, EUROSPEECH.

[73]  Allen Gersho,et al.  Speech and Audio Coding for Wireless and Network Applications , 1993 .

[74]  Ahmet M. Kondoz,et al.  An 8 kbit/s LD-CELP with improved excitation and perceptual modelling , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[75]  P. Mabilleau,et al.  16 kbps wideband speech coding technique based on algebraic CELP , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[76]  S. Dimolitsas,et al.  Standardizing speech-coding technology for network applications , 1993, IEEE Communications Magazine.

[77]  Yoshinori Tanaka,et al.  Principal axis extracting vector excitation coding: high quality speech at 8 kb/s , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[78]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[79]  Ahmet M. Kondoz,et al.  High quality multiband LPC coding of speech at 2.4 kbit/s , 1991 .

[80]  Allen Gersho,et al.  Low-delay vector excitation coding of speech at 8 kbit/s , 1991, IEEE Global Telecommunications Conference GLOBECOM '91: Countdown to the New Millennium. Conference Record.

[81]  Peter Kroon,et al.  Pitch predictors with high temporal resolution , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[82]  Allen Gersho,et al.  Variable Rate Speech Coding for Cellular Networks , 1993 .

[83]  Robert M. Gray,et al.  Multimode coding: A novel approach to narrow‐ and medium‐band coding , 1988 .

[84]  Isabel Trancoso,et al.  Efficient procedures for finding the optimum innovation in stochastic coders , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[85]  M. Johnson,et al.  Low-complexity multi-mode VXC using multi-stage optimization and mode selection (speech coding) , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[86]  B. Atal,et al.  Predictive coding of speech signals and subjective error criteria , 1979 .

[87]  A. M. Kondoz,et al.  CELP base-band coder for high quality speech coding at 9.6 to 2.4 kbps , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[88]  Gunnar Hult,et al.  Some remarks on a halting criterion for iterative low-pass filtering in a recently proposed pitch detection algorithm , 1991, Speech Commun..

[89]  Akihiko Sugiyama,et al.  Adaptive transform coding with an adaptive block size (ATC-ABS) , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[90]  Peter Kabal,et al.  Synthesis filter optimization and coding: Applications to CELP (speech analysis) , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[91]  Ahmet M. Kondoz,et al.  Natural sounding speech coder operating at 2.4 kb/s and below , 1992, 1992 IEEE International Conference on Selected Topics in Wireless Communications.

[92]  Claude Laflamme,et al.  Acelp speech coding at 8 kbit/s with a 10 ms frame: a candidate for ccitt standardization , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[93]  James D. Johnston,et al.  Transform coding of audio signals using perceptual noise criteria , 1988, IEEE J. Sel. Areas Commun..

[94]  W. Bastiaan Kleijn,et al.  Encoding speech using prototype waveforms , 1993, IEEE Trans. Speech Audio Process..

[95]  Jerry D. Gibson,et al.  Uniform and piecewise uniform lattice vector quantization for memoryless Gaussian and Laplacian sources , 1993, IEEE Trans. Inf. Theory.

[96]  V. Cuperman,et al.  Variable-rate low-delay analysis-by-synthesis speech coding at 8-16 kb/s , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[97]  Biing-Hwang Juang,et al.  Optimal quantization of LSP parameters , 1993, IEEE Trans. Speech Audio Process..

[98]  Peter Kabal,et al.  The computation of line spectral frequencies using Chebyshev polynomials , 1986, IEEE Trans. Acoust. Speech Signal Process..

[99]  Louis Dunn Fielder,et al.  High-quality audio transform coding at 128 kbits/s , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[100]  P. Urcun,et al.  A MUSICAM source codec for digital audio broadcasting and storage , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[101]  Allen Gersho,et al.  Gain-Adaptive Vector Quantization with Application to Speech Coding , 1987, IEEE Trans. Commun..

[102]  Yair Shoham High-quality speech coding at 2.4 to 4.0 kbit/s based on time-frequency interpolation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[103]  Peter Kabal,et al.  Stability and performance analysis of pitch filters in speech coders , 1987, IEEE Trans. Acoust. Speech Signal Process..

[104]  Andrew Perkis,et al.  Joint source and channel trellis coding of line spectrum pair parameters , 1992, Speech Commun..

[105]  S. Morissette,et al.  On reducing computational complexity of codebook search in CELP coder through the use of algebraic codes , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[106]  D. Rowe,et al.  A robust 2400 bit/s MBE-LPC speech coder incorporating joint source and channel coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[107]  S. A. Mahmoud,et al.  Structured codebook design in CELP , 1990 .

[108]  Allen Gersho,et al.  Asymptotically optimal block quantization , 1979, IEEE Trans. Inf. Theory.

[109]  Per Hedelin A tone oriented voice excited vocoder , 1981, ICASSP.

[110]  Rosario Drogo de Iacovo,et al.  Embedded CELP coding for variable bit-rate between 6.4 and 9.6 kbit/s , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[111]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[112]  J.J. Shynk,et al.  Backward adaptation for low delay vector excitation coding of speech at 16 kbit/s , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[113]  Takehiro Moriya,et al.  An 8-bit/s speech coder based on conjugate structure CELP , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[114]  Robert M. Gray,et al.  The Design of Trellis Waveform Coders , 1982, IEEE Trans. Commun..

[115]  Yair Shoham Constrained-stochastic excitation coding of speech at 4.8 kb/s , 1990, ICSLP.

[116]  T. J. Moulsley,et al.  Fast vector quantisation using orthogonal codebooks , 1991 .

[117]  B. Atal,et al.  Strategies for improving the performance of CELP coders at low bit rates (speech analysis) , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[118]  Karl Hellwig,et al.  Speech codec for the European mobile radio system , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[119]  Y. Yatsuzuka Highly Sensitive Speech Detector and High-Speed Voiceband Data Discriminator in DSI-ADPCM Systems , 1982, IEEE Trans. Commun..

[120]  D.Y.-K. Wong Issues on speech storage , 1992 .

[121]  I. Boyd,et al.  The voice activity detector for the Pan-European digital cellular mobile telephone service , 1988, International Conference on Acoustics, Speech, and Signal Processing,.

[122]  Wolfgang Hess,et al.  Pitch Determination of Speech Signals , 1983 .

[123]  R. V. Cox,et al.  LD-CELP: a high quality 16 kb/s speech coder with low delay , 1990, [Proceedings] GLOBECOM '90: IEEE Global Telecommunications Conference and Exhibition.

[124]  Grant Allen Davidson,et al.  A low cost adaptive transform decoder implementation for high-quality audio , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[125]  Jun Matsumoto,et al.  Vector quantized MBE with simplified V/UV division at 3.0 kbit/s , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[126]  D. Sereno,et al.  CELP Coding for high-quality speech at 8 kbit/s , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[127]  Akihiko Sugiyama,et al.  A 128 kb/s Hi-Fi Audio CODEC Based on Adaptive Transform Coding with Adaptive Block Size MDCT , 1992, IEEE J. Sel. Areas Commun..

[128]  P. Noll,et al.  Wideband speech and audio coding , 1993, IEEE Communications Magazine.

[129]  Allen Gersho,et al.  Advances in speech coding , 1991 .

[130]  Bishnu S. Atal,et al.  Improving performance of multi-pulse LPC coders at low bit rates , 1984, ICASSP.

[131]  Thomas P. Barnwell,et al.  Improving the performance of a mixed excitation LPC vocoder in acoustic noise , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[132]  Nobuhiko Kitawaki,et al.  Real and artificial speech signals for objective quality evaluation of speech coding systems , 1989 .

[133]  Karlheinz Brandenburg,et al.  The iso/mpeg-audio codec: A generic standard for coding of high quality digital audio , 1992 .

[134]  Ahmet M. Kondoz,et al.  Low bit rate speech coding at 1.2 and 2.4 kb/s , 1992 .

[135]  Timothy Moulsley,et al.  An adaptive voiced/unvoiced speech classifier , 1989, EUROSPEECH.

[136]  W. Bastiaan Kleijn,et al.  Fast methods for the CELP speech coding algorithm , 1990, IEEE Trans. Acoust. Speech Signal Process..

[137]  Allen Gersho,et al.  Product Code Vector Quantization of LPC Parameters , 1993 .

[138]  Alain Le Guyader,et al.  A robust and fast celp coder at 16 kbit/s , 1988, Speech Commun..

[139]  Masaaki Honda,et al.  LPC speech coding based on variable-length segment quantization , 1988, IEEE Trans. Acoust. Speech Signal Process..

[140]  Ed F. Deprettere,et al.  Regular-pulse excitation-A novel approach to effective and efficient multipulse coding of speech , 1986, IEEE Trans. Acoust. Speech Signal Process..

[141]  Giovanni Fausto Andreotti,et al.  A 6.3 kb/s CELP codec suitable for half-rate system , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[142]  Allen Gersho,et al.  A variable-rate natural-quality parametric speech coder , 1994, Proceedings of ICC/SUPERCOMM'94 - 1994 International Conference on Communications.

[143]  Yair Shoham,et al.  Low-delay code-excited linear-predictive coding of wideband speech at 32 kbps , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[144]  Jae S. Lim,et al.  A real-time implementation of the improved MBE speech coder , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[145]  S. Morissette,et al.  Fast CELP coding based on the Barnes-Wall lattice in 16 dimensions , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[146]  T. Taniguchi,et al.  Efficient coding of LPC parameters using adaptive prefiltering and MSVQ with partially adaptive codebook , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[147]  R.W. Schafer,et al.  Digital representations of speech signals , 1975, Proceedings of the IEEE.

[148]  Takehiro Moriya,et al.  Pitch synchronous innovation code excited linear prediction (PSI‐CELP) , 1994 .

[149]  Arild Fuldseth,et al.  Wideband speech coding at 16 kbit/s for a videophone application , 1992, Speech Commun..

[150]  Przemyslaw Dymarski,et al.  Optimal and sub-optimal algorithms for selecting the excitation in linear predictive coders , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[151]  Allen Gersho,et al.  Low delay speech coding , 1993, Speech Commun..

[152]  J.D. Gibson,et al.  Adaptive prediction in speech differential encoding systems , 1980, Proceedings of the IEEE.

[153]  Ernst F Schroeder,et al.  Aspec-Adaptive Spectral Entropy Coding of High Quality Music Signals , 1991 .

[154]  P. Jacobs,et al.  Qcelp: The North American Cdma Digital Cellular Variable Rate Speech Coding Standard , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[155]  V. Cuperman,et al.  Backward pitch prediction for low-delay speech coding , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[156]  G. S. Kang,et al.  Low-Bit Rate Speech Encoders Based on Line-Spectrum Frequencies (LSFs) , 1985 .

[157]  Peter Kabal,et al.  Pitch prediction filters in speech coding , 1989, IEEE Trans. Acoust. Speech Signal Process..

[158]  L. Fransen,et al.  Application of line-spectrum pairs to low-bit-rate speech encoders , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[159]  V. Ramamoorthy,et al.  Enhancement of ADPCM speech by adaptive postfiltering , 1984, AT&T Bell Laboratories Technical Journal.

[160]  J.-H. Chen,et al.  An 8 kb/s low-delay CELP speech coder , 1991, IEEE Global Telecommunications Conference GLOBECOM '91: Countdown to the New Millennium. Conference Record.

[161]  Bishnu S. Atal,et al.  ON IMPROVING THE PERFORMANCE OF PITCH PREDICTORS IN SPEECH CODING SYSTEMS , 1991 .

[162]  Peter Kabal,et al.  Rate-distortion function for speech coding based on perceptual distortion measure , 1992, [Conference Record] GLOBECOM '92 - Communications for Global Users: IEEE.

[163]  Yoshinori Tanaka,et al.  Speech Coding with Dynamic Bit Allocation (Multimode Coding) , 1991 .

[164]  John Princen,et al.  Analysis/Synthesis filter bank design based on time domain aliasing cancellation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[165]  B. Paillard,et al.  PERCEVAL: Perceptual Evaluation of the Quality of Audio Signals , 1992 .

[166]  Thomas P. Barnwell,et al.  Implementation and evaluation of a 2400 bit/s mixed excitation LPC vocoder , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[167]  Allen Gersho,et al.  Phonetic Segmentation for Low Rate Speech Coding , 1991 .

[168]  R.F. Kubichek,et al.  Speech quality assessment using expert pattern recognition , 1989, Conference Proceeding IEEE Pacific Rim Conference on Communications, Computers and Signal Processing.

[169]  Allen Gersho,et al.  Encoding of LPC spectral parameters using switched-adaptive interframe vector prediction (speech coding) , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[170]  Karlheinz Brandenburg OCF--A new coding algorithm for high quality sound signals , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[171]  Costas S. Xydeas,et al.  A long history quantization approach to scalar and vector quantization of LSP coefficients , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[172]  Bishnu S. Atal,et al.  On the use of pitch predictors with high temporal resolution , 1991, IEEE Trans. Signal Process..

[173]  Takehiro Moriya Two-Channel Conjugate Vector Quantizer for Noisy Channel Speech Coding , 1992, IEEE J. Sel. Areas Commun..

[174]  Luca Cellario,et al.  Variable rate speech coding for umts , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[175]  Manfred R. Schroeder,et al.  Vocoders: Analysis and synthesis of speech , 1966 .

[176]  Jörg-Martin Müller Improving performance of code excited LPC-coders by joint optimization , 1989, Speech Commun..

[177]  J. Makhoul,et al.  Vector quantization in speech coding , 1985, Proceedings of the IEEE.

[178]  Michael Shapiro Brandstein A 1.5 Kbps multi-band excitation speech coder , 1990 .

[179]  G. Cosier,et al.  Voice control of the pan-European digital mobile radio system , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[180]  E. Finnimore Objective Quality Assessment , 1986 .

[181]  Aníbal R. Figueiras-Vidal,et al.  On the behaviour of reduced complexity code-excited linear prediction (CELP) , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[182]  Jon W. Mark,et al.  Multiuser rate subband coding incorporating DSI and buffer control , 1990, IEEE Trans. Commun..

[183]  Claude R. Galand,et al.  Adaptive code excited predictive coding , 1992, IEEE Trans. Signal Process..

[184]  Juin-Hwey Chen,et al.  The creation and evolution of 16 kbit/s LD-CELP: From concept to standard , 1993, Speech Communication.

[185]  K. Zeger,et al.  Zero redundancy channel coding in vector quantisation , 1987 .

[186]  W. B. Kleijn,et al.  Improved pitch prediction , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[187]  Per Hedelin,et al.  Amplitude quantization for CELP excitation signals , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[188]  Yair Shoham High-quality speech coding at 2.4 kbps based on time-frequency interpolation , 1993, EUROSPEECH.

[189]  Manfred R. Schroeder,et al.  Rate distortion theory and predictive coding , 1981, ICASSP.

[190]  Isabel Trancoso,et al.  CELP and sinusoidal coders: Two solutions for speech coding at 4.8-9.6 kbps , 1990, Speech Commun..

[191]  Allen Gersho,et al.  Improved phonetically-segmented vector excitation coding at 3.4 kb/s , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[192]  Y. Yatsuzuka,et al.  A variable rate coding by APC with maximum likelihood quantization from 4.8 kbits/s to 16 kbits/s , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[193]  Man Mohan Sondhi,et al.  Enhancement of ADPCM speech coding with backward-adaptive algorithms for postfiltering and noise feedback , 1988, IEEE J. Sel. Areas Commun..

[194]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[195]  Francis Rumsey Hearing both sides-stereo sound for TV in the UK , 1990 .

[196]  Roch Lefebvre,et al.  8 kbit/s coding of speech with 6 ms frame-length , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[197]  Jean-Pierre Petit,et al.  Robust and fast code-excited linear predictive coding of speech signals , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[198]  R. A. Salami Binary code excited linear prediction (BCELP): new approach to CELP coding of speech without codebooks , 1989 .

[199]  Allen Gersho,et al.  Vector excitation coding with dynamic bit allocation , 1988, IEEE Global Telecommunications Conference and Exhibition. Communications for the Information Age.

[200]  Nuggehally Sampath Jayant,et al.  Speech coding with time-varying bit allocations to excitation and LPC parameters , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[201]  Masami Akamine,et al.  Adaptive bit-allocation between the pole-zero synthesis filter and excitation in CELP , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[202]  Joseph P. Campbell,et al.  The Dod 4.8 Kbps Standard (Proposed Federal Standard 1016) , 1991 .

[203]  K. Srinivasan,et al.  Voice activity detection for cellular networks , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[204]  H. Hassanein,et al.  A hybrid multiband excitation coder for low bit rates , 1992, 1992 IEEE International Conference on Selected Topics in Wireless Communications.

[205]  William R. Gardner,et al.  QCELP: A Variable Rate Speech Coder for CDMA Digital Cellular , 1993 .

[206]  B. Atal,et al.  Speech analysis and synthesis by linear prediction of the speech wave. , 1971, The Journal of the Acoustical Society of America.

[207]  Allen Gersho,et al.  Efficient Encoding of the Long-Term Predictor in Vector Excitation Coders , 1991 .

[208]  Nobuhiko Kitawaki,et al.  Objective measurement method for estimating speech quality of low-bit-rate speech coding , 1991 .

[209]  B. Atal High-quality speech at low bit rates: Multi-pulse and stochastically excited linear predictive coders , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[210]  Kenzo Akagiri,et al.  ATRAC: Adaptive Transform Acoustic Coding for MiniDisc , 1992 .

[211]  Louis Dunn Fielder,et al.  AC-3: Flexible Perceptual Coding for Audio Transmission and Storage , 1994 .

[212]  Nuggehally Sampath Jayant,et al.  Waveform quantization and coding , 1976 .

[213]  K. Brandenburg,et al.  Transform coding of high quality digital audio at low bit rates-algorithms and implementation , 1990, IEEE International Conference on Communications, Including Supercomm Technical Sessions.

[214]  Roar Hagen,et al.  Robust vector quantization in spectral coding , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[215]  Jianfeng Chen,et al.  A robust low-delay CELP speech coder at 16 kbits/s , 1989 .

[216]  Allen Gersho,et al.  Recent Trends and Techniques in Speech Coding , 1990, 1990 Conference Record Twenty-Fourth Asilomar Conference on Signals, Systems and Computers, 1990..

[217]  Renaud J. Di Francesco Real-time speech segmentation using pitch and convexity jump models: application to variable rate speech coding , 1990, IEEE Trans. Acoust. Speech Signal Process..

[218]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[219]  Vladimir Cuperman,et al.  Lattice Low Delay Vector Excitation for 8 kb/s Speech Coding , 1993 .

[220]  C. Xydeas,et al.  Advances in analysis by synthesis LPC speech coders , 1987 .

[221]  T. M. Liu,et al.  Phonetically-based LPC vector quantization of high quality speech , 1989, EUROSPEECH.

[222]  C. Xydeas An overview of speech coding techniques , 1992 .

[223]  R. Steele,et al.  High quality audio coding using analysis-by-synthesis technique , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[224]  Nikil Jayant,et al.  Signal Compression: Technology Targets and Research Directions , 1992, IEEE J. Sel. Areas Commun..

[225]  Allen Gersho,et al.  Pseudo-Gray coding , 1990, IEEE Trans. Commun..

[226]  S. A. Mahmoud,et al.  Tree searched multi-stage vector quantization of LPC parameters for 4 kb/s speech coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[227]  P. Mabilleau,et al.  Fast CELP coding based on algebraic codes , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[228]  M. Copperi Efficient excitation modeling in a low bit-rate CELP coder , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[229]  Daniele Sereno,et al.  Some experiments of 7 kHz audio coding at 16 kbit/s , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[230]  M. Johnson,et al.  Pitch sharpening for perceptually improved CELP, and the sparse-delta codebook for reduced computation , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[231]  Allen Gersho,et al.  Low-delay speech coding with adaptive interframe pitch tracking , 1993, Proceedings of ICC '93 - IEEE International Conference on Communications.

[232]  Allen Gersho,et al.  Real-time vector APC speech coding at 4800 bps with adaptive postfiltering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[233]  Wolfgang J. Hess,et al.  Pitch and voicing determination , 1992 .

[234]  T. Ramstad,et al.  Variable rate coding for speech storage , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.