Phonetically-based vector excitation coding of speech at 3.6 kbps

A phonetically based segmentation of speech is performed to classify segments into five classes: onset, unvoiced, low-pass voiced, steady-state voiced, and transient voiced. The segment lengths are constrained to an integer multiple of a unit frame. For each segment class, a distinctive coding scheme based on vector excitation coding (VXC) is used. The maximum bit rate is 3.6 kb/s, and a moderate coding delay of 45 ms is incurred. Performance is roughly comparable to conventional VXC/CELP (code-excited linear prediction) coding at 4.8 kb/s.
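
The constraint that segment lengths be an integer multiple of a unit frame can be illustrated with a minimal sketch. It assumes that each unit frame has already been assigned one of the five class labels; the abstract does not describe the classifier, the unit-frame duration, or any cap on segment length, so the function name `segment_unit_frames` and the parameter `max_units` are hypothetical and serve only as illustration.

```python
# Illustrative sketch only: not the paper's segmentation procedure.
# The five class names come from the abstract; everything else is assumed.
CLASSES = (
    "onset",
    "unvoiced",
    "low-pass voiced",
    "steady-state voiced",
    "transient voiced",
)


def segment_unit_frames(frame_labels, max_units=4):
    """Merge consecutive unit frames with the same label into segments.

    Returns a list of (label, length_in_unit_frames) tuples. Because
    segments are built from whole unit frames, every segment length is
    an integer multiple of the unit frame by construction.
    max_units is a hypothetical cap on segment length.
    """
    segments = []
    for label in frame_labels:
        if label not in CLASSES:
            raise ValueError(f"unknown segment class: {label!r}")
        if segments and segments[-1][0] == label and segments[-1][1] < max_units:
            # Extend the current segment by one unit frame.
            segments[-1] = (label, segments[-1][1] + 1)
        else:
            # Start a new segment (class changed or cap reached).
            segments.append((label, 1))
    return segments
```

Each resulting segment could then be passed to a coding routine selected by its class label, in line with the class-specific VXC schemes the abstract mentions.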
