An LPC vocoder based on phase-equalized pitch waveform

This paper presents a speech coder operating at a very low bit-rate using a model called "phase-equalized pitch waveform". The basic idea of the coder is to employ pitchwise extraction of the linear predictive residual signal, a pitch waveform, in voiced speech. The residual signal is processed with a phase-equalization filter to increase the efficiency of both the pitch waveform quantization and interpolation. Listening tests showed that efficient and high-quality coding is achieved at 2.0 kbits/s. The quality of the coder is equal to that of the DoD FS1016 standard CELP at 4.8 or 2.4 kbits/s MELP.

[1]  Lajos Hanzo,et al.  A multiband excited waveform-interpolated 2.35-kbps speech codec for bandlimited channels , 2000, IEEE Trans. Veh. Technol..

[2]  W. Bastiaan Kleijn,et al.  Encoding speech using prototype waveforms , 1993, IEEE Trans. Speech Audio Process..

[3]  Allen Gersho,et al.  Advances in speech coding , 1991 .

[4]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[5]  Yusuke Hiwasaki,et al.  A new 2-kbit/s speech coder based on normalized pitch waveform , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Thomas P. Barnwell,et al.  MCCREE AND BARNWELL MIXED EXCITAmON LPC VOCODER MODEL LPC SYNTHESIS FILTER 243 SYNTHESIZED SPEECH-PERIODIC PULSE TRAIN-1 PERIODIC POSITION JITTER PULSE 4 , 2004 .

[7]  Takehiro Moriya,et al.  Speech coder using phase equalization and vector quantization , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  K. Mano,et al.  Vector quantization of LSP parameters using moving average interframe prediction , 1994 .

[9]  Masaaki Honda Speech coding using waveform matching based on LPC residual phase equalization , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[10]  Jae S. Lim,et al.  Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[11]  Allen Gersho,et al.  Mixed-domain coding of speech at 3 kb/s , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[12]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Willem Bastiaan Kleijn,et al.  Improved speech quality and efficient vector quantization in SELP , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[14]  Gernot Kubin,et al.  Performance of noise excitation for unvoiced speech , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[15]  George S. Kang,et al.  2.4-kb/s Vocoder Based on Pitch-Synchronous Segmentation of Speech , 1995, Proceedings. IEEE Workshop on Speech Coding for Telecommunications.

[16]  Joseph P. Campbell,et al.  The Dod 4.8 Kbps Standard (Proposed Federal Standard 1016) , 1991 .

[17]  R. J. Holbeche,et al.  A mixed prototype waveform/CELP coder for sub 3 kbit/s , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.