Advances in Very Low Bit Rate Speech Coding Using Recognition and Synthesis Techniques

ALISP (Automatic Language Independent Speech Processing) units offer an alternative to phoneme-derived units in speech processing. This article describes advances in very low bit rate speech coding based on ALISP units. Results of speaker-independent experiments are reported, and speaker clustering using vector quantization is proposed. Improvements to speech re-synthesis obtained with the Harmonic Noise Model and dynamic unit selection are also discussed.
