Paper information - Advances in Very Low Bit Rate Speech Coding Using Recognition and Synthesis Techniques

ALISP (Automatic Language Independent Speech Processing) units are an alternative to phoneme-derived units in speech processing. This article describes advances in very low bit rate speech coding using ALISP units. Results of speaker-independent experiments are reported, and speaker clustering using vector quantization is proposed. Improvements to speech re-synthesis using the Harmonic Noise Model and dynamic unit selection are also discussed.
