A hybrid time-frequency domain articulatory speech synthesizer

High-quality speech at low bit rates (e.g., 2400 bits/s) is one of the important objectives of current speech research. As part of a long-range effort on this problem, we have developed an efficient computer program that will serve as a tool for investigating whether articulatory speech synthesis can achieve this low bit rate. At a sampling frequency of 8 kHz, the most comprehensive version of the program, which includes nasality and frication, runs at about twice real time on a Cray-1 computer.
