A brief history of synthetic speech

Abstract This paper retraces, in an informal way, some of the history of speech synthesis and speech research from Von Kempelen's speaking machine to linear prediction.

[1]  T. Gramss Word recognition with the feature finding neural network (FFNN) , 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.

[2]  Manfred Schroeder,et al.  Fractals, Chaos, Power Laws: Minutes From an Infinite Paradise , 1992 .

[3]  M. Schroeder Determination of the geometry of the human vocal tract by acoustic measurements. , 1967, The Journal of the Acoustical Society of America.

[4]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  M. V. Mathews,et al.  Laboratory computers: Their capabilities and how to make them work for you , 1970 .

[6]  S Kiritani,et al.  Computer controlled radiography for observation of movements of articulatory and other human organs. , 1973, Computers in biology and medicine.

[7]  M R Schroeder,et al.  Flat-spectrum speech. , 1986, The Journal of the Acoustical Society of America.

[8]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[9]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[10]  Manfred R. Schroeder,et al.  Bandwidth compression of speech by analytic-signal rooting , 1967 .

[11]  P. Mermelstein Determination of the vocal-tract shape from measured formant frequencies. , 1967, The Journal of the Acoustical Society of America.

[12]  H. Fujisaki,et al.  Automatic Extraction of Fundamental Period of Speech by Auto‐Correlation Analysis and Peak Detection , 1960 .

[13]  K. Stevens,et al.  Reduction of Speech Spectra by Analysis‐by‐Synthesis Techniques , 1961 .

[14]  H. Dudley,et al.  The Speaking Machine of Wolfgang von Kempelen , 1949 .

[15]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[16]  Bruce P. Bogert The Vobanc—A Two‐to‐One Speech Band‐Width Reduction System , 1956 .

[17]  Charles K. Chui,et al.  An Introduction to Wavelets , 1992 .

[18]  J. Flanagan,et al.  Synthesis of voiced sounds from a two-mass model of the vocal cords , 1972 .

[19]  S. L. Hanauer,et al.  B.s.t.j. brief interpolation of data with continuous speech signals , 1967 .

[20]  G. Oscar Russell THE MECHANISM OF SPEECH , 1929 .

[21]  A. Gove,et al.  Mechanisms of Speech , 1968 .

[22]  Manfred R. Schroeder,et al.  Vocoders: Analysis and synthesis of speech , 1966 .

[23]  C. C. Goodyear,et al.  On the use of neural networks in articulatory speech synthesis , 1993 .

[24]  E. Friedman,et al.  The telephone book , 1979 .

[25]  Wolfgang Hess,et al.  Pitch Determination of Speech Signals , 1983 .

[26]  B. Atal,et al.  Optimizing digital speech coders by exploiting masking properties of the human ear , 1978 .

[27]  Hans Werner Strube,et al.  Modulation-Frequency Encoding of Speech with Applications to Neural Speech Recognizers , 1993 .

[28]  T. Houtgast,et al.  The Modulation Transfer Function in Room Acoustics as a Predictor of Speech Intelligibility , 1973 .

[29]  Manfred R. Schroeder,et al.  Correlation Techniques for Speech Bandwidth Compression , 1960 .

[30]  A. Liberman,et al.  Some Experiments on the Perception of Synthetic Speech Sounds , 1952 .

[31]  Gerold Ungeheuer Elemente einer Akustischen Theorie der Vokalartikulation , 1962 .

[32]  M. R. Schroeder,et al.  Adaptive predictive coding of speech signals , 1970, Bell Syst. Tech. J..

[33]  B. Atal,et al.  Predictive coding of speech signals and subjective error criteria , 1979 .

[34]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[35]  M. R. Schroeder,et al.  Short‐Time “Cepstrum” Pitch Detection , 1964 .