50 Years of Progress in Speech and Speaker Recognition Research

Research in automatic speech and speaker recognition has now spanned five decades. This paper surveys the major themes and advances made in the past fifty years of research so as to provide a technological perspective and an appreciation of the fundamental progress that has been accomplished in this important area of speech communication. Although many techniques have been developed, many challenges have yet to be overcome before we can achieve the ultimate goal of creating machines that can communicate naturally with people. Such a machine needs to be able to deliver a satisfactory performance under a broad range of operating conditions. A much greater understanding of the human speech process is required before automatic speech and speaker recognition systems can approach human performance.

[1]  M. V. Mathews,et al.  Statistical techniques for talker identification , 1971 .

[2]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[3]  S. Pruzansky Pattern‐Matching Procedure for Automatic Talker Recognition , 1963 .

[4]  J. Forgie,et al.  Results Obtained from a Vowel Recognition Computer Program , 1959 .

[5]  N. G. Zagoruyko,et al.  Automatic recognition of 200 words , 1970 .

[6]  W. Endres,et al.  Voice spectrograms as a function of age, voice disguise, and voice imitation. , 1971, The Journal of the Acoustical Society of America.

[7]  T. B. Martin,et al.  SPEECH RECOGNITION BY FEATURE-ABSTRACTION TECHNIQUES. , 1964 .

[8]  K. Nagata Spoken digit recognizer for Japanese language. , 1963 .

[9]  T. K. Vintsyuk Speech discrimination by dynamic programming , 1968 .

[10]  D. B. Fry,et al.  Theoretical aspects of mechanical speech recognition , 1959 .

[11]  J. E. Dammann,et al.  Experimental Studies in Speaker Verification, Using an Adaptive System , 1966 .

[12]  Shuji Doshita,et al.  The Phonetic Typewriter , 1962, IFIP Congress.

[13]  M. Mathews,et al.  Talker‐Recognition Procedure Based on Analysis of Variance , 1963 .

[14]  G. Doddington A Method or Speaker Verification , 1971 .

[15]  D. R. Reddy An approach to computer speech recognition by direct analysis of the speech wave , 1966 .

[16]  P. Denes,et al.  The design and operation of the mechanical speech recognizer at University College London , 1959 .

[17]  Van Nostrand,et al.  Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm , 1967 .

[18]  C. D. Forgie,et al.  Automatic Recognition of Spoken Digits , 1958 .