Speaker verification by computer using speech intensity for temporal registration

A technique for automatic speaker verification is described in which voice pitch, low-frequency intensity, and the three lowest formant frequencies, all as functions of time, are the features used to represent an individual utterance. Verification consists of computing these features for a test utterance and comparing them with stored reference versions for the claimed identity. Before the test-versus-reference comparison is effected, the time dimension of the test utterance is warped to optimally register its intensity pattern onto the reference intensity pattern. Performance of the system is measured on a speaker population of moderate size. A variety of comparison formulas and various subsets of the five speech features are evaluated. The system responds either "accept" or "reject" to every utterance; "no decision" is not allowed. Automatic verification based solely upon voice pitch and intensity, both of which can be computed rapidly, yields average error rates below 1 percent.
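The registration-then-comparison idea can be illustrated with a minimal sketch. It is not the paper's actual procedure. The abstract does not specify the warping algorithm or the comparison formula, so the sketch substitutes standard dynamic time warping (DTW) for the registration step and a mean absolute difference for the comparison. The function names (`dtw_path`, `warp_to_reference`, `verify`), the threshold, and the sample contours are all hypothetical.

```python
def dtw_path(test, ref):
    """Align two 1-D contours with standard dynamic time warping.

    Assumption: DTW stands in for the paper's (unspecified here) warping
    method. Returns (total alignment cost, list of (test_idx, ref_idx)).
    """
    n, m = len(test), len(ref)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(test[i - 1] - ref[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1], cost[i - 1][j], cost[i][j - 1])
    # Backtrack from the end of both contours to recover the alignment.
    i, j = n, m
    path = []
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        ]
        _, i, j = min(steps, key=lambda s: s[0])
    path.reverse()
    return cost[n][m], path


def warp_to_reference(feature, path, ref_len):
    """Resample a test feature onto the reference time axis.

    Test frames mapped to the same reference frame are averaged.
    """
    sums = [0.0] * ref_len
    counts = [0] * ref_len
    for ti, rj in path:
        sums[rj] += feature[ti]
        counts[rj] += 1
    return [s / c for s, c in zip(sums, counts)]


def verify(test_intensity, test_pitch, ref_intensity, ref_pitch, threshold):
    """Register on intensity, then compare pitch and intensity contours.

    Assumption: the distance is a plain sum of per-feature mean absolute
    differences; the paper evaluates several comparison formulas.
    """
    _, path = dtw_path(test_intensity, ref_intensity)
    n_ref = len(ref_intensity)
    warped_int = warp_to_reference(test_intensity, path, n_ref)
    warped_pitch = warp_to_reference(test_pitch, path, n_ref)
    d_int = sum(abs(a - b) for a, b in zip(warped_int, ref_intensity)) / n_ref
    d_pitch = sum(abs(a - b) for a, b in zip(warped_pitch, ref_pitch)) / n_ref
    # Forced decision: every utterance is either accepted or rejected.
    return "accept" if d_int + d_pitch <= threshold else "reject"


# Hypothetical contours: the test utterance is a time-stretched reference.
ref_intensity = [0, 1, 3, 1, 0]
ref_pitch = [100, 110, 120, 110, 100]
test_intensity = [0, 0, 1, 3, 3, 1, 0]
test_pitch = [100, 100, 110, 120, 120, 110, 100]
impostor_pitch = [p + 50 for p in test_pitch]

genuine = verify(test_intensity, test_pitch, ref_intensity, ref_pitch, 5.0)
impostor = verify(test_intensity, impostor_pitch, ref_intensity, ref_pitch, 5.0)
```

The warp is estimated from intensity alone and then applied to every feature, which mirrors the abstract's description of registering the test intensity pattern onto the reference before the features are compared.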
