This paper describes new techniques for automatic speaker verification using telephone speech. The operation of the system is based on a set of functions of time obtained from acoustic analysis of a fixed, sentence-long utterance. Cepstrum coefficients are extracted by means of LPC analysis successively throughout an utterance to form time functions, and frequency response distortions introduced by transmission systems are removed. The time functions are expanded by orthogonal polynomial representations and, after a feature selection procedure, brought into time registration with stored reference functions to calculate the overall distance. This is accomplished by a new time warping method using a dynamic programming technique. A decision is made to accept or reject an identity claim, based on the overall distance. Reference functions and decision thresholds are updated for each customer. Several sets of experimental utterances were used for the evaluation of the system, which include male and female utterances recorded over a conventional telephone connection. Male utterances processed by ADPCM and LPC coding systems were used together with unprocessed utterances. Results of the experiment indicate that verification error rate of one percent or less can be obtained even if the reference and test utterances are subjected to different transmission conditions.
[1]
James L. Flanagan,et al.
Adaptive quantization in differential PCM coding of speech
,
1973
.
[2]
R. Lummis,et al.
Speaker verification by computer using speech intensity for temporal registration
,
1973
.
[3]
B. Atal.
Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification.
,
1974,
The Journal of the Acoustical Society of America.
[4]
L. Rabiner,et al.
Optimum FIR Digital Filter Implementations for Decimation, Interpolation, and Narrow-Band Filtering
,
1975
.
[5]
Aaron E. Rosenberg,et al.
New techniques for automatic speaker verification
,
1975
.
[6]
F. Itakura,et al.
Minimum prediction residual principle applied to speech recognition
,
1975
.
[7]
A. E. Rosenberg,et al.
Evaluation of an automatic speaker-verification system over telephone lines
,
1976,
The Bell System Technical Journal.
[8]
Ronald W. Schafer,et al.
Real-time digital hardware pitch detector
,
1976
.
[9]
John E. Markel,et al.
Linear Prediction of Speech
,
1976,
Communication and Cybernetics.