Quadratic detectors for feature extraction in text-independent speaker authentication

Text-independent speaker authentication requires a feature set which is sensitive to characteristics of speakers while being reasonably invariant across utterances. Features obtained from cepstra-based techniques have long been used for speech recognition. However, these features are utterance dependent and sensitive to noise so that it may be more difficult to use them for robust speaker authentication. In this paper, a quadratic detector, which is closely related to quadratic time-frequency representations, is proposed to achieve the required utterance-invariant feature extraction. As demonstrated on data derived from the King Corpus, the features extracted using the quadratic detector can provide better classification accuracy than solely cepstral features.<<ETX>>

[1]  Johan Schalkwyk,et al.  Detecting an imposter in telephone speech , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  H. Gish Robust discrimination in automatic speaker identification , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[3]  George R. Doddington,et al.  Speaker verification using temporal decorrelation post-processing , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Les Atlas,et al.  Quadratic detectors for general nonlinear analysis of speech , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Richard J. Mammone,et al.  Robust cepstral features for speaker identification , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Les Atlas,et al.  Advantages of cascaded quadratic detectors for analysis of manufacturing sensor data , 1992, [1992] Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis.

[7]  Les E. Atlas,et al.  Quadratic detectors for energy estimation , 1995, IEEE Trans. Signal Process..

[8]  James F. Kaiser,et al.  Some useful properties of Teager's energy operators , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  L. Cohen,et al.  Time-frequency distributions-a review , 1989, Proc. IEEE.

[10]  Douglas A. Reynolds,et al.  Experimental evaluation of features for robust speaker identification , 1994, IEEE Trans. Speech Audio Process..

[11]  H. Teager Some observations on oral air flow during phonation , 1980 .