Speaker recognition under limited data condition by noise addition

This work demonstrates that, under limited data condition, it is indeed possible to improve the speaker recognition performance by controlled noise addition. The problem of limited data (<15s) for training and testing is overcome to some extent by adding noise at very high Signal to Noise Ratio (SNR) values. The noise added versions may be viewed as different instances of the given data. Hence put together increases the number of feature vectors. The speaker identification study is conducted using randomly selected 100 speakers from TIMIT database, Mel-Frequency Cepstral Coefficients (MFCC) features and Gaussian Mixture Model (GMM)-Universal Background Model (UBM). The method provides performance of 78.20% using only limited data and 80% using both limited and noisy data.

[1]  Danoush Hosseinzadeh,et al.  On the Use of Complementary Spectral Features for Speaker Recognition , 2008, EURASIP J. Adv. Signal Process..

[2]  Man-Wai Mak,et al.  A Comparison of Various Adaptation Methods for Speaker Verification With Limited Enrollment Data , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[3]  Herbert Gish,et al.  Speaker verification with limited enrollment data , 1997, EUROSPEECH.

[4]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[5]  Douglas D. O'Shaughnessy Speech Communications: Human and Machine , 2012 .

[6]  John H. L. Hansen,et al.  In-Set/Out-of-Set Speaker Recognition Under Sparse Enrollment , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  John H. L. Hansen,et al.  Discriminative In-Set/Out-of-Set Speaker Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Victor Zue,et al.  Speech database development at MIT: Timit and beyond , 1990, Speech Commun..

[9]  S. R. Mahadeva Prasanna,et al.  Multiple frame size and rate analysis for speaker recognition under limited data condition , 2009 .