Speaker recognition using adaptively boosted classifier

A novel approach for speaker recognition is proposed. The system makes use of adaptive boosting (AdaBoost) and multilayer perceptrons (MLP) as classifier for closed set, text-dependent speaker recognition. The performance of the systems is assessed using a subset of 20 speakers, 10 male and 10 female, drawn from the YOHO speaker verification corpus. Results show that improvement in accuracy of recognition can be achieved through adaptive boosting of the classifier.

[1]  Dale Schuurmans,et al.  Boosting in the Limit: Maximizing the Margin of Learned Ensembles , 1998, AAAI/IAAI.

[2]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[3]  Soo-Ngee Koh,et al.  Pitch determination of noisy speech using wavelet transform in time and frequency domains , 1993, Proceedings of TENCON '93. IEEE Region 10 International Conference on Computers, Communications and Automation.

[4]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[5]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.

[6]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[7]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[8]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[9]  J.M. Naik,et al.  Speaker verification: a tutorial , 1990, IEEE Communications Magazine.

[10]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[11]  Günther Palm,et al.  Signal modeling for speaker identification , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[12]  R. Sankar,et al.  Pitch extraction algorithm for voice recognition applications , 1988, [1988] Proceedings. The Twentieth Southeastern Symposium on System Theory.

[13]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[14]  Bayya Yegnanarayana,et al.  Formant extraction from Fourier transform phase , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[15]  K. Fushikida A formant extraction method using autocorrelation domain inverse filtering and focusing method , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[16]  Tetsuya Shimamura,et al.  A modified cepstrum method for pitch extraction , 1998, IEEE. APCCAS 1998. 1998 IEEE Asia-Pacific Conference on Circuits and Systems. Microelectronics and Integrating Systems. Proceedings (Cat. No.98EX242).

[17]  Yoav Freund,et al.  Boosting a weak learning algorithm by majority , 1995, COLT '90.