A novel whispered speaker identification system based on extreme learning machine

Whispered speech speaker identification system is one of the most demanding efforts in automatic speaker recognition applications. Due to the profound variations between neutral and whispered speech in acoustic characteristics, the performance of conventional speaker identification systems applied on neutral speech degrades drastically when compared to whisper speech. This work presents a novel speaker identification system using whispered speech based on an innovative learning algorithm which is named as extreme learning machine (ELM). The features used in this proposed system are Instantaneous frequency with probability density models. Parametric and nonparametric probability density estimation with ELM was compared with the hybrid parametric and nonparametric probability density estimation with Extreme Learning Machine (HPNP-ELM) for instantaneous frequency modeling. The experimental result shows the significant performance improvement of the proposed whisper speech speaker identification system.

[1]  Qi Li,et al.  A detection approach to search-space reduction for HMM state alignment in speaker verification , 2001, IEEE Trans. Speech Audio Process..

[2]  John H. L. Hansen,et al.  Analysis and classification of speech mode: whispered through shouted , 2007, INTERSPEECH.

[3]  J.H.L. Hansen,et al.  An efficient scoring algorithm for Gaussian mixture model based speaker identification , 1998, IEEE Signal Processing Letters.

[4]  Heming Zhao,et al.  Whispered speech speaker identification based on SVM and FA , 2010, 2010 International Conference on Audio, Language and Image Processing.

[5]  Arun Ross,et al.  An introduction to biometric recognition , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Yen-Jen Oyang,et al.  Data classification with a relaxed model of variable kernel density estimation , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[7]  DeLiang Wang,et al.  Robust Speaker Identification in Noisy and Reverberant Conditions , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[8]  Dianhui Wang,et al.  Extreme learning machines: a survey , 2011, Int. J. Mach. Learn. Cybern..

[9]  Juan Xu,et al.  Speaker identification with whispered speech using unvoiced-consonant phonemes , 2012, 2012 International Conference on Image Analysis and Signal Processing.

[10]  Kazuya Takeda,et al.  Analysis and recognition of whispered speech , 2005, Speech Commun..

[11]  William M. Campbell,et al.  Support vector machines for speaker and language recognition , 2006, Comput. Speech Lang..

[12]  Georges Quénot,et al.  Unsupervised Speaker Identification in TV Broadcast Based on Written Names , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[13]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[14]  Chung-Hsien Yang,et al.  Robust Speaker Identification and Verification , 2007, IEEE Computational Intelligence Magazine.

[15]  John H. L. Hansen,et al.  Blind Spectral Weighting for Robust Speaker Identification under Reverberation Mismatch , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[16]  Tanja Schultz,et al.  Whispering Speaker Identification , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[17]  Mark A. Clements,et al.  Reconstruction of speech from whispers , 2002, MAVEBA.

[18]  Sun-Yuan Kung,et al.  Estimation of elliptical basis function parameters by the EM algorithm with application to speaker verification , 2000, IEEE Trans. Neural Networks Learn. Syst..

[19]  John H. L. Hansen,et al.  Speaker Identification Within Whispered Speech Audio Streams , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  Douglas E. Sturim,et al.  Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.

[21]  S. Jovicic,et al.  Acoustic analysis of consonants in whispered speech. , 2008, Journal of voice : official journal of the Voice Foundation.

[22]  Chang-Hong Lin,et al.  Speaker Identification With Whispered Speech for the Access Control System , 2015, IEEE Transactions on Automation Science and Engineering.