An Overview of Automatic Speaker Verification System

Biometrics is used as a form of identification in many access control systems. Some of them are fingerprint, iris, face, speech, and retina. Speech biometrics is used for speaker verification. Speech is the most convenient way to communicate with person and machine, so it plays a vital role in signal processing. Automatic speaker verification is the authentication of individuals by doing analysis on speech utterances. Speaker verification falls into pattern matching problem. Many technologies are used for processing and storing voice prints. Some of them are Frequency Estimation, Hidden Markov Models, Gaussian Mixture Models, Neural Networks, Vector Quantization, and Decision Trees. Mainly speaker verification depends upon speaker modeling and this paper represents a brief overview of the speaker verification system with feature extraction and speaker modeling. Bob spear toolkit is used for evaluation and experiment for the result and analysis. Bob spear is an open-source toolkit for speech processing. For evaluation purpose, three algorithms are proposed which are GMM, ISV, and JFA with the same preprocessing and feature extraction techniques.

[1]  Larry P. Heck,et al.  Robust text-independent speaker identification over telephone channels , 1999, IEEE Trans. Speech Audio Process..

[2]  Sébastien Marcel,et al.  Spear: An open source toolbox for speaker recognition based on Bob , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  A. E. Rosenberg,et al.  Evaluation of an automatic speaker-verification system over telephone lines , 1976, The Bell System Technical Journal.

[4]  Douglas A. Reynolds,et al.  A Tutorial on Text-Independent Speaker Verification , 2004, EURASIP J. Adv. Signal Process..

[5]  Seiichi Nakagawa,et al.  Text-independent speaker recognition using non-linear frame likelihood transformation , 1998, Speech Commun..

[6]  David A. van Leeuwen,et al.  On calibration of language recognition scores , 2006, Odyssey.

[7]  Douglas A. Reynolds,et al.  An overview of automatic speaker recognition technology , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Douglas A. Reynolds,et al.  An overview of automatic speaker diarization systems , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  M. H. Moattar,et al.  Text-Independent Speaker Verification Using Variational Gaussian Mixture Model , 2011 .

[10]  Rosângela Coelho,et al.  Text-independent speaker recognition based on the Hurst parameter and the multidimensional fractional Brownian motion model , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.

[12]  Bin Ma,et al.  Text-dependent speaker verification: Classifiers, databases and RSR2015 , 2014, Speech Commun..

[13]  William M. Campbell,et al.  Speaker Verification Using Support Vector Machines and High-Level Features , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Douglas E. Sturim,et al.  Speaker indexing in large audio databases using anchor models , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).