Parametric influence of speaker verification system — An experimental study

With the advent of servicing industries that needs to cater to a large number of their clients it has become necessary to offer self-help options to their clients. Very often the self-help channel is the interactive voice response (IVR) channel and the mode of interaction is spoken speech. The affect of this is the need for the self-help platform to be able to verify the identity of the user before making the self-help services. In this context speech based biometric has gained significance in recent times. There are several advantages of using speech as a biometric; but there has been a caution in using speech as a biometric in actual practice. This paper is part tutorial and part experimental. We first present a speaker verification system based on statistical modeling and then present experiment results to verify the influence of the train data, speech parameters and other information on the performance of the speaker verification system.

[1]  Ren-Hua Wang,et al.  A weighted distance measure based on the fine structure of feature space: application to speaker recognition , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[2]  Steve Gold Voice biometrics: real-world issues and solutions , 2010 .

[3]  Pavel Senin,et al.  Dynamic Time Warping Algorithm Review , 2008 .

[4]  C.-S. Liu,et al.  Study of line spectrum pair frequencies for speaker recognition , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[5]  B. Atal Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. , 1974, The Journal of the Acoustical Society of America.

[6]  Sin-Horng Chen,et al.  On the use of pitch contour of Mandarin speech in text-independent speaker identification , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Zia Saquib,et al.  A Survey on Automatic Speaker Recognition Systems , 2010, FGIT-SIP/MulGraB.

[8]  Douglas A. Reynolds,et al.  SHEEP, GOATS, LAMBS and WOLVES A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation , 1998 .

[9]  D. O'Shaughnessy,et al.  Speaker recognition , 1986, IEEE ASSP Magazine.

[10]  Toni C. M. Rietveld,et al.  Speaker characterization in dutch using prosodic parameters , 1991, EUROSPEECH.

[11]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.

[12]  Peter N. Yianilos,et al.  Learning String-Edit Distance , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[14]  H. Noda On the use of the information on individual speaker's position in the parameter space for speaker recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[15]  Patricia Melin,et al.  Modular Neural Network with Fuzzy Integration and Its Optimization Using Genetic Algorithms for Human Recognition Based on Iris, Ear and Voice Biometrics , 2010, Soft Computing for Recognition Based on Biometrics.

[16]  Hervé Bourlard,et al.  Limited parameter hidden Markov models for connected digit speaker verification over telephone channels , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  Chin-Hui Lee,et al.  Speaker verification based on combining speaker individuality parameter selection and decision , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..

[18]  John S. D. Mason,et al.  Integration of acoustic and visual speech for speaker recognition , 1993, EUROSPEECH.

[19]  B. Atal Automatic Speaker Recognition Based on Pitch Contours , 1969 .