Investigating Text-independent Speaker Verification from Practically Realizable System Perspective

This work projects an attempt to explore the prospects of text-independent speaker verification (SV) for practical realizable systems. Although the advancements in SV systems have gained attention towards deployable systems, the performance seems to degrade under uncontrolled conditions. A protocol for data collection is designed for the text-independent SV with student attendance as an application to create a database in a real-world scenario. The i-vector based speaker modeling is used for evaluating the performance that depicts major deviation of results from that obtained on standard database. This portrays the significance of having real-world scenario based databases for robust SV studies. Further, studies are performed related to speaker categorization, speaker confidence and model update that showcase their significance towards systems in practice. The database created in this work is available as a part of multi-style speaker recognition database.

[1]  Florin Curelaru,et al.  Front-End Factor Analysis For Speaker Verification , 2018, 2018 International Conference on Communications (COMM).

[2]  Douglas A. Reynolds,et al.  SHEEP, GOATS, LAMBS and WOLVES A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation , 1998 .

[3]  S. R. Mahadeva Prasanna,et al.  Multi-style speaker recognition database in practical conditions , 2018, Int. J. Speech Technol..

[4]  Douglas E. Sturim,et al.  Corpora for the Evaluation of Robust Speaker Recognition Systems , 2016, INTERSPEECH.

[5]  Mark C. Huggins,et al.  Confidence metrics for speaker identification , 2002, INTERSPEECH.

[6]  Douglas E. Sturim,et al.  SVM Based Speaker Verification using a GMM Supervector Kernel and NAP Variability Compensation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[7]  S. R. Mahadeva Prasanna,et al.  Development of Multi-Level Speech based Person Authentication System , 2017, J. Signal Process. Syst..

[8]  Aaron Lawson,et al.  The Speakers in the Wild (SITW) Speaker Recognition Database , 2016, INTERSPEECH.

[9]  John H. L. Hansen,et al.  Robustness in Speech, Speaker, and Language Recognition: "You've Got to Know Your Limitations" , 2016, INTERSPEECH.

[10]  S. Furui,et al.  Cepstral analysis technique for automatic speaker verification , 1981 .

[11]  Rohan Kumar Das,et al.  Exploring different attributes of source information for speaker verification with limited test data. , 2016, The Journal of the Acoustical Society of America.

[12]  Kevin Walker,et al.  The RATS radio traffic collection system , 2012, Odyssey.

[13]  Aaron Lawson,et al.  The 2016 Speakers in the Wild Speaker Recognition Evaluation , 2016, INTERSPEECH.

[14]  Enrique Cantó,et al.  Real-Time Speaker Verification System Implemented on Reconfigurable Hardware , 2013, J. Signal Process. Syst..

[15]  Bin Ma,et al.  Joint Application of Speech and Speaker Recognition for Automation and Security in Smart Home , 2011, INTERSPEECH.

[16]  S. R. Mahadeva Prasanna,et al.  Speaker Verification from Short Utterance Perspective: A Review , 2018 .

[17]  S. R. Mahadeva Prasanna,et al.  Development and evaluation of online text-independent speaker verification system for remote person authentication , 2013, Int. J. Speech Technol..

[18]  S. R. Mahadeva Prasanna,et al.  Exploring kernel discriminant analysis for speaker verification with limited test data , 2017, Pattern Recognit. Lett..

[19]  S. R. Mahadeva Prasanna,et al.  Speech biometric based attendance system , 2014, 2014 Twentieth National Conference on Communications (NCC).

[20]  S. R. Mahadeva Prasanna,et al.  Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances , 2016, INTERSPEECH.