The MIT Mobile Device Speaker Verification Corpus: Data Collection and Preliminary Experiments

In this paper we discuss data collection and preliminary experiments for a new speaker verification corpus collected on a small handheld device in multiple environments using multiple microphones. This corpus, which has been made publically available by MIT, is intended for explorations of the problem of robust speaker verification on handheld devices in noisy environments with limited training data. To provide a set of preliminary results, we examine text-dependent speaker verification under a variety of cross-conditional environment and microphone training constraints. Our preliminary results indicate that the presence of noise in the training data improves the robustness of our speaker verification models even when tested in mismatched environments

[1]  Saeed Vaseghi,et al.  Speaker identification in unknown noisy conditions - a universal compensation approach , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[2]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[3]  James R. Glass,et al.  A Comparative Study of Methods for Handheld Speaker Verification in Realistic Noisy Conditions , 2006, 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop.

[4]  G.R. Doddington,et al.  Speaker recognition—Identifying people by their voices , 1985, Proceedings of the IEEE.

[5]  Alex Park,et al.  A comparison of normalization and training approaches for ASR-dependent speaker identification , 2004, INTERSPEECH.

[6]  Alex Park,et al.  ASR dependent techniques for speaker identification , 2002, INTERSPEECH.

[7]  James R. Glass,et al.  Speaker Verification Over Handheld Devices with Realistic Noisy Speech Data , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[8]  Jean-Luc Gauvain,et al.  Speaker verification over the telephone , 2000, Speech Commun..

[9]  Alex Park,et al.  MULTI-MODAL FACE AND SPEAKER IDENTIFICATION ON A HANDHELD DEVICE , 2003 .

[10]  Ram H. Woo,et al.  Exploration of small enrollment speaker verification on handheld devices , 2005 .

[11]  Chafic Mokbel,et al.  An overview of the PICASSO project research activities in speaker verification for telephone applications , 1999, EUROSPEECH.

[12]  Jean-Luc Gauvain,et al.  Feature and score normalization for speaker verification of cellular data , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[13]  James R. Glass A probabilistic framework for segment-based speech recognition , 2003, Comput. Speech Lang..

[14]  Alex Park,et al.  Towards robust person recognition on handheld devices using face and speaker identification technologies , 2003, ICMI '03.