Reliable Speaker Identification Using Multiple Microphones in Ubiquitous Robot Companion Environment

This paper presents a text-independent speaker identification system using multiple microphones on the robot, which is intended for use in human-robot interaction. For the purpose of the best possible classification rate in speaker identification, the individual identification results obtained from multiple microphones on the robot are combined by various combination schemes. The performance improvement has been achieved. Our ultimate goal is to enhance human-robot interaction by improving the recognition performance of speaker identification with multiple microphones on the robot side in adverse distant-talking environments. Various combination schemes obtained high classification accuracy in the ubiquitous robot companion (URC) environment, where the robot is connected to a server through extremely high broadband penetration rate. In conclusion, our speaker identification system can provide human-robot interaction with a reliable basic interface with high classification accuracy.

[1]  Jean-François Bonastre,et al.  Time and frequency pruning for speaker identification , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[2]  Tanja Schultz,et al.  Far-Field Speaker Recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Douglas A. Reynolds,et al.  Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..

[4]  Rashmi Gangadharaiah,et al.  Extracting additional information from Gaussian mixture model probabilities for improved text independent speaker identification , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[5]  Douglas D. O'Shaughnessy,et al.  A double Gaussian mixture modeling approach to speaker recognition , 1997, EUROSPEECH.

[6]  Ea-Ee Jan,et al.  Microphone arrays and speaker identification , 1994, IEEE Trans. Speech Audio Process..

[7]  Douglas D. O'Shaughnessy,et al.  On the use of some divergence measures in speaker recognition , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[8]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[9]  Jean-François Bonastre,et al.  Frame pruning for automatic speaker identification , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[10]  Sridha Sridharan,et al.  Robust speaker recognition using microphone arrays , 2001, Odyssey.

[11]  Jean-François Bonastre,et al.  Frame pruning for speaker recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[12]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..