A DTW-based probability model for speaker feature analysis and data mining

This paper is a contribution to probabilistic data mining and pattern recognition. A DTW-based statistical model is proposed to explore the subspace structures of speaker feature space for feature evaluation, dimension reduction and inter-class information discovery in pattern space. We demonstrate its usefulness in isolated digits speaker identification, and the performance of the statistical model is compared with standard DTW recognition rate in the experiment. We argue that the probability model can be taken as data mining tools.

[1]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.

[2]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[3]  John W. Tukey,et al.  Exploratory Data Analysis , 1980, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[4]  Biing-Hwang Juang,et al.  On the use of bandpass liftering in speech recognition , 1987, IEEE Trans. Acoust. Speech Signal Process..

[5]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[6]  Josef Kittler,et al.  Feature selection for a DTW-based speaker verification system , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[7]  Fuad Rahman,et al.  Selective partition algorithm for finding regions of maximum pairwise dissimilarity among statistical class models , 1997, Pattern Recognit. Lett..

[8]  Sadaoki Furui,et al.  Recent advances in speaker recognition , 1997, Pattern Recognit. Lett..

[9]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[10]  Lawrence R. Rabiner,et al.  A modified K-means clustering algorithm for use in isolated work recognition , 1985, IEEE Trans. Acoust. Speech Signal Process..

[11]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[12]  Kohji Fukunaga,et al.  Introduction to Statistical Pattern Recognition-Second Edition , 1990 .

[13]  Delphine Charlet,et al.  Optimizing feature set for speaker verification , 1997, Pattern Recognit. Lett..

[14]  Sarel van Vuuren,et al.  On the importance of components of the modulation spectrum for speaker verification , 1998, ICSLP.

[15]  Paul Scheunders,et al.  Non-linear dimensionality reduction techniques for unsupervised feature extraction , 1998, Pattern Recognit. Lett..

[16]  M. Sambur Speaker recognition using orthogonal linear prediction , 1975 .

[17]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[18]  M. Sambur,et al.  Selection of acoustic features for speaker identification , 1975 .

[19]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[20]  Sadaoki Furui,et al.  Research of individuality features in speech waves and automatic speaker recognition techniques , 1986, Speech Commun..

[21]  Sankar K. Pal,et al.  Unsupervised feature selection using a neuro-fuzzy approach , 1998, Pattern Recognit. Lett..