A New Classifier for Speaker Verification Based on the Fractional Brownian Motion Process

A novel text-independent verification system based on the fractional Brownian motion (M_dim_fBm) for automatic speaker recognition (ASR) is presented in this paper. The performance of the proposed M_dim_fBm was compared to those achieved with the GMM (Gaussian Mixture Models) classifier using the mel-cepstral coefficients. We have used a speech database – obtained from fixed and cellular phones – uttered by 75 different speakers. The results have shown the superior performance of the M_dim_fBm classifier in terms of recognition accuracy. In addition, the proposed classifier employs a much simpler modeling structure as compared to the GMM.

[1]  Patrice Abry,et al.  A Wavelet-Based Joint Estimator of the Parameters of Long-Range Dependence , 1999, IEEE Trans. Inf. Theory.

[2]  Y. Hashimoto,et al.  Pattern recognition of fruit shape based on the concept of chaos and neural networks , 2000 .

[3]  Jan Beran,et al.  Statistics for long-memory processes , 1994 .

[4]  J. Echauz,et al.  Fractal dimension characterizes seizure onset in epileptic patients , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[5]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[6]  Alvin F. Martin,et al.  The DET curve in assessment of detection task performance , 1997, EUROSPEECH.

[7]  H. E. Hurst,et al.  Long-Term Storage Capacity of Reservoirs , 1951 .

[8]  Heinz-Otto Peitgen,et al.  The science of fractal images , 2011 .

[9]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[10]  Douglas A. Reynolds,et al.  Experimental evaluation of features for robust speaker identification , 1994, IEEE Trans. Speech Audio Process..

[11]  Dante Augusto Couto Barone,et al.  Fractal dimension applied to speaker identification , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[12]  Douglas A. Reynolds,et al.  Integrated models of signal and background with application to speaker identification in noise , 1994, IEEE Trans. Speech Audio Process..