Paper Information - A New Classifier for Speaker Verification Based on the Fractional Brownian Motion Process

This paper presents a novel text-independent verification system for automatic speaker recognition (ASR) based on the fractional Brownian motion (M_dim_fBm). The performance of the proposed M_dim_fBm classifier was compared with that of the GMM (Gaussian mixture model) classifier using mel-cepstral coefficients. The speech database, obtained from fixed and cellular phones, contains utterances from 75 different speakers. The results show that the M_dim_fBm classifier achieves higher recognition accuracy. In addition, the proposed classifier uses a much simpler modeling structure than the GMM.

Rosângela Coelho | Ricardo Sant Ana | Abraham Alcaim
