Measures on wavelet segmentation of speech

Speech segmentation is used in many speech applications. We propose a new wavelet-based extension of the typical spectrum-based non-uniform speech segmentation methods. The use of wavelets improves computational performance and makes the algorithm parameters easy and flexible to adjust. Segmentation accuracy measures are also introduced and applied in the evaluation.
