A JAVA interface for speech analysis and segmentation

This paper describes the current state of development of a multi-purpose software tool for speech research. The tool consists of a “visualization front-end”, for displaying and editing the speech signal together with its associated annotations and acoustic features, and a “batch-processing interface”, for applying speech-processing algorithms to a whole database of signals. The software is written mostly in JAVA, but an extension mechanism allows the interface to be integrated with processing techniques implemented in other programming languages. The tool includes an original phone segmentation algorithm, for which new experimental results are reported that demonstrate its robustness to telephone-bandwidth distortions.
