Acoustic parameters for place of articulation identification and classification of Spanish unvoiced stops

The analysis of the acoustic parameters which best summarize the cues to phone discrimination for the language under consideration should be a previous step in acoustic-phonetic decoding, regardless of the methodology to be used. The Spanish language has not been widely analyzed from this point of view. This work deals with the acoustic discrimination of Spanish stop consonants. Our main goal was to find a reliable and reduced set of parameters for place of articulation identification of Spanish unvoiced stops. On the basis of the obtained parameters, two automatic classifiers were developed and tested. Only the acoustic features of the burst segment, automatically segmented from the speech waveform, were considered in the parameter estimation. The analysis of these features was carried out in both the time and frequency domains over a CV context corpus uttered by 6 speakers. In the first case, the classifier was designed as a procedural form. Alternatively, in the second case a statistical classifier was obtained from a previous automatic discriminant analysis of the parameters. Both classifiers were tested over a CV context corpus uttered by 40 new speakers not included in the analysis corpus, which resulted in a good rate of identification.

[1]  Yoshua Bengio,et al.  Phonetically motivated acoustic parameters for continuous speech recognition using artificial neural networks , 1991, Speech Commun..

[2]  D Kewley-Port,et al.  Time-varying features as correlates of place of articulation in stop consonants. , 1983, The Journal of the Acoustical Society of America.

[3]  F. Casacuberta,et al.  Linguistic decoding of Spanish continuous speech with hidden Markov models , 1994 .

[4]  B H Repp,et al.  Perception of intervocalic stop consonants: the contributions of closure duration and formant transitions. , 1983, The Journal of the Acoustical Society of America.

[5]  S. Blumstein,et al.  The role of the gross spectral shape as a perceptual cue to place articulation in initial stop consonants. , 1982, The Journal of the Acoustical Society of America.

[6]  S. Blumstein,et al.  Acoustic invariance in speech production: evidence from measurements of the spectral characteristics of stop consonants. , 1979, The Journal of the Acoustical Society of America.

[7]  M. Inés Torres,et al.  Acoustic-phonetic decoding of Spanish occlusive consonants , 1993, EUROSPEECH.

[8]  A. Quilis Fonética acústica de la lengua española , 1981 .

[9]  D. Kewley-Port Measurement of formant transitions in naturally produced stop consonant-vowel syllables. , 1982, The Journal of the Acoustical Society of America.

[10]  Gunnar Fant,et al.  Speech sounds and features , 1973 .

[11]  Hidefumi Kobatake,et al.  Spectral transition dynamics of voiceless stop consonants , 1987 .

[12]  Hermann Ney,et al.  Prototype systems for large-vocabulary speech recognition: polyglot and spicos , 1991, EUROSPEECH.

[13]  B H Repp,et al.  Acoustic properties and perception of stop consonant release transients. , 1989, The Journal of the Acoustical Society of America.

[14]  J. Mariani,et al.  Recent advances in speech processing , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[15]  S. Roucos,et al.  Acoustic-phonetic decoding of speech , 1988 .

[16]  Krishna S. Nathan Comparison of formant transition based stop classifiers: time-varying and time-invariant signal models , 1991, EUROSPEECH.

[17]  Heinrich Niemann,et al.  Recent Advances in Speech Understanding and Dialog Systems , 2012, NATO ASI Series.

[18]  M. Jack,et al.  Globally optimising formant tracker using generalised centroids , 1987 .

[19]  Lou Boves,et al.  Knowledge-based phoneme recognition , 1991, EUROSPEECH.

[20]  Victor Zue,et al.  Selecting acoustic features for stop consonant identification , 1983, ICASSP.

[21]  M. Inés Torres,et al.  Acoustic-Phonetic Decoding of Spanish Continuous Speech , 1994, Int. J. Pattern Recognit. Artif. Intell..

[22]  Chin-Hui Lee,et al.  Acoustic modeling for large vocabulary speech recognition , 1990 .

[23]  L. Pols,et al.  Plosive consonant identification in ambiguous sentences , 1985 .

[24]  Pietro Laface,et al.  Selection of speech units for a speaker-independent CSR task , 1991, EUROSPEECH.

[25]  D B Pisoni,et al.  Perception of static and dynamic acoustic cues to place of articulation in initial stop consonants. , 1983, The Journal of the Acoustical Society of America.