An acoustic-phonetic feature-based system for the automatic recognition of fricative consonants

The acoustic-phonetic characteristics and the automatic recognition of the American English fricatives are investigated. The acoustic features that exist in the literature are evaluated and new features are proposed. To test the value of the extracted features, a knowledge-based acoustic-phonetic system for the automatic recognition of fricatives, in speaker independent continuous speech, is proposed. The system uses an auditory-based front-end processing and incorporates new algorithms for the extraction and manipulation of the acoustic-phonetic features that proved to be rich in their information content. Several features, which describe the relative amplitude, location of the most dominant peak, spectral shape and duration of unvoiced portion, are combined in the recognition process. A recognition accuracy of 95% for voicing detection and 93% for place of articulation detection are obtained for TIMIT database continuous speech of 22 speakers from 5 different dialect regions.

[1]  S. Seneff A joint synchrony/mean-rate model of auditory speech processing , 1990 .

[2]  R. N. Ohde,et al.  Effect of relative amplitude of frication on perception of place of articulation. , 1991, The Journal of the Acoustical Society of America.

[3]  K. Harris Cues for the Discrimination of American English Fricatives in Spoken Syllables , 1958 .

[4]  S R Baum,et al.  Preliminary observations on the use of duration as a cue to syllable-initial fricative consonant voicing in English. , 1987, The Journal of the Acoustical Society of America.

[5]  Jorge A. Gurlekian,et al.  Recognition of the Spanish fricatives /s/ and /f/ , 1981 .

[6]  R. Cole,et al.  Perception of voicing in English affricates and fricatives. , 1975, The Journal of the Acoustical Society of America.

[7]  J R Cohen,et al.  Application of an auditory model to speech recognition. , 1989, The Journal of the Acoustical Society of America.

[8]  G. W. Hughes,et al.  Spectral Properties of Fricative Consonants , 1956 .

[9]  George P. McCasland Noise intensity and spectrum cues of spoken fricatives , 1979 .

[10]  Les E. Atlas,et al.  The challenge of spoken language systems: research directions for the nineties , 1995, IEEE Trans. Speech Audio Process..

[11]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[12]  A. M. D. de Manrique,et al.  Acoustic analysis and perception of Spanish fricative consonants. , 1981, The Journal of the Acoustical Society of America.

[13]  A. Jongman Duration of frication noise required for identification of English fricatives. , 1989, The Journal of the Acoustical Society of America.

[14]  S. Blumstein,et al.  Acoustic characteristics of English voiceless fricatives: a descriptive analysis , 1988 .

[15]  J. Gurlekian On the recognition of the Spanish fricatives /s/ and /f/ , 1979 .

[16]  S. Blumstein,et al.  Acoustic and perceptual characteristics of voicing in fricatives and fricative clusters. , 1992, The Journal of the Acoustical Society of America.

[17]  S. Blumstein,et al.  On the role of the amplitude of the fricative noise in the perception of place of articulation in voiceless fricative consonants. , 1988, The Journal of the Acoustical Society of America.

[18]  K. Stevens,et al.  On the Properties of Voiceless Fricative Consonants , 1961 .

[19]  Stephanie Seneff A joint synchrony/mean-rate model of auditory speech processing , 1990 .