Two issues related to phonetic classification are addressed: first, whether there are any advantages in extracting acoustic attributes over directly using the spectral information for classification, and, second, whether it is advantageous to introduce an intermediate set of linguistic units, i.e., distinctive features, for phonetic classification. The authors focused on 13 monophthong vowels in American English, and investigated classification performance using an artificial neural net classifier with nearly 20000 vowel tokens from 550 speakers excised from the TIMIT corpus. The results indicate that acoustic attributes give performance similar to raw spectral information, but at potentially considerable computational savings. In addition, the distinctive feature representation gives similar performance to direct vowel classification, but potentially offers a more flexible mechanism for describing context dependency.<<ETX>>
[1]
James Glass,et al.
Acoustic segmentation and phonetic classification in the SUMMIT system
,
1988,
International Conference on Acoustics, Speech, and Signal Processing,.
[2]
Victor Zue,et al.
A comparative study of acoustic representations of speech for vowel classification using multi-layer perceptrons
,
1990,
ICSLP.
[3]
Stephanie Seneff.
A joint synchrony/mean-rate model of auditory speech processing
,
1990
.
[4]
Victor W. Zue,et al.
Phonetic classification using multi-layer perceptrons
,
1990,
International Conference on Acoustics, Speech, and Signal Processing.