Paper information - Do phonetic features help to improve consonant identification in ASR?

The hidden Markov modelling experiments presented in this paper show that consonant identification results can be improved substantially if a neural network is used to extract linguistically relevant information from the acoustic signal before hidden Markov modelling is applied. The neural network (in this case a combination of two Kohonen networks) takes 12 mel-frequency cepstral coefficients, overall energy and the corresponding delta parameters as input and outputs distinctive phonetic features, such as [±uvular] and [±plosive]. Not only does this preprocessing of the data lead to better consonant identification rates; it is also demonstrated that the confusions which occur between the consonants are less severe from a phonetic viewpoint. One reason for the improved consonant identification is that the neural network can map acoustically variable consonant realisations onto identical phonetic features, which makes the input to hidden Markov modelling more homogeneous. Furthermore, by using phonetic features the neural network helps the system to focus on linguistically relevant information in the acoustic signal.
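The preprocessing described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a single one-dimensional Kohonen map instead of the paper's combination of two networks, a simple central-difference delta, and a hypothetical `unit_features` table that assigns a ±1 phonetic feature vector to each map unit. The function names `add_deltas`, `train_som` and `best_matching_units` are likewise invented for illustration.

```python
import numpy as np


def add_deltas(static):
    """Append delta parameters to static frames.

    static: (T, 13) array of 12 MFCCs plus overall energy per frame.
    Returns a (T, 26) array. The central difference used here is an
    assumption; the abstract does not say how deltas were computed.
    """
    padded = np.pad(static, ((1, 1), (0, 0)), mode="edge")
    deltas = (padded[2:] - padded[:-2]) / 2.0
    return np.hstack([static, deltas])


def train_som(frames, n_units=4, epochs=30, seed=0):
    """Train a small 1-D Kohonen map (illustrative settings only)."""
    rng = np.random.default_rng(seed)
    # Initialise unit weights from randomly chosen training frames.
    weights = frames[rng.choice(len(frames), n_units, replace=False)].copy()
    positions = np.arange(n_units)
    for epoch in range(epochs):
        decay = 1.0 - epoch / epochs
        lr = 0.5 * decay                # learning rate shrinks over time
        radius = max(decay, 0.1)        # neighbourhood width shrinks too
        for x in frames[rng.permutation(len(frames))]:
            bmu = np.argmin(np.linalg.norm(weights - x, axis=1))
            influence = np.exp(-((positions - bmu) ** 2) / (2 * radius ** 2))
            weights += lr * influence[:, None] * (x - weights)
    return weights


def best_matching_units(frames, weights):
    """Index of the closest map unit for every frame."""
    dists = np.linalg.norm(frames[:, None, :] - weights[None, :, :], axis=2)
    return dists.argmin(axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    static = rng.normal(size=(50, 13))      # placeholder for real MFCC + energy
    frames = add_deltas(static)             # 26-dimensional network input
    weights = train_som(frames)
    # Hypothetical labelling: columns stand for [±uvular] and [±plosive].
    unit_features = np.array([[1, -1], [1, 1], [-1, 1], [-1, -1]])
    features = unit_features[best_matching_units(frames, weights)]
    print(features.shape)
```

Frames that fall on the same map unit receive the same feature vector, which is the sense in which variable realisations become more homogeneous input for the subsequent hidden Markov modelling stage.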

Jacques C. Koreman | Bistra Andreeva | William J. Barry
