Paper information - A multi-lingual system for the determination of phonetic word stress using soft feature selection by neural networks

Any TTS system requires a routine to determine the transcription of out-of-vocabulary (OOV) words. This transcription contains three pieces of information: the phoneme sequence, the positions of the syllable boundaries, and the position of the word stress. In the TTS system "Papageno", the phonemes and syllable boundaries are determined by a neural network proposed in [1]. The same paper also proposed a second network for word stress determination. A similar architecture is used here, enhanced by a diagonal matrix between the input and the hidden layer that is penalised by weight decay. Weight decay is a strategy that limits the growth of a weight unless a larger value is really necessary. It can be used to improve the generalisation ability of the network.
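The diagonal matrix with weight decay described above can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the authors' implementation: the function names, the tanh hidden layer, the softmax output, the squared (L2) form of the penalty, and the values of `decay` and `lr` are all assumptions made for the example.

```python
import numpy as np

def forward(x, d, W1, b1, W2, b2):
    """Forward pass with a diagonal matrix between input and hidden layer.

    d holds the diagonal entries, one per input feature (assumed layout).
    """
    scaled = x * d                       # equivalent to x @ np.diag(d)
    hidden = np.tanh(scaled @ W1 + b1)   # tanh is an assumption
    logits = hidden @ W2 + b2
    logits = logits - logits.max(axis=-1, keepdims=True)  # numerical stability
    exp = np.exp(logits)
    return exp / exp.sum(axis=-1, keepdims=True)          # softmax is an assumption

def loss(x, y_onehot, d, W1, b1, W2, b2, decay=0.01):
    """Cross-entropy plus an L2 weight-decay penalty on the diagonal only."""
    probs = forward(x, d, W1, b1, W2, b2)
    cross_entropy = -np.mean(np.sum(y_onehot * np.log(probs + 1e-12), axis=-1))
    penalty = decay * np.sum(d ** 2)     # pulls unneeded diagonal entries toward 0
    return cross_entropy + penalty

def decay_step(d, grad_d, lr=0.1, decay=0.01):
    """One gradient step on d; grad_d is the data-loss gradient w.r.t. d."""
    return d - lr * (grad_d + 2.0 * decay * d)
```

Under these assumptions, a diagonal entry stays large only if the data-loss gradient keeps pushing against the decay term. Entries for uninformative input features shrink toward zero, which is one way to read the "soft feature selection" in the title.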

Horst-Udo Hain | Hans-Georg Zimmermann

[1] Ralph Neuneier, et al. How to Train Neural Networks, 1996, Neural Networks: Tricks of the Trade.

[2] Gavin Burnage. Celex - a guide for users, 1990.

[3] Horst-Udo Hain. A hybrid approach for grapheme-to-phoneme conversion based on a combination of partial string matching and a neural network, 2000, INTERSPEECH.