论文信息 - Improving homograph disambiguation with supervised machine learning

Improving homograph disambiguation with supervised machine learning

We describe a pre-existing rule-based homograph disambiguation system used for text-to-speech synthesis at Google, and compare it to a novel system which performs disambiguation using classifiers trained on a small amount of labeled data. An evaluation of these systems, using a new, freely available English data set, finds that hybrid systems (making use of both rules and machine learning) are significantly more accurate than either hand-written rules or machine learning alone. The evaluation also finds minimal performance degradation when the hybrid system is configured to run on limited-resource mobile devices rather than on production servers. The two best systems described here are used for homograph disambiguation on all US English text-to-speech traffic at Google.

[1] Daniela Braga,et al. Homograph ambiguity resolution in front-end design for portuguese TTS systems , 2007, INTERSPEECH.

[2] Ryan Doherty,et al. Semi-supervised Word Sense Disambiguation with Neural Models , 2016, COLING.

[3] Kenneth Ward Church,et al. Using bilingual materials to develop word sense disambiguation methods , 1992, TMI.

[4] Joseph P. Olive,et al. Text-to-speech synthesis , 1995, AT&T Technical Journal.

[5] David Yarowsky,et al. A corpus-based synthesizer , 1992, ICSLP.

[6] Marti A. Hearst. Noun Homograph Disambiguation Using Local Context in Large Text Corpora , 1991 .

[7] David Yarowsky,et al. Homograph Disambiguation in Text-to-Speech Synthesis , 1997 .

[8] Richard Sproat,et al. Applications of maximum entropy rankers to problems in spoken language processing , 2014, INTERSPEECH.

[9] Christopher D. Manning,et al. Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[10] Richard Sproat,et al. Russian Stress Prediction using Maximum Entropy Ranking , 2013, EMNLP.

[11] Virach Sornlertlamvanich,et al. A Context-Sensitive Homograph Disambiguation in Thai Text-to-Speech Synthesis , 2003, NAACL.

[12] Richard Sproat,et al. The Kestrel TTS text normalization system , 2014, Natural Language Engineering.

[13] Shankar Kumar,et al. Normalization of non-standard words , 2001, Comput. Speech Lang..