Rule learning based Chinese prosodic phrase prediction

We describe a rule-learning approach towards Chinese prosodic phrase prediction for TTS systems. 3167 sentences with two-level prosodic phrase labeling information was prepared for analysis. Candidate features related to prosodic phrasing were extracted from the corpus to establish an example database. Based on this, a series of comparative experiments is conducted to collect the most effective features from the candidates. Two typical rule learning algorithms (C4.5 and TBL) were applied on the example database to induce prediction rules. To compare the results with others, the general evaluation parameters were introduced in the paper. With these parameters, the methods were compared to RNN and bigram based methods. Results show that the rule-learning approach introduced here can achieve better prediction accuracy than the nonrule based methods and yet retain the advantage of the simplicity and understandability.

[1]  Wu Hua,et al.  Speech corpus of Chinese discourse and the phonetic research , 2000, INTERSPEECH.

[2]  Paul Taylor,et al.  Assigning phrase breaks from part-of-speech sequences , 1997, Comput. Speech Lang..

[3]  Eric Brill,et al.  Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[4]  Grace Ngai,et al.  Transformation Based Learning in the Fast Lane , 2001, NAACL.

[5]  Julia Hirschberg,et al.  Learning prosodic features using a tree representation , 2001, INTERSPEECH.

[6]  Hu Peng,et al.  Segmenting unrestricted Chinese text into prosodic words instead of lexical words , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[7]  Xiaohua Shi,et al.  An RNN-based algorithm to detect prosodic phrase for Chinese TTS , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[8]  Julia Hirschberg,et al.  Automatic classification of intonational phrase boundaries , 1992 .

[9]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[10]  Steven Abney,et al.  Chunks and Dependencies: Bringing Processing Evidence to Bear on Syntax , 2002 .