Paper information - Automatic feature template generation for maximum entropy based intonational phrase break prediction

The prediction of intonational phrase (IP) breaks is important for both the naturalness and intelligibility of Text-to-Speech (TTS) systems. In this paper, we propose a maximum entropy (ME) model to predict IP breaks from unrestricted text, and evaluate various keyword selection approaches in different domains. Furthermore, we design a hierarchical clustering algorithm for automatic generation of feature templates, which minimizes the need for human supervision during ME model training. Results of comparative experiments show that, for the task of IP break prediction, (1) the ME model clearly outperforms classification and regression trees (CART); (2) the log-likelihood ratio is the best scoring measure for keyword selection; and (3) compared with manual templates, the templates automatically generated by our approach greatly improve the F-score of ME-based IP break prediction and significantly reduce the size of the ME model.

You Zhou
