Adaptation of Prosodic Phrasing Models

Prosodic phrasing varies considerably between speakers and speech styles. Because obtaining enough data to train a separate model for every variation is slow and costly, it is desirable to develop models that can be adapted to new conditions with a limited amount of training data. We describe a technique for adapting HMM-based phrase boundary prediction models which alters a statistical distribution of prosodic phrase lengths. The adapted models show improved prediction performance across different speakers and types of spoken material.
