An articulatory-functional approach to modeling Persian focus prosody

This paper is an attempt to test PENTA, an articulatory-functional model, on Persian focus prosody. The test was done on a corpus consisting of utterances with different focus conditions using PENTAtrainer2, a trainable prosody synthesizer that optimizes categorical pitch targets each corresponding to multiple communicative functions. The evaluation was done by comparing the F0 contours generated by the extracted pitch targets to those of natural utterances through numerical and perceptual evaluations. The numerical results showed that the synthesized F0 was close to the natural contour in terms of RMSE (= 1.94) and Pearson’s r (= 0.84). Perceptual evaluation showed that the rate of focus identification and naturalness judgement by native Persian listeners were highly similar between synthetic and natural F0 contours.

[1]  S. Jun,et al.  Prosodic typology : the phonology of intonation and phrasing , 2014 .

[2]  Yi Xu,et al.  The Perception of Prosodic Focus in Persian , 2014 .

[3]  Santitham Prom-on,et al.  Toward invariant functional representations of variable surface fundamental frequency contours: Synthesizing speech melody via model-based stochastic learning , 2014, Speech Commun..

[4]  Yi Xu,et al.  Post-focus Compression: Cross-linguistic Distribution and Historical Origin , 2011, ICPhS.

[5]  Behzad Mahjani An Instrumental Study of Prosodic Features and Intonation in Modern Farsi (Persian) , 2003 .

[6]  Yi Xu,et al.  Modeling Japanese F0 contours using the PENTAtrainers and AMtrainer , 2014 .

[7]  Gilbert Lazard Grammaire du persan contemporain , 1958 .

[8]  C. Gussenhoven,et al.  The Persian pitch accent and its retention after the focus , 2012 .

[9]  Arsalan Kahnemuyipour,et al.  Syntactic Categories and Persian Stress , 2003 .

[10]  C. A. Ferguson Word Stress in Persian , 1957 .

[11]  Emily Q. Wang,et al.  Pitch targets and their realization: Evidence from Mandarin Chinese , 2001, Speech Commun..

[12]  Yi Xu,et al.  Speech melody as articulatorily implemented communicative functions , 2005, Speech Commun..

[13]  Santitham Prom-on,et al.  Modeling tone and intonation in Mandarin and English as a process of target approximation. , 2009, The Journal of the Acoustical Society of America.

[14]  Yi Xu,et al.  Phonetic Realization of Prosodic Focus in Persian , 2012 .

[15]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .