Predicting tonal realizations in one Chinese dialect from another

Abstract: Pronunciation dictionaries are usually expensive and time-consuming to prepare for the computational modeling of human languages, especially when the target language is under-resourced. Northern Chinese dialects are often under-resourced despite being used by a significant number of speakers. They share their basic sound inventories with Standard Chinese (SC), and their words usually share segmental realizations and logographic written forms with their SC translation equivalents. Pronunciation dictionaries for northern Chinese dialects could therefore be obtained readily if the tonal realizations of dialect words could be predicted from the tonal information of their SC counterparts. This paper applies statistical modeling to investigate the tonal aspect of related words in a northern dialect, Jinan Mandarin (JM), and in SC. Multiple linear regression models were built with the between-word pitch distance of JM words as the dependent variable and the following predictors: SC tonal relations, between-dialect tonal identity, and the speakers' individual backgrounds. The results show that tonal relations in SC and between-dialect tonal identity, the predictors that characterize the relation between the JM and SC tonal systems, are significant and robust predictors of JM tonal realizations. The speakers' sociolinguistic and cognitive backgrounds, together with tonal merger and neutral tone information within JM, are also important for predicting JM tonal realizations and affect how the between-language predictors take effect.
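The modeling setup described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not say how pitch distance is computed, so a root-mean-square distance between two time-normalized f0 contours is used here; the predictor coding (`same_sc_tone`, `same_tone_across_dialects`) is hypothetical; and plain ordinary least squares stands in for whatever regression procedure the authors actually used. The toy values are made up and noise-free, so the fit only demonstrates the mechanics and is not a result.

```python
import numpy as np


def pitch_distance(contour_a, contour_b):
    """Root-mean-square distance between two equal-length f0 contours.

    Assumed measure for illustration; the abstract does not define one.
    """
    a = np.asarray(contour_a, dtype=float)
    b = np.asarray(contour_b, dtype=float)
    if a.shape != b.shape:
        raise ValueError("contours must have the same number of samples")
    return float(np.sqrt(np.mean((a - b) ** 2)))


def fit_linear_model(predictors, response):
    """Ordinary least squares with an intercept.

    Returns the coefficients as [intercept, slope_1, slope_2, ...].
    """
    y = np.asarray(response, dtype=float)
    X = np.column_stack([np.ones(len(y)), np.asarray(predictors, dtype=float)])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


# Hypothetical binary predictors for six JM word pairs:
# column 0: same_sc_tone (the two SC counterparts carry the same tone)
# column 1: same_tone_across_dialects (JM and SC tone categories match)
toy_predictors = np.array(
    [[1, 1], [1, 0], [0, 1], [0, 0], [1, 1], [0, 0]], dtype=float
)

# Made-up, noise-free pitch distances generated from known coefficients,
# so that the fit can be checked against them.
toy_distance = 3.0 - 2.0 * toy_predictors[:, 0] - 0.5 * toy_predictors[:, 1]

coefficients = fit_linear_model(toy_predictors, toy_distance)
```

In this toy design, a negative slope for `same_sc_tone` would mean that JM word pairs whose SC counterparts share a tone lie closer together in pitch. Speaker-level background variables could be added as further columns of the predictor matrix in the same way.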
