Marginal Contrast Among Romanian Vowels: Evidence from ASR and Functional Load

This work quantifies the phonological contrast between the Romanian central vowels [2] and [1], which are considered separate phonemes, although they are historical allophones with few minimal pairs. We consider the vowels’ functional load within the Romanian inventory and the usefulness of the contrast for automatic speech recognition (ASR). Using a 7 hour corpus of automatically aligned broadcast speech, the relative frequencies of vowels are compared across phonological contexts. Results indicate a near complementary distribution of [2] and [1]: the contrast scores lowest of all pairwise comparisons on measures of functional load, and shows the highest Kullback-Leibler divergence, suggesting that few lexical distinctions depend on the contrast. Thereafter, forced alignment is performed using an existing ASR system. The system selects among [1], [2], ∅ for lexical /1/, testing for its reduction in continuous speech. The same data is transcribed using the ASR system where [2]/[1] are merged, testing the hypothesis that loss of a marginal contrast has little impact on ASR error rates. Both results are consistent with functional load calculations, indicating that the /2/ /1/ contrast is lexically and phonetically weak. These results show how automatic transcription tools can help test phonological predictions using continuous speech.

[1]  James Emil Flege,et al.  Effects of Spanish use on the production of Catalan vowels by early Spanish-Catalan bilinguals , 2015 .

[2]  C. F. Hockett THE QUANTIFICATION OF FUNCTIONAL LOAD--A LINGUISTIC PROBLEM. , 1966 .

[3]  Lori Lamel,et al.  Development of a speech-to-text transcription system for Finnish , 2010, SLTU.

[4]  Lori Lamel,et al.  Pronunciation Variants Across Systems, Languages and Speaking Style , 2007 .

[5]  Jean-Luc Gauvain,et al.  Lightly supervised and unsupervised acoustic model training , 2002, Comput. Speech Lang..

[6]  D. Surendran,et al.  Articulatory complexity, ambient frequency, and functional load as predictors of consonant development in children. , 2005, Journal of speech, language, and hearing research : JSLHR.

[7]  Kathleen Currie Hall,et al.  A typology of intermediate phonological relationships , 2013 .

[8]  Margaret E. L. Renwick The Phonetics and Phonology of Contrast: The Case of the Romanian Vowel System , 2014 .

[9]  Kathleen Currie Hall,et al.  A Probabilistic Model of Phonological Relationships from Contrast to Allophony. , 2009 .

[10]  Lori Lamel,et al.  PRONUNCIATION VARIANTS IN FRENCH : SCHWA & LIAISON , 1999 .

[11]  José Ignacio Hualde Quasi-Phonemic Contrasts in Spanish * , 2006 .

[12]  J. Scobbie,et al.  Quasi-phonemic contrast and the fuzzy inventory: examples from Scottish English , 2008 .

[13]  Jong Kyoung Kim,et al.  Speech recognition , 1983, 1983 IEEE International Solid-State Circuits Conference. Digest of Technical Papers.

[14]  Scott A. Jackson,et al.  High functional load inhibits phonological contrast loss: A corpus study , 2013, Cognition.

[15]  Hans Uszkoreit,et al.  The Romanian Language in the Digital Age , 2012 .

[16]  Wayne H. Ward,et al.  Speech recognition , 1997 .

[17]  Mark J. F. Gales,et al.  Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..

[18]  P. Niyogi,et al.  Quantifying the functional load of phonemic oppositions, distinctive features, and suprasegmentals , 2006 .

[19]  E. Hume,et al.  The Impact of Allophony versus Contrast on Speech Perception , 2006 .

[20]  Ioana Chitoran,et al.  The Phonology of Romanian: A Constraint-Based Approach , 2001 .

[21]  François Pellegrino,et al.  Cross-language comparison of functional load for vowels, consonants, and tones , 2013, INTERSPEECH.

[22]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[23]  Marianna Nadeu,et al.  Variation in the lexical distribution and implementation of phonetically similar phonemes in Catalan , 2016, J. Phonetics.

[24]  Lori Lamel,et al.  Exploring pronunciation variants for Romanian speech-to-text transcription , 2014, SLTU.