A Cognitively Grounded Measure of Pronunciation Distance

In this study we develop pronunciation distances based on naive discriminative learning (NDL). Measures of pronunciation distance are used in several subfields of linguistics, including psycholinguistics, dialectology and typology. In contrast to the commonly used Levenshtein algorithm, NDL is grounded in cognitive theory of competitive reinforcement learning and is able to generate asymmetrical pronunciation distances. In a first study, we validated the NDL-based pronunciation distances by comparing them to a large set of native-likeness ratings given by native American English speakers when presented with accented English speech. In a second study, the NDL-based pronunciation distances were validated on the basis of perceptual dialect distances of Norwegian speakers. Results indicated that the NDL-based pronunciation distances matched perceptual distances reasonably well with correlations ranging between 0.7 and 0.8. While the correlations were comparable to those obtained using the Levenshtein distance, the NDL-based approach is more flexible as it is also able to incorporate acoustic information other than sound segments.

[1]  Eliza Margaretha,et al.  Inducing a measure of phonetic similarity from pronunciation variation , 2012, J. Phonetics.

[2]  Brett Kessler,et al.  Computational dialectology in Irish Gaelic , 1995, EACL.

[3]  J. Nerbonne,et al.  Inducing a measure of phonetic similarity from dialect variation , 2011 .

[4]  D. Danks Equilibria of the Rescorla--Wagner model , 2003 .

[5]  Nathan C. Sanders,et al.  Phonological Distance Measures* , 2009, J. Quant. Linguistics.

[6]  Wilbert Heeringa,et al.  Measuring Dialect Differences , 2009 .

[7]  W. Heeringa,et al.  Predicting intelligibility and perceived linguistic distance by means of the Levenshtein algorithm , 2008 .

[8]  L. Allan,et al.  The widespread influence of the Rescorla-Wagner model , 1996, Psychonomic bulletin & review.

[9]  W. Labov Principles of Linguistic Change: Cognitive and Cultural Factors , 2010 .

[10]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[11]  Melody Dye,et al.  The Enigma of Number: Why Children Find the Meanings of Even Small Number Words Hard to Learn and How We Can Help Them Do Better , 2011, PloS one.

[12]  Dušica Filipović Đurđević,et al.  An amorphous model for morphological processing in visual comprehension based on naive discriminative learning. , 2011, Psychological review.

[13]  John Nerbonne,et al.  An Aggregate Analysis of Pronunciation in the Goeman-Taeldeman-van Reenen-Project Data , 2007 .

[14]  Martijn Wieling,et al.  Dialect Pronunciation Comparison and Spoken Word Recognition , 2007 .

[15]  Stewart M. McCauley,et al.  Error and expectation in language learning: The curious absence of mouses in adult speech , 2013 .

[16]  Angelika Braun,et al.  The Use of the Almeida-Braun System in the Measurement of Dutch Dialect Distances , 2003, Comput. Humanit..

[17]  Steven H. Weinberger,et al.  The Speech Accent Archive: towards a typology of English accents , 2011 .

[18]  R. Baayen,et al.  Quantitative Social Dialectology: Explaining Linguistic Variation Geographically and Socially , 2011, PloS one.

[19]  David Birdsong,et al.  Degree of foreign accent in English sentences produced by Korean children and adults , 2006, J. Phonetics.

[20]  Cecil H. Brown,et al.  Adding typology to lexicostatistics: A combined approach to language classification , 2009 .

[21]  John Nerbonne,et al.  Linguistic advergence and divergence in north-western Catalan: A dialectometric investigation of dialect leveling and border effects , 2013, Lit. Linguistic Comput..

[22]  W. Heeringa,et al.  Evaluation of String Distance Algorithms for Dialectology , 2006 .

[23]  W. Labov Principles Of Linguistic Change , 1994 .

[24]  John Nerbonne,et al.  Measuring Dialect Distance Phonetically , 1997, SIGMORPHON@EACL.

[25]  Sjef Barbiers English and Dutch as SOV-Languages and the Distribution of CP-Complements , 1998 .

[26]  Eric W. Holman,et al.  Evaluating linguistic distance measures , 2010 .

[27]  R. Rescorla,et al.  A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .

[28]  John Nerbonne,et al.  Evaluating the Pairwise String Alignment of Pronunciations , 2009, LaTeCH - SHELT&R@EACL.

[29]  W. Heeringa,et al.  Perceptive evaluation of Levenshtein dialect distance measurements using Norwegian dialect data , 2004, Language Variation and Change.

[30]  Melody Dye,et al.  The Effects of Feature-Label-Order and Their Implications for Symbolic Learning , 2010, Cogn. Sci..

[31]  Wilbert Jan Heeringa Measuring dialect pronunciation differences using Levenshtein distance , 2004 .