Modeling the Perceptual Learning of Novel Dialect Features

Modeling the Perceptual Learning of Novel Dialect Features

[1]  D. Pisoni,et al.  Speech Perception as a Talker-Contingent Process , 1993, Psychological science.

[2]  Natalie Schilling-Estes,et al.  American English: Dialects and Variation , 1998 .

[3]  Geoffrey Stewart Morrison,et al.  Vowel Inherent Spectral Change , 2013 .

[4]  K. Hornik,et al.  party : A Laboratory for Recursive Partytioning , 2009 .

[5]  P. Bertelson,et al.  Visual Recalibration of Auditory Speech Identification , 2003, Psychological science.

[6]  J. Turner,et al.  The significance of the social identity concept for social psychology with reference to individualism, interactionism and social influence , 1986 .

[7]  Jack Grieve,et al.  Regional Variation in Written American English , 2016 .

[8]  Maryam Najafian,et al.  Acoustic model selection using limited data for accent robust speech recognition , 2014, 2014 22nd European Signal Processing Conference (EUSIPCO).

[9]  L. Nygaard,et al.  Perceptual learning of systematic variation in Spanish-accented speech. , 2009, The Journal of the Acoustical Society of America.

[10]  Lin-Shan Lee,et al.  Rapid speaker adaptation using a priori knowledge by eigenspace analysis of MLLR parameters , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[11]  C. Clopper Effects of dialect variation on speeded word classification , 2007 .

[12]  Yonghong Yan,et al.  Discriminative Pronunciation Modeling Using the MPE Criterion , 2015, IEICE Trans. Inf. Syst..

[13]  Nancy Niedzielski,et al.  The Effect of Social Information on the Perception of Sociolinguistic Variables , 1999 .

[14]  Tyler Kendall,et al.  Variation in perception and production of mid front vowels in the U.S. Southern Vowel Shift , 2012, J. Phonetics.

[15]  Rosina Lippi English with an Accent: Language, Ideology and Discrimination in the United States , 1997 .

[16]  S. J. Young,et al.  Tree-based state tying for high accuracy acoustic modelling , 1994 .

[17]  Lior Shamir,et al.  Assessing the efficacy of benchmarks for automatic speech accent recognition , 2015, EAI Endorsed Trans. Creative Technol..

[18]  Nikolas Coupland,et al.  What is Sociolinguistic Theory , 1998 .

[19]  Tessa Bent,et al.  Perceptual adaptation to non-native speech , 2008, Cognition.

[20]  Sophie Dufour,et al.  Behavioral and electrophysiological evidence for the impact of regional variation on phoneme perception , 2009, Cognition.

[21]  Suzanne Romaine One Speaker, Two Languages: Cross-Disciplinary Perspectives on Code-Switching , 1997 .

[22]  Intergroup Dynamics in Speech Perception: Interaction Among Experience, Attitudes and Expectations , 2016 .

[23]  D. Pisoni,et al.  Talker-specific learning in speech perception , 1998, Perception & psychophysics.

[24]  John H. L. Hansen,et al.  Perceptual Recognition Cues in Native English Accent Variation: "Listener Accent, Perceived Accent, and Comprehension" , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[25]  Samantha A. Lyle Dialect Variation in Stop Consonant Voicing , 2008 .

[26]  Susannah V. Levi,et al.  Individual Differences in Learning Talker Categories: The Role of Working Memory , 2015, Phonetica.

[27]  J. McQueen,et al.  The specificity of perceptual learning in speech processing , 2005, Perception & psychophysics.

[28]  Manuel Díaz-Campos,et al.  Perceptual Categorization of Dialect Variation in Spanish , 2009 .

[29]  Jeremy Goslin,et al.  Does a regional accent perturb speech processing? , 2006, Journal of experimental psychology. Human perception and performance.

[30]  Odette Scharenborg,et al.  Parallels between HSR and ASR: how ASR can contribute to HSR , 2005, INTERSPEECH.

[31]  마이클 데이서,et al.  Improving speech recognition of mobile devices , 2003 .

[32]  Guy Bailey,et al.  Some aspects of African-American vernacular English phonology , 2013 .

[33]  Stephen J. Cox,et al.  Unsupervised model selection for recognition of regional accented speech , 2014, INTERSPEECH.

[34]  C. Fought,et al.  Chicano English in Context , 2002 .

[35]  Daniel Jurafsky,et al.  Which words are hard to recognize? Prosodic, lexical, and disfluency factors that increase speech recognition error rates , 2010, Speech Commun..

[36]  J. Pierrehumbert,et al.  Social Salience Discriminates Learnability of Contextual Cues in an Artificial Language , 2017, Front. Psychol..

[37]  M. P. Gelfer,et al.  The relative contributions of speaking fundamental frequency and formant frequencies to gender identification based on isolated vowels. , 2005, Journal of voice : official journal of the Voice Foundation.

[38]  Terrin N. Tamati,et al.  Lexical neighborhoods and phonological confusability in cross-dialect word recognition in noise , 2010 .

[39]  Lesley Milroy,et al.  Language and social networks , 1980 .

[40]  P. Iverson,et al.  Vowel normalization for accent: an investigation of best exemplar locations in northern and southern British English sentences. , 2004, The Journal of the Acoustical Society of America.

[41]  Jonas Lööf,et al.  Speaker Adaptation using Maximum Likelihood Linear Regression , 2005 .

[42]  Chin-Hui Lee,et al.  Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..

[43]  Christian Koops,et al.  University of Pennsylvania Working Papers in Linguistics , 2022 .

[44]  J. Nycz Changing words or changing rules? Second dialect acquisition and phonological representation , 2013 .

[45]  James D. Harnsberger,et al.  The perception of Malayalam nasal consonants by Marathi, Punjabi, Tamil, Oriya, Bengali, and American English listeners: A multidimensional scaling analysis , 2001, J. Phonetics.

[46]  Matthew J. Gordon,et al.  Small-Town Values and Big-City Vowels: A Study of the Northern Cities Shift in Michigan , 2000 .

[47]  P. Kuhl Human adults and human infants show a “perceptual magnet effect” for the prototypes of speech categories, monkeys do not , 1991, Perception & psychophysics.

[48]  Wei Li Social meaning in linguistic structure: code- switching in Norway , 2003 .

[49]  W. Baker,et al.  DIALECT IDENTIFICATION: THE EFFECTS OF REGION OF ORIGIN AND AMOUNT OF EXPERIENCE , 2009 .

[50]  Stephane Champely,et al.  Basic Functions for Power Analysis , 2015 .

[51]  Sanjeev Khudanpur,et al.  Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[52]  D. Nguyen Text as social and cultural data : a computational perspective on variation in text , 2017 .

[53]  Dave F. Kleinschmidt,et al.  Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel. , 2015, Psychological review.

[54]  Jean Vroomen,et al.  Phonetic recalibration only occurs in speech mode , 2009, Cognition.

[55]  G. A. Miller,et al.  The intelligibility of speech as a function of the context of the test materials. , 1951, Journal of experimental psychology.

[56]  D. Pisoni,et al.  Training Japanese listeners to identify English /r/ and /l/. II: The role of phonetic environment and talker variability in learning new perceptual categories. , 1993, The Journal of the Acoustical Society of America.

[57]  Mathias Scharinger,et al.  You had me at “Hello”: Rapid extraction of dialect information from spoken words , 2011, NeuroImage.

[58]  D. Childers,et al.  Gender recognition from speech. Part I: Coarse analysis. , 1991, The Journal of the Acoustical Society of America.

[59]  Julia Hirschberg,et al.  Automatic Dialect and Accent Recognition and its Application to Speech Recognition , 2011 .

[60]  Hank Liao,et al.  Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[61]  Some acoustic cues for categorizing American English regional dialects , 2001 .

[62]  P. Iverson,et al.  Plasticity in vowel perception and production: a study of accent change in young adults. , 2007, The Journal of the Acoustical Society of America.

[63]  Joseph Picone,et al.  Voice across America: Toward robust speaker-independent speech recognition for telecommunications applications , 1991, Digit. Signal Process..

[64]  Cynthia G. Clopper,et al.  Perception of Dialect Variation in Noise: Intelligibility and Classification , 2008, Language and speech.

[65]  Pedro J. Moreno,et al.  Multi-Dialectical Languages Effect on Speech Recognition: Too Much Choice Can Hurt , 2015, ICNLSP.

[66]  W. Labov The social motivation of a sound change , 1963 .

[67]  D. Pisoni,et al.  Training Japanese listeners to identify English /r/ and /l/: IV. Some effects of perceptual learning on speech production. , 1997, The Journal of the Acoustical Society of America.

[68]  M Sawalha,et al.  The effects of speakers' gender, age, and region on overall performance of Arabic automatic speech recognition systems using the phonetically rich and balanced Modern Standard Arabic speech corpus , 2013 .

[69]  A cross dialect study of vowel perception in Standard Indonesian , 1984 .

[70]  Keith B. Hall,et al.  Geo-location for voice search language modeling , 2015, INTERSPEECH.

[71]  Jay J. Van Bavel,et al.  Perceiving the World Through Group-Colored Glasses: A Perceptual Model of Intergroup Relations , 2016 .

[72]  A. Samuel,et al.  Perceptual learning for speech , 2009, Attention, perception & psychophysics.

[73]  H. Giles,et al.  Accommodation theory: Communication, context, and consequence. , 1991 .

[74]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[75]  Karen E. Pollock,et al.  Regional Variations in the Phonological Characteristics of African American Vernacular English , 2000 .

[76]  J. Hay,et al.  Stuffed toys and speech perception , 2010 .

[77]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[78]  E. Gibson,et al.  Principles of Perceptual Learning and Development , 1973 .

[79]  Brendan T. O'Connor,et al.  Demographic Dialectal Variation in Social Media: A Case Study of African-American English , 2016, EMNLP.

[80]  C. Davies Language and identity in discourse in the American South: Sociolinguistic repertoire as expressive resource in the presentation of self , 2007 .

[81]  Mari Ostendorf,et al.  ATAROS Technical Report 1: Corpus collection and initial task validation , 2014 .

[82]  Ronald A. Cole,et al.  New telephone speech corpora at CSLU , 1995, EUROSPEECH.

[83]  Dominic Telaar,et al.  Accent- and speaker-specific polyphone decision trees for non-native speech recognition , 2013, INTERSPEECH.

[84]  Elizabeth A. Strand,et al.  Auditory–visual integration of talker gender in vowel perception , 1999 .

[85]  Hsin-Min Wang,et al.  Eigenspace-based maximum a posteriori linear regression for rapid speaker adaptation , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[86]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[87]  Erwan Pépiot Male and female speech: a study of mean f0, f0 range, phonation type and speech rate in Parisian French and American English speakers , 2014 .

[88]  David Bowie The effect of geographic mobility on the retention of a local dialect , 2000 .

[89]  Marzena Karpinska,et al.  Vowel perception by listeners from different English dialects , 2015, ICPhS.

[90]  C. Clopper,et al.  Effects of dialect on vowel acoustics and intelligibility , 2013, Journal of the International Phonetic Association.

[91]  Matthew H. Davis,et al.  Lexical information drives perceptual learning of distorted speech: evidence from the comprehension of noise-vocoded sentences. , 2005, Journal of experimental psychology. General.

[92]  Philip C. Woodland,et al.  Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[93]  L. Bernstein,et al.  Psychophysics of the McGurk and other audiovisual speech integration effects. , 2011, Journal of experimental psychology. Human perception and performance.

[94]  P. Kellman,et al.  Perceptual Learning, Cognition, and Expertise , 2013 .

[95]  Robert L. Goldstone,et al.  Definition , 1960, A Philosopher Looks at Sport.

[96]  On the role of vowel duration in the New Zealand English front vowel shift , 2009, Language Variation and Change.

[97]  P. Foulkes Exploring social-indexical knowledge: A long past but a short history , 2010 .

[98]  Francoise Beaufays,et al.  Google Search by Voice: A Case Study , 2010 .

[99]  R. Montemayor Jocks and Burnouts: Social Categories and Identity in the High School. , 1990 .

[100]  Geoffrey Stewart Morrison Theories of Vowel Inherent Spectral Change , 2013 .

[101]  Russell S. Kirby,et al.  The Atlas of North American English: Phonetics, Phonology and Sound Change. A Multimedia Reference Tool , 2007 .

[102]  J. Meers The acquisition of front rounded and nasalized vowels of French by native speakers of English , 2009 .

[103]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[104]  C. Cutler Yorkville Crossing: White teens, hip hop and African American English , 1999 .

[105]  T. M. Nearey Phonetic feature systems for vowels , 1978 .

[106]  Fernando Peñalosa Chicano sociolinguistics, a brief introduction , 1981 .

[107]  Kevin B. McGowan Social Expectation Improves Speech Perception in Noise , 2015, Language and speech.

[108]  Jean Carletta,et al.  The AMI meeting corpus , 2005 .

[109]  Katherine S White,et al.  Adaptation to novel accents by toddlers. , 2011, Developmental science.

[110]  Stephen Cox,et al.  A comparison of two unsupervised approaches to accent identification , 1998, ICSLP.

[111]  Sarah C. Creel,et al.  How Talker Identity Relates to Language Processing , 2011, Lang. Linguistics Compass.

[112]  D. Pisoni,et al.  Effects of cross-language voice training on speech perception: whose familiar voices are more intelligible? , 2011, The Journal of the Acoustical Society of America.

[113]  Alfred Mertins,et al.  Automatic speech recognition and speech variability: A review , 2007, Speech Commun..

[114]  Roland Kuhn,et al.  Rapid speaker adaptation in eigenvoice space , 2000, IEEE Trans. Speech Audio Process..

[115]  Katie Drager,et al.  Sociophonetic Variation in Speech Perception , 2010, Lang. Linguistics Compass.

[116]  Jorge Proença,et al.  Automatic Annotation of Disfluent Speech in Children's Reading Tasks , 2016, IberSPEECH.

[117]  Carla Teixeira Lopes,et al.  TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .

[118]  Sid-Ahmed Selouani,et al.  Speaker-independent ASR for Modern Standard Arabic: effect of regional accents , 2012, International Journal of Speech Technology.

[119]  N. Coupland,et al.  Ideologised values for British accents , 2007 .

[120]  Cynthia G. Clopper,et al.  Effects of Lexical Competition and Dialect Exposure on Phonological Priming , 2017, Language and speech.

[121]  Julia Hirschberg,et al.  Prosodic and other cues to speech recognition failures , 2004, Speech Commun..

[122]  Maja Pantic,et al.  Discrimination Between Native and Non-Native Speech Using Visual Features Only , 2016, IEEE Transactions on Cybernetics.

[123]  J. Rickford,et al.  African American Vernacular English: Features, Evolution, Educational Implications , 1999 .

[124]  J. Hay,et al.  Congruence between ‘word age’ and ‘voice age’ facilitates lexical access , 2011 .

[125]  Ye-Yi Wang,et al.  Is word error rate a good indicator for spoken language understanding accuracy , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[126]  D. Dahan,et al.  Talker adaptation in speech perception: Adjusting the signal or the representations? , 2008, Cognition.

[127]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[128]  Achim Zeileis,et al.  Partykit: a modular toolkit for recursive partytioning in R , 2015, J. Mach. Learn. Res..

[129]  C. Best,et al.  Discrimination of non-native consonant contrasts varying in perceptual assimilation to the listener's native phonological system. , 2001, The Journal of the Acoustical Society of America.

[130]  Daniel Lawrence Limited evidence for social priming in the perception of the BATH and STRUT vowels , 2015, ICPhS.

[131]  D. Pisoni,et al.  Training Japanese listeners to identify English /r/ and /l/: a first report. , 1991, The Journal of the Acoustical Society of America.

[132]  Geoffrey Zweig,et al.  The microsoft 2016 conversational speech recognition system , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[133]  Penelope Eckert,et al.  Where do ethnolects stop? , 2008 .

[134]  Christian Koops,et al.  The effect of perceived speaker age on the perception of PIN and PEN vowels in Houston, Texas , 2008 .

[135]  Jian Yang,et al.  Non-native speech recognition based on speaker adaptation , 2010, 2010 Sixth International Conference on Natural Computation.

[136]  John H. L. Hansen,et al.  Unsupervised Discriminative Training With Application to Dialect Classification , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[137]  Yifan Gong,et al.  Geo-location dependent deep neural network acoustic model for speech recognition , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[138]  John H. L. Hansen,et al.  Automatic Accent Assessment Using Phonetic Mismatch and Human Perception , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[139]  Margaret Maclagan,et al.  Getting fed up with our feet: Contrast maintenance and the New Zealand English “short” front vowel shift , 2007, Language Variation and Change.

[140]  P. Eckert The whole woman: Sex and gender differences in variation , 1989, Language Variation and Change.

[141]  John C. Wells Accents of English 3: Preface , 1982 .

[142]  Joris Pelemans,et al.  Automatic assessment of children's reading with the FLaVoR decoding using a phone confusion model , 2014, INTERSPEECH.

[143]  Catherine I. Watson,et al.  Mappings between vocal tract area functions, vocal tract resonances and speech formants for multiple speakers , 2014, INTERSPEECH.

[144]  Ronald A. Cole,et al.  Selective adaptation of English consonants using real speech , 1975 .

[145]  Kenny Smith,et al.  Acquiring variation in an artificial language: Children and adults are sensitive to socially conditioned linguistic variation , 2017, Cognitive Psychology.

[146]  R. Kominski,et al.  Language Use in the United States: 2007 , 2010 .

[147]  Jonathan Harrington,et al.  Acoustic evidence for vowel change in New Zealand English , 2000, Language Variation and Change.

[148]  Philip C. Woodland,et al.  Using accent-specific pronunciation modelling for robust speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[149]  P. Warren,et al.  Short-term Exposure to One Dialect Affects Processing of Another , 2010, Language and speech.

[150]  K. Hornik,et al.  Unbiased Recursive Partitioning: A Conditional Inference Framework , 2006 .

[151]  Tyler Kendall,et al.  Exploring the relationship between production and perception in the mid front vowels of U.S. English , 2012 .

[152]  R. Wright Phonetically Based Phonology: A review of perceptual cues and cue robustness , 2004 .

[153]  P. Trudgill Sex, covert prestige and linguistic change in the urban British English of Norwich , 1972, Language in Society.

[154]  Cynthia G. Clopper,et al.  Homebodies and army brats: Some effects of early linguistic experience and residential history on dialect categorization , 2004, Language Variation and Change.