论文信息 - Modeling the Perceptual Learning of Novel Dialect Features - 字舞流文

Modeling the Perceptual Learning of Novel Dialect Features

Modeling the Perceptual Learning of Novel Dialect Features

Rachael Tatman | Rachael Tatman

[1] D. Pisoni,et al. Speech Perception as a Talker-Contingent Process , 1993, Psychological science.

[2] Natalie Schilling-Estes,et al. American English: Dialects and Variation , 1998 .

[3] Geoffrey Stewart Morrison,et al. Vowel Inherent Spectral Change , 2013 .

[4] K. Hornik,et al. party : A Laboratory for Recursive Partytioning , 2009 .

[5] P. Bertelson,et al. Visual Recalibration of Auditory Speech Identification , 2003, Psychological science.

[6] J. Turner,et al. The significance of the social identity concept for social psychology with reference to individualism, interactionism and social influence , 1986 .

[7] Jack Grieve,et al. Regional Variation in Written American English , 2016 .

[8] Maryam Najafian,et al. Acoustic model selection using limited data for accent robust speech recognition , 2014, 2014 22nd European Signal Processing Conference (EUSIPCO).

[9] L. Nygaard,et al. Perceptual learning of systematic variation in Spanish-accented speech. , 2009, The Journal of the Acoustical Society of America.

[10] Lin-Shan Lee,et al. Rapid speaker adaptation using a priori knowledge by eigenspace analysis of MLLR parameters , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[11] C. Clopper. Effects of dialect variation on speeded word classification , 2007 .

[12] Yonghong Yan,et al. Discriminative Pronunciation Modeling Using the MPE Criterion , 2015, IEICE Trans. Inf. Syst..

[13] Nancy Niedzielski,et al. The Effect of Social Information on the Perception of Sociolinguistic Variables , 1999 .

[14] Tyler Kendall,et al. Variation in perception and production of mid front vowels in the U.S. Southern Vowel Shift , 2012, J. Phonetics.

[15] Rosina Lippi. English with an Accent: Language, Ideology and Discrimination in the United States , 1997 .

[16] S. J. Young,et al. Tree-based state tying for high accuracy acoustic modelling , 1994 .

[17] Lior Shamir,et al. Assessing the efficacy of benchmarks for automatic speech accent recognition , 2015, EAI Endorsed Trans. Creative Technol..

[18] Nikolas Coupland,et al. What is Sociolinguistic Theory , 1998 .

[19] Tessa Bent,et al. Perceptual adaptation to non-native speech , 2008, Cognition.

[20] Sophie Dufour,et al. Behavioral and electrophysiological evidence for the impact of regional variation on phoneme perception , 2009, Cognition.

[21] Suzanne Romaine. One Speaker, Two Languages: Cross-Disciplinary Perspectives on Code-Switching , 1997 .

[22] Intergroup Dynamics in Speech Perception: Interaction Among Experience, Attitudes and Expectations , 2016 .

[23] D. Pisoni,et al. Talker-specific learning in speech perception , 1998, Perception & psychophysics.

[24] John H. L. Hansen,et al. Perceptual Recognition Cues in Native English Accent Variation: "Listener Accent, Perceived Accent, and Comprehension" , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[25] Samantha A. Lyle. Dialect Variation in Stop Consonant Voicing , 2008 .

[26] Susannah V. Levi,et al. Individual Differences in Learning Talker Categories: The Role of Working Memory , 2015, Phonetica.

[27] J. McQueen,et al. The specificity of perceptual learning in speech processing , 2005, Perception & psychophysics.

[28] Manuel Díaz-Campos,et al. Perceptual Categorization of Dialect Variation in Spanish , 2009 .

[29] Jeremy Goslin,et al. Does a regional accent perturb speech processing? , 2006, Journal of experimental psychology. Human perception and performance.

[30] Odette Scharenborg,et al. Parallels between HSR and ASR: how ASR can contribute to HSR , 2005, INTERSPEECH.

[31] 마이클 데이서,et al. Improving speech recognition of mobile devices , 2003 .

[32] Guy Bailey,et al. Some aspects of African-American vernacular English phonology , 2013 .

[33] Stephen J. Cox,et al. Unsupervised model selection for recognition of regional accented speech , 2014, INTERSPEECH.

[34] C. Fought,et al. Chicano English in Context , 2002 .

[35] Daniel Jurafsky,et al. Which words are hard to recognize? Prosodic, lexical, and disfluency factors that increase speech recognition error rates , 2010, Speech Commun..

[36] J. Pierrehumbert,et al. Social Salience Discriminates Learnability of Contextual Cues in an Artificial Language , 2017, Front. Psychol..

[37] M. P. Gelfer,et al. The relative contributions of speaking fundamental frequency and formant frequencies to gender identification based on isolated vowels. , 2005, Journal of voice : official journal of the Voice Foundation.

[38] Terrin N. Tamati,et al. Lexical neighborhoods and phonological confusability in cross-dialect word recognition in noise , 2010 .

[39] Lesley Milroy,et al. Language and social networks , 1980 .

[40] P. Iverson,et al. Vowel normalization for accent: an investigation of best exemplar locations in northern and southern British English sentences. , 2004, The Journal of the Acoustical Society of America.

[41] Jonas Lööf,et al. Speaker Adaptation using Maximum Likelihood Linear Regression , 2005 .

[42] Chin-Hui Lee,et al. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..

[43] Christian Koops,et al. University of Pennsylvania Working Papers in Linguistics , 2022 .

[44] J. Nycz. Changing words or changing rules? Second dialect acquisition and phonological representation , 2013 .

[45] James D. Harnsberger,et al. The perception of Malayalam nasal consonants by Marathi, Punjabi, Tamil, Oriya, Bengali, and American English listeners: A multidimensional scaling analysis , 2001, J. Phonetics.

[46] Matthew J. Gordon,et al. Small-Town Values and Big-City Vowels: A Study of the Northern Cities Shift in Michigan , 2000 .

[47] P. Kuhl. Human adults and human infants show a “perceptual magnet effect” for the prototypes of speech categories, monkeys do not , 1991, Perception & psychophysics.

[48] Wei Li. Social meaning in linguistic structure: code- switching in Norway , 2003 .

[49] W. Baker,et al. DIALECT IDENTIFICATION: THE EFFECTS OF REGION OF ORIGIN AND AMOUNT OF EXPERIENCE , 2009 .

[50] Stephane Champely,et al. Basic Functions for Power Analysis , 2015 .

[51] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[52] D. Nguyen. Text as social and cultural data : a computational perspective on variation in text , 2017 .

[53] Dave F. Kleinschmidt,et al. Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel. , 2015, Psychological review.

[54] Jean Vroomen,et al. Phonetic recalibration only occurs in speech mode , 2009, Cognition.

[55] G. A. Miller,et al. The intelligibility of speech as a function of the context of the test materials. , 1951, Journal of experimental psychology.

[56] D. Pisoni,et al. Training Japanese listeners to identify English /r/ and /l/. II: The role of phonetic environment and talker variability in learning new perceptual categories. , 1993, The Journal of the Acoustical Society of America.

[57] Mathias Scharinger,et al. You had me at “Hello”: Rapid extraction of dialect information from spoken words , 2011, NeuroImage.

[58] D. Childers,et al. Gender recognition from speech. Part I: Coarse analysis. , 1991, The Journal of the Acoustical Society of America.

[59] Julia Hirschberg,et al. Automatic Dialect and Accent Recognition and its Application to Speech Recognition , 2011 .

[60] Hank Liao,et al. Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[61] Some acoustic cues for categorizing American English regional dialects , 2001 .

[62] P. Iverson,et al. Plasticity in vowel perception and production: a study of accent change in young adults. , 2007, The Journal of the Acoustical Society of America.

[63] Joseph Picone,et al. Voice across America: Toward robust speaker-independent speech recognition for telecommunications applications , 1991, Digit. Signal Process..

[64] Cynthia G. Clopper,et al. Perception of Dialect Variation in Noise: Intelligibility and Classification , 2008, Language and speech.

[65] Pedro J. Moreno,et al. Multi-Dialectical Languages Effect on Speech Recognition: Too Much Choice Can Hurt , 2015, ICNLSP.

[66] W. Labov. The social motivation of a sound change , 1963 .

[67] D. Pisoni,et al. Training Japanese listeners to identify English /r/ and /l/: IV. Some effects of perceptual learning on speech production. , 1997, The Journal of the Acoustical Society of America.

[68] M Sawalha,et al. The effects of speakers' gender, age, and region on overall performance of Arabic automatic speech recognition systems using the phonetically rich and balanced Modern Standard Arabic speech corpus , 2013 .

[69] A cross dialect study of vowel perception in Standard Indonesian , 1984 .

[70] Keith B. Hall,et al. Geo-location for voice search language modeling , 2015, INTERSPEECH.

[71] Jay J. Van Bavel,et al. Perceiving the World Through Group-Colored Glasses: A Perceptual Model of Intergroup Relations , 2016 .

[72] A. Samuel,et al. Perceptual learning for speech , 2009, Attention, perception & psychophysics.

[73] H. Giles,et al. Accommodation theory: Communication, context, and consequence. , 1991 .

[74] J. R. Landis,et al. The measurement of observer agreement for categorical data. , 1977, Biometrics.

[75] Karen E. Pollock,et al. Regional Variations in the Phonological Characteristics of African American Vernacular English , 2000 .

[76] J. Hay,et al. Stuffed toys and speech perception , 2010 .

[77] Jacob Cohen. A Coefficient of Agreement for Nominal Scales , 1960 .

[78] E. Gibson,et al. Principles of Perceptual Learning and Development , 1973 .

[79] Brendan T. O'Connor,et al. Demographic Dialectal Variation in Social Media: A Case Study of African-American English , 2016, EMNLP.

[80] C. Davies. Language and identity in discourse in the American South: Sociolinguistic repertoire as expressive resource in the presentation of self , 2007 .

[81] Mari Ostendorf,et al. ATAROS Technical Report 1: Corpus collection and initial task validation , 2014 .

[82] Ronald A. Cole,et al. New telephone speech corpora at CSLU , 1995, EUROSPEECH.

[83] Dominic Telaar,et al. Accent- and speaker-specific polyphone decision trees for non-native speech recognition , 2013, INTERSPEECH.

[84] Elizabeth A. Strand,et al. Auditory–visual integration of talker gender in vowel perception , 1999 .

[85] Hsin-Min Wang,et al. Eigenspace-based maximum a posteriori linear regression for rapid speaker adaptation , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[86] B. Everitt,et al. Statistical methods for rates and proportions , 1973 .

[87] Erwan Pépiot. Male and female speech: a study of mean f0, f0 range, phonation type and speech rate in Parisian French and American English speakers , 2014 .

[88] David Bowie. The effect of geographic mobility on the retention of a local dialect , 2000 .

[89] Marzena Karpinska,et al. Vowel perception by listeners from different English dialects , 2015, ICPhS.

[90] C. Clopper,et al. Effects of dialect on vowel acoustics and intelligibility , 2013, Journal of the International Phonetic Association.

[91] Matthew H. Davis,et al. Lexical information drives perceptual learning of distorted speech: evidence from the comprehension of noise-vocoded sentences. , 2005, Journal of experimental psychology. General.

[92] Philip C. Woodland,et al. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[93] L. Bernstein,et al. Psychophysics of the McGurk and other audiovisual speech integration effects. , 2011, Journal of experimental psychology. Human perception and performance.

[94] P. Kellman,et al. Perceptual Learning, Cognition, and Expertise , 2013 .

[95] Robert L. Goldstone,et al. Definition , 1960, A Philosopher Looks at Sport.

[96] On the role of vowel duration in the New Zealand English front vowel shift , 2009, Language Variation and Change.

[97] P. Foulkes. Exploring social-indexical knowledge: A long past but a short history , 2010 .

[98] Francoise Beaufays,et al. Google Search by Voice: A Case Study , 2010 .

[99] R. Montemayor. Jocks and Burnouts: Social Categories and Identity in the High School. , 1990 .

[100] Geoffrey Stewart Morrison. Theories of Vowel Inherent Spectral Change , 2013 .

[101] Russell S. Kirby,et al. The Atlas of North American English: Phonetics, Phonology and Sound Change. A Multimedia Reference Tool , 2007 .

[102] J. Meers. The acquisition of front rounded and nasalized vowels of French by native speakers of English , 2009 .

[103] Paul Boersma,et al. Praat, a system for doing phonetics by computer , 2002 .

[104] C. Cutler. Yorkville Crossing: White teens, hip hop and African American English , 1999 .

[105] T. M. Nearey. Phonetic feature systems for vowels , 1978 .

[106] Fernando Peñalosa. Chicano sociolinguistics, a brief introduction , 1981 .

[107] Kevin B. McGowan. Social Expectation Improves Speech Perception in Noise , 2015, Language and speech.

[108] Jean Carletta,et al. The AMI meeting corpus , 2005 .

[109] Katherine S White,et al. Adaptation to novel accents by toddlers. , 2011, Developmental science.

[110] Stephen Cox,et al. A comparison of two unsupervised approaches to accent identification , 1998, ICSLP.

[111] Sarah C. Creel,et al. How Talker Identity Relates to Language Processing , 2011, Lang. Linguistics Compass.

[112] D. Pisoni,et al. Effects of cross-language voice training on speech perception: whose familiar voices are more intelligible? , 2011, The Journal of the Acoustical Society of America.

[113] Alfred Mertins,et al. Automatic speech recognition and speech variability: A review , 2007, Speech Commun..

[114] Roland Kuhn,et al. Rapid speaker adaptation in eigenvoice space , 2000, IEEE Trans. Speech Audio Process..

[115] Katie Drager,et al. Sociophonetic Variation in Speech Perception , 2010, Lang. Linguistics Compass.

[116] Jorge Proença,et al. Automatic Annotation of Disfluent Speech in Children's Reading Tasks , 2016, IberSPEECH.

[117] Carla Teixeira Lopes,et al. TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .

[118] Sid-Ahmed Selouani,et al. Speaker-independent ASR for Modern Standard Arabic: effect of regional accents , 2012, International Journal of Speech Technology.

[119] N. Coupland,et al. Ideologised values for British accents , 2007 .

[120] Cynthia G. Clopper,et al. Effects of Lexical Competition and Dialect Exposure on Phonological Priming , 2017, Language and speech.

[121] Julia Hirschberg,et al. Prosodic and other cues to speech recognition failures , 2004, Speech Commun..

[122] Maja Pantic,et al. Discrimination Between Native and Non-Native Speech Using Visual Features Only , 2016, IEEE Transactions on Cybernetics.

[123] J. Rickford,et al. African American Vernacular English: Features, Evolution, Educational Implications , 1999 .

[124] J. Hay,et al. Congruence between ‘word age’ and ‘voice age’ facilitates lexical access , 2011 .

[125] Ye-Yi Wang,et al. Is word error rate a good indicator for spoken language understanding accuracy , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[126] D. Dahan,et al. Talker adaptation in speech perception: Adjusting the signal or the representations? , 2008, Cognition.

[127] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[128] Achim Zeileis,et al. Partykit: a modular toolkit for recursive partytioning in R , 2015, J. Mach. Learn. Res..

[129] C. Best,et al. Discrimination of non-native consonant contrasts varying in perceptual assimilation to the listener's native phonological system. , 2001, The Journal of the Acoustical Society of America.

[130] Daniel Lawrence. Limited evidence for social priming in the perception of the BATH and STRUT vowels , 2015, ICPhS.

[131] D. Pisoni,et al. Training Japanese listeners to identify English /r/ and /l/: a first report. , 1991, The Journal of the Acoustical Society of America.

[132] Geoffrey Zweig,et al. The microsoft 2016 conversational speech recognition system , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[133] Penelope Eckert,et al. Where do ethnolects stop? , 2008 .

[134] Christian Koops,et al. The effect of perceived speaker age on the perception of PIN and PEN vowels in Houston, Texas , 2008 .

[135] Jian Yang,et al. Non-native speech recognition based on speaker adaptation , 2010, 2010 Sixth International Conference on Natural Computation.

[136] John H. L. Hansen,et al. Unsupervised Discriminative Training With Application to Dialect Classification , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[137] Yifan Gong,et al. Geo-location dependent deep neural network acoustic model for speech recognition , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[138] John H. L. Hansen,et al. Automatic Accent Assessment Using Phonetic Mismatch and Human Perception , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[139] Margaret Maclagan,et al. Getting fed up with our feet: Contrast maintenance and the New Zealand English “short” front vowel shift , 2007, Language Variation and Change.

[140] P. Eckert. The whole woman: Sex and gender differences in variation , 1989, Language Variation and Change.

[141] John C. Wells. Accents of English 3: Preface , 1982 .

[142] Joris Pelemans,et al. Automatic assessment of children's reading with the FLaVoR decoding using a phone confusion model , 2014, INTERSPEECH.

[143] Catherine I. Watson,et al. Mappings between vocal tract area functions, vocal tract resonances and speech formants for multiple speakers , 2014, INTERSPEECH.

[144] Ronald A. Cole,et al. Selective adaptation of English consonants using real speech , 1975 .

[145] Kenny Smith,et al. Acquiring variation in an artificial language: Children and adults are sensitive to socially conditioned linguistic variation , 2017, Cognitive Psychology.

[146] R. Kominski,et al. Language Use in the United States: 2007 , 2010 .

[147] Jonathan Harrington,et al. Acoustic evidence for vowel change in New Zealand English , 2000, Language Variation and Change.

[148] Philip C. Woodland,et al. Using accent-specific pronunciation modelling for robust speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[149] P. Warren,et al. Short-term Exposure to One Dialect Affects Processing of Another , 2010, Language and speech.

[150] K. Hornik,et al. Unbiased Recursive Partitioning: A Conditional Inference Framework , 2006 .

[151] Tyler Kendall,et al. Exploring the relationship between production and perception in the mid front vowels of U.S. English , 2012 .

[152] R. Wright. Phonetically Based Phonology: A review of perceptual cues and cue robustness , 2004 .

[153] P. Trudgill. Sex, covert prestige and linguistic change in the urban British English of Norwich , 1972, Language in Society.

[154] Cynthia G. Clopper,et al. Homebodies and army brats: Some effects of early linguistic experience and residential history on dialect categorization , 2004, Language Variation and Change.