Methods for pronunciation assessment in computer aided language learning

Learning a foreign language is a challenging endeavor that entails acquiring a wide range of new knowledge including words, grammar, gestures, sounds, etc. Mastering these skills all require extensive practice by the learner and opportunities may not always be available. Computer Aided Language Learning (CALL) systems provide non-threatening environments where foreign language skills can be practiced where ever and whenever a student desires. These systems often have several technologies to identify the different types of errors made by a student. This thesis focuses on the problem of identifying mispronunciations made by a foreign language student using a CALL system. We make several assumptions about the nature of the learning activity: it takes place using a dialogue system, it is a task- or game-oriented activity, the student should not be interrupted by the pronunciation feedback system, and that the goal of the feedback system is to identify severe mispronunciations with high reliability. Detecting mispronunciations requires a corpus of speech with human judgements of pronunciation quality. Typical approaches to collecting such a corpus use an expert phonetician to both phonetically transcribe and assign judgements of quality to each phone in a corpus. This is time consuming and expensive. It also places an extra burden on the transcriber. We describe a novel method for obtaining phone level judgements of pronunciation quality by utilizing non-expert, crowd-sourced, word level judgements of pronunciation. Foreign language learners typically exhibit high variation and pronunciation shapes distinct from native speakers that make analysis for mispronunciation difficult. We detail a simple, but effective method for transforming the vowel space of non-native speakers to make mispronunciation detection more robust and accurate. We show that this transformation not only enhances performance on a simple classification task, but also results in distributions that can be better exploited for mispronunciation detection. This transformation of the vowel is exploited to train a mispronunciation detector using a variety of features derived from acoustic model scores and vowel class distributions. We confirm that the transformation technique results in a more robust and accurate identification of mispronunciations than traditional acoustic models. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)

[1]  Francis Destombes,et al.  The Development and Application of the IBM Speech Viewer , 1993 .

[2]  Mervyn A. Jack,et al.  SPELL: An automated system for computer-aided pronunciation teaching , 1993, Speech Commun..

[3]  Vikas Sindhwani,et al.  Data Quality from Crowdsourcing: A Study of Annotation Selection Criteria , 2009, HLT-NAACL 2009.

[4]  Douglas Morgenstern The Athena Language Learning Project. , 1986 .

[5]  Gregor Möhler,et al.  Intonational Foreign Accent : Speech Technology and Foreign Language Teaching , 1998 .

[6]  Stacy Marsella,et al.  The DARWARS Tactical Language Training System , 2004 .

[7]  Allan R. James,et al.  Second Language Speech , 1995 .

[8]  Harry S. Wohlert German by Satellite , 1991 .

[9]  Robert C. Gardner,et al.  Language Anxiety: Its Relationship to Other Anxieties and to Processing in Native and Second Languages* , 1991 .

[10]  Robert S. Hart The Illinois PLATO Foreign Languages Project , 2013 .

[11]  Helmer Strik,et al.  Feedback in computer assisted pronunciation training: technology push or demand pull? , 2002, INTERSPEECH.

[12]  Brian A Vander Schee Crowdsourcing: Why the Power of the Crowd Is Driving the Future of Business , 2009 .

[13]  Lou Boves,et al.  Using likelihood ratios to perform utterance verification in automatic pronunciation assessment , 1999, EUROSPEECH.

[14]  Marjorie Bingham Wesche Communicative Testing in a Second Language , 1983 .

[15]  James P. Pusack,et al.  DASHER: An Answer Processor for Language Study , 1986 .

[16]  Charles W. Stansfield,et al.  An Evaluation of Simulated Oral Proficiency Interviews as Measures of Spoken Language Proficiency. , 1990 .

[17]  Philippe Martin WinPitch LTL II, a multimodal pronunciation software , 2004 .

[18]  Seok-Chae Rhee,et al.  Development of the knowledge-based spoken English evaluation system and its application , 2004, INTERSPEECH.

[19]  D. Pisoni,et al.  Training Japanese listeners to identify English /r/ and /l/: IV. Some effects of perceptual learning on speech production. , 1997, The Journal of the Acoustical Society of America.

[20]  P. MacIntyre,et al.  The Subtle Effects of Language Anxiety on Cognitive Processing in the Second Language , 1994 .

[21]  J. Ross Quinlan,et al.  Learning decision tree classifiers , 1996, CSUR.

[22]  Keikichi Hirose,et al.  Improved structure-based automatic estimation of pronunciation proficiency , 2009, SLaTE.

[23]  Maxine Eskenazi,et al.  Using a Computer in Foreign Language Pronunciation Training: What Advantages? , 1999 .

[24]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[25]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[26]  Bonnie Adair-Hauck,et al.  Evaluating the Integration of Technology and Second Language Learning. , 2013 .

[27]  Tien-Lok Jonathan Lau SLLS: An Online Conversational Spoken Language Learning System , 2003 .

[28]  Eric Moulines,et al.  Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..

[29]  Xiaoming Xi,et al.  Automatic scoring of non-native spontaneous speech in tests of spoken English , 2009, Speech Commun..

[30]  Alexander I. Rudnicky,et al.  Ravenclaw: dialog management using hierarchical task decomposition and an expectation agenda , 2003, INTERSPEECH.

[31]  Lou Boves,et al.  Assessment of dutch pronunciation by means of automatic speech recognition technology , 1998, ICSLP.

[32]  Philip Hubbard,et al.  A Survey of Unanswered Questions in CALL , 2003 .

[33]  Yoonji Kim,et al.  Attention to Critical Acoustic Features for L 2 Phonemic Identification and its Implication on L 2 Perceptual Training , 2010 .

[34]  Craig Chaudron Progress in Language Classroom Research: Evidence from The Modern Language Journal, 1916‐2000 , 2001 .

[35]  Jared Bernstein,et al.  Development and validation of an automatic spoken Spanish test , 2006 .

[36]  Gérard Bailly,et al.  Visual articulatory feedback for phonetic correction in second language learning , 2010 .

[37]  Joost van Doremalen,et al.  DISCO: development and integration of speech technology into courseware for language learning , 2008, INTERSPEECH.

[38]  Ravi Purushotma,et al.  Commentary: You're Not Studying, You're Just... , 2005 .

[39]  C. Habel,et al.  Language , 1931, NeuroImage.

[40]  Keikichi Hirose,et al.  Structural representation of pronunciation and its application for classifying Japanese learners of English , 2007, SLaTE.

[41]  Tatsuya Kawahara,et al.  Automatic pronunciation error detection and guidance for foreign language learning , 1998, ICSLP.

[42]  M. Witt,et al.  Performance Measures for Phone-level Pronunciation Teaching in Call , 1998 .

[43]  H. Brown,et al.  Principles of Language Learning and Teaching , 1980 .

[44]  Lou Boves,et al.  Different aspects of expert pronunciation quality ratings and their relation to scores produced by speech recognition algorithms , 2000, Speech Commun..

[45]  Rod Ellis,et al.  Task-based Language Learning and Teaching , 2003 .

[46]  Mark Aronoff,et al.  Contemporary linguistics: An introduction , 1989 .

[47]  Yonghong Yan,et al.  Mandarin vowel pronunciation quality evaluation by a novel formant classification method and its combination with traditional algorithms , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[48]  Michael Higgins,et al.  Training English pronunciation for Japanese learners of English online , 2005 .

[49]  Yusuke Kondo,et al.  Bridging the Gap between L 2 Research and Classroom Practice ( 2 ) : Evaluation of Automatic Scoring System for L 2 Speech , 2010 .

[50]  堀 智子 Exploring shadowing as a method of English pronunciation training , 2008 .

[51]  Ian McGraw,et al.  A self-transcribing speech corpus: collecting continuous speech with an online educational game , 2009, SLaTE.

[52]  Horacio Franco,et al.  Automatic detection of mispronunciation for language instruction , 1997, EUROSPEECH.

[53]  Satoru Fukayama,et al.  Lexical Tones Learning with Automatic Music Composition System Considering Prosody of Mandarin Chinese , 2010 .

[54]  Chris Callison-Burch,et al.  Fast, Cheap, and Creative: Evaluating Translation Quality Using Amazon’s Mechanical Turk , 2009, EMNLP.

[55]  Eric Moulines,et al.  Non-parametric techniques for pitch-scale and time-scale modification of speech , 1995, Speech Commun..

[56]  Mei-Yuh Hwang,et al.  An Overview of the SPHINX-II Speech Recognition System , 1993, HLT.

[57]  Stephanie Seneff,et al.  Towards Automatic Tone Correction in Non-native Mandarin , 2006, ISCSLP.

[58]  Chris Callison-Burch,et al.  Using Mechanical Turk to Build Machine Translation Evaluation Sets , 2010, Mturk@HLT-NAACL.

[59]  Joyce W. Nutta Is Computer-Based Grammar Instruction as Effective as Teacher- Directed Grammar Instruction for Teaching L2 Structures? , 2013 .

[60]  Edward Vockell,et al.  The computer in the foreign language curriculum , 1988 .

[61]  Stephen Cox Speaker normalization in the MFCC domain , 2000, INTERSPEECH.

[62]  D. J. Young Creating a Low‐Anxiety Classroom Environment: What Does Language Anxiety Research Suggest? , 1991 .

[63]  Tracey M. Derwing,et al.  Foreign Accent, Comprehensibility, and Intelligibility in the Speech of Second Language Learners , 1995 .

[64]  Tracey M. Derwing,et al.  The Effects of Pronunciation Instruction on the Accuracy, Fluency, and Complexity of L2 Accented Speech. , 2003 .

[65]  Nobuaki Minematsu Yet another acoustic representation of speech sounds , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[66]  Krystyna A. Wachowicz,et al.  Software That Listens: It's Not a Question of Whether, It's a Question of How , 1999 .

[67]  W. Lewis Johnson,et al.  Tactical Language and Culture Training Systems: Using Artificial Intelligence to Teach Foreign Languages and Cultures , 2008, AAAI.

[68]  Helmer Strik,et al.  Automatic evaluation of Dutch pronunciation by using speech recognition technology , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[69]  P. Skehan Task-based instruction , 2003, Language Teaching.

[70]  Carsten Roever,et al.  WEB-BASED LANGUAGE TESTING , 2001 .

[71]  Jean W. LeLoup,et al.  ON THE NET |Interactive and Multimedia Techniques in Online Language Lessons: A Sampler , 2003 .

[72]  Joost van Doremalen,et al.  Optimizing non-native speech recognition for CALL applications , 2009, INTERSPEECH.

[73]  Stephanie Seneff,et al.  Annotation and features of non-native Mandarin tone quality , 2009, INTERSPEECH.

[74]  Maxine Eskénazi,et al.  Non-Native Users in the Let’s Go!! Spoken Dialogue System: Dealing with Linguistic Mismatch , 2004, NAACL.

[75]  Diane Kewley-Port,et al.  Explicit Pronunciation Training Using Automatic Speech Recognition Technology , 2013 .

[76]  N. Garrett Technology in the Service of Language Learning: Trends and Issues , 1991 .

[77]  Stephanie Seneff,et al.  Immersive second language acquisition in narrow domains: a prototype ISLAND dialogue system , 2007, SLaTE.

[78]  B. Planken,et al.  AGE AND ULTIMATE ATTAINMENT IN THE PRONUNCIATION OF A FOREIGN LANGUAGE , 1997, Studies in Second Language Acquisition.

[79]  R. Lyster,et al.  CORRECTIVE FEEDBACK AND LEARNER UPTAKE , 1997, Studies in Second Language Acquisition.

[80]  P. Rosenbaum The Computer as a Learning Environment for Foreign Language Instruction , 1969 .

[81]  Stephanie Seneff,et al.  An interactive English pronunciation dictionary for Korean learners , 2004, INTERSPEECH.

[82]  Wai Kit Lo,et al.  Implementation of an extended recognition network for mispronunciation detection and diagnosis in computer-assisted pronunciation training , 2009, SLaTE.

[83]  John R. Allen,et al.  Individualizing Foreign Language Instruction with Computers at Dartmouth. , 1972 .

[84]  Jean Ann,et al.  Obstruent voicing and devoicing in the English of Cantonese speakers from Hong Kong , 2004 .

[85]  Jiyou Jia,et al.  CSIEC (computer simulator in educational communication): a virtual context-adaptive chatting partner for foreign language learners , 2004, IEEE International Conference on Advanced Learning Technologies, 2004. Proceedings..

[86]  Lyle F. Bachman 语言测试要略 = Fundamental considerations in language testing , 1990 .

[87]  Raúl Rojas,et al.  Neural Networks - A Systematic Introduction , 1996 .

[88]  W. Lambert,et al.  Motivational variables in second-language acquisition. , 1959, Canadian journal of psychology.

[89]  John H. Underwood Linguistics, Computers, and the Language Teacher: A Communicative Approach , 1984 .

[90]  Helmer Strik,et al.  Automatic pronunciation error detection: an acoustic-phonetic approach , 2004 .

[91]  Yonghong Yan,et al.  An SVM-Based Mandarin Pronunciation Quality Assessment System , 2009, ISNN.

[92]  John B. Carroll,et al.  The Prediction of Success in Intensive Foreign Language Training. , 1964 .

[93]  Tatsuya Kawahara,et al.  Practical Use of Autonomous English Pronunciation Learning System for Japanese Students , 2004 .

[94]  Jyh-Shing Roger Jang,et al.  Automatic pronunciation assessment for Mandarin Chinese , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[95]  Maxine Eskénazi,et al.  LET's GO: improving spoken dialog systems for the elderly and non-natives , 2003, INTERSPEECH.

[96]  David Galloway,et al.  The case for dynamic exercise systems in language learning , 2008 .

[97]  Thomas A. Boyle,et al.  Computer Mediated Testing: A Branched Program Achievement Test. , 1976 .

[98]  Steve J. Young,et al.  Language learning based on non-native speech recognition , 1997, EUROSPEECH.

[99]  Helmer Strik,et al.  Automatic pronunciation grading for Dutch , 1998 .

[100]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[101]  E. Horwitz,et al.  Foreign Language Classroom Anxiety , 1986 .

[102]  Stephanie Seneff,et al.  Web-based dialogue and translation games for spoken language learning , 2007, SLaTE.

[103]  Stephanie Seneff,et al.  Mandarin Tone Acquisition through Typed Interactions , 2004 .

[104]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[105]  J. Kenworthy Teaching English Pronunciation , 1987 .

[106]  Amir Najmi,et al.  Subarashii: Encounters in Japanese Spoken Language Education , 1999 .

[107]  Weichao Chen,et al.  Motivate the Learners to Practice English through Playing with Chatbot CSIEC , 2008, Edutainment.

[108]  Tracey M. Derwing,et al.  PRONUNCIATION INSTRUCTION FOR “FOSSILIZED” LEARNERS: CAN IT HELP? , 1998 .

[109]  E. N. Adams,et al.  Pilot Study of a CAI Laboratory in German , 1968 .

[110]  Ronald C. Turner CARLOS: Computer-Assisted Instruction in Spanish. , 1970 .

[111]  Helmer Strik,et al.  Segmental errors in Dutch as a second language: How to establish priorities for CAPT , 2004 .

[112]  Ricardo Gutierrez-Osuna,et al.  Foreign accent conversion in computer assisted pronunciation training , 2009, Speech Commun..

[113]  S. Griffis EDITOR , 1997, Journal of Navigation.

[114]  Randy L Diehl,et al.  Acoustic and auditory phonetics: the adaptive design of speech sound systems , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[115]  P. Mermelstein,et al.  Distance measures for speech recognition, psychological and instrumental , 1976 .

[116]  Xiaoming Xi,et al.  SpeechraterTM: a construct-driven approach to scoring spontaneous non-native speech , 2007, SLaTE.

[117]  Silke M. Witt,et al.  Use of speech recognition in computer-assisted language learning , 2000 .

[118]  Yuen Yee Lo,et al.  Deriving salient learners’ mispronunciations from cross-language phonological comparisons , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[119]  Wilga M. Rivers Teaching Foreign-Language Skills , 1968 .

[120]  Patti Price,et al.  VILTS: A Tale of Two Technologies , 1999 .

[121]  Tatsuya Kawahara,et al.  Japanese CALL system based on dynamic question generation and error prediction for ASR , 2009, SLaTE.

[122]  Helmer Strik,et al.  Pronunciation Evaluation in Read and Spontaneous Speech: A Comparison between human ratings and automatic scores , 2002 .

[123]  Mark J. F. Gales,et al.  Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[124]  Nobuaki Minematsu,et al.  Speech Analysis for Automatic Evaluation of Shadowing , 2010 .

[125]  Helmer Strik,et al.  Effective feedback on L2 pronunciation in ASR-based CALL , 2001 .

[126]  Robert C. Gardner,et al.  Investigating Language Class Anxiety Using the Focused Essay Technique , 1991 .

[127]  Z. Dörnyei Motivation and Motivating in the Foreign Language Classroom , 1994 .

[128]  Gabriel Jacobs,et al.  Treacherous Allies: Foreign Language Grammar Checkers , 2013 .

[129]  M. Swain,et al.  Problems in Output and the Cognitive Processes They Generate: A Step Towards Second Language Learning , 1995, Applied Linguistics.

[130]  Victor Zue,et al.  Second language acquisition through human computer dialogue , 2004, 2004 International Symposium on Chinese Spoken Language Processing.

[131]  Stephanie Seneff,et al.  Speech-enabled Card Games for Language Learners , 2008, AAAI.

[132]  Horacio Franco,et al.  WebGrader TM : A Multilingual Pronunciation Practice Tool , 1998 .

[133]  Edward L. Vockell,et al.  The Computer in the Classroom , 1988, Galileo in Pittsburgh.

[134]  Z. Dörnyei,et al.  Ten commandments for motivating language learners: results of an empirical study , 1998 .

[135]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[136]  Antoine Raux,et al.  A unit selection approach to F0 modeling and its application to emphasis , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[137]  S. Krashen,et al.  The Natural Approach: Language Acquisition in the Classroom , 1983 .

[138]  Keikichi Hirose,et al.  Structural representation of the non-native pronunciations , 2005, INTERSPEECH.

[139]  Steve Young,et al.  The HTK book , 1995 .

[140]  James Glass,et al.  The SUMMIT speech recognition system: phonological modelling and lexical access , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[141]  Vassilios Digalakis,et al.  Automatic pronunciation evaluation of foreign speakers using unknown text , 2007, Comput. Speech Lang..

[142]  Helmer Strik,et al.  The goodness of pronunciation algorithm: a detailed performance study , 2009, SLaTE.

[143]  Helmer Strik,et al.  Phoneme Errors in Read and Spontaneous Non-Native Speech: Relevance for CAPT System Development , 2010 .

[144]  Mitch Weintraub,et al.  Automatic text-independent pronunciation scoring of foreign language student speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[145]  Steven H. Weinberger,et al.  The Wisdom of the Crowd’s Ear: Speech Accent Rating and Annotation with Amazon Mechanical Turk , 2010, Mturk@HLT-NAACL.

[146]  Noriko Nagata,et al.  Computer vs. Workbook Instruction in Second Language Acquisition , 2013, CALICO Journal.

[147]  Gérard Bailly,et al.  Towards the use of a Virtual Talking Head and of Speech Mapping tools for pronunciation training , 2007 .

[148]  James Emil Flege Second-language learning: the role of subject and phonetic variables , 1998 .

[149]  L. Boves,et al.  Quantitative assessment of second language learners' fluency by means of automatic speech recognition technology. , 2000, The Journal of the Acoustical Society of America.

[150]  Helmer Strik,et al.  Towards an Automatic Oral Proficiency Test for Dutch as a Second Language: Automatic Pronunciation Assessment in Read and Spontaneous Speech , 2000 .

[151]  D. Kalikow,et al.  Experiments with computer-controlled displays in second-language learning , 1972 .

[152]  Kristin Precoda,et al.  The SRI EduSpeak System: Recognition and Pronunciation Scoring for Language Learning , 2007 .

[153]  Hermann Ney,et al.  Vocal tract normalization as linear transformation of MFCC , 2003, INTERSPEECH.

[154]  Helmer Strik,et al.  ASR-based corrective feedback on pronunciation: does it really work? , 2006, INTERSPEECH.

[155]  Peter J. M. Groot,et al.  Computer Assisted Second Language Vocabulary Acquisition , 2000 .

[156]  Keikichi Hirose,et al.  A method for measuring the intelligibility and nonnativeness of phone quality in foreign language pronunciation training , 1998, ICSLP.

[157]  Nobuaki Minematsu Pronunciation assessment based upon the phonological distortions observed in language learners' utterances , 2004, INTERSPEECH.

[158]  Elmar Nöth,et al.  How Many Labellers ? Modelling Inter-Labeller Agreement and System Performance for the Automatic Assessment of Non-Native Prosody , 2010 .

[159]  G. Fant Non-uniform vowel normalization , 1975 .

[160]  Mark Hasegawa-Johnson,et al.  Automated pronunciation scoring using confidence scoring and landmark-based SVM , 2009, INTERSPEECH.

[161]  Stephanie Seneff,et al.  Speech-enabled card games for incidental vocabulary acquisition in a foreign language , 2009, Speech Commun..

[162]  Helmer Strik,et al.  L2 pronunciation quality in read and spontaneous speech , 2000, INTERSPEECH.

[163]  Maxine Eskenazi,et al.  USING AUTOMATIC SPEECH PROCESSING FOR FOREIGN LANGUAGE PRONUNCIATION TUTORING: SOME ISSUES AND A PROTOTYPE , 1999 .

[164]  Stephanie Seneff,et al.  An interactive interpretation game for learning Chinese , 2007, SLaTE.

[165]  Antoine Raux,et al.  Using Task-Oriented Spoken Dialogue Systems for Language Learning: Potential, Practical Applications and Challenges , 2004 .

[166]  Mitch Weintraub,et al.  Automatic evaluation of English spoken by Japanese students , 1989 .

[167]  A. Jongman,et al.  Acoustic and perceptual evaluation of Mandarin tone productions before and after perceptual training. , 2003, The Journal of the Acoustical Society of America.

[168]  Eric Moulines,et al.  A diphone synthesis system based on time-domain prosodic modifications of speech , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[169]  Uschi Felix,et al.  Analysing Recent CALL Effectiveness Research—Towards a Common Agenda , 2005 .

[170]  M. Salaberry The Use of Technology for Second Language Learning and Teaching: A Retrospective , 2001 .

[171]  P. MacIntyre,et al.  Anxiety and Second‐Language Learning: Toward a Theoretical Clarification* , 1989 .

[172]  S.V. Bharath Kumar Uniform speaker normalization using frequency-dependent scaling function , 2004, 2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04..

[173]  Keikichi Hirose,et al.  Automatic pronunciation evaluation of language learners' utterances generated through shadowing , 2008, INTERSPEECH.

[174]  Weichao Chen,et al.  Improving the CSIEC Project and Adapting It to the English Teaching and Learning in China , 2006, ArXiv.

[175]  Jian Cheng,et al.  Evaluating diglossic aspects of an automated test of spoken modern standard Arabic , 2009, SLaTE.

[176]  Zhenhai Cao,et al.  Application and evaluation of speech technologies in language learning: experiments with the Saybot player , 2008, INTERSPEECH.

[177]  Roger T. Bell An Introduction to Applied Linguistics: Approaches and Methods in Language Teaching , 1981 .

[178]  Guus de Krom,et al.  Evaluation of second language learners' pronunciation using hidden Markov models , 1997, EUROSPEECH.

[179]  Yves Laprie,et al.  A computer-assisted learning of English prosody for French students , 2004 .

[180]  Horacio Franco,et al.  Calibration of machine scores for pronunciation grading , 1998, ICSLP.

[181]  Stephanie Seneff,et al.  Spoken Dialogue Systems for Language Learning , 2007, NAACL.

[182]  Manuela Gonzalez-Bueno Pronunciation Teaching Component in SL/FL Education Programs: Training Teachers To Teach Pronunciation. , 2001 .

[183]  Helmer Strik,et al.  The Pedagogy-Technology Interface in Computer Assisted Pronunciation Training , 2002 .

[184]  Björn Granström,et al.  Simicry : A mimicry-feedback loop for second language learning , 2010 .

[185]  Helmer Strik,et al.  Using speech recognition technology to assess foreign speakers' pronunciation of Dutch , 1997 .

[186]  Roman Jakobson,et al.  The Sound Shape of Language , 1979 .

[187]  Kathleen M. Bailey,et al.  American Undergraduates' Reactions to the Communication Skills of Foreign Teaching Assistants , 1981 .

[188]  James Emil Flege,et al.  Pronunciation Proficiency in the First and Second Languages of Korean-English Bilinguals. , 2000 .

[189]  Janet H. Murray,et al.  AN OVERVIEW OF THE MIT ATHENA LANGUAGE LEARNING PROJECT , 2013 .

[190]  Herbert Gish,et al.  Methods and experiments for text-independent speaker recognition over telephone channels , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[191]  Yu Hu,et al.  A new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models , 2009, Speech Commun..

[192]  Stephanie Seneff,et al.  Rainbow rummy: a web-based game for vocabulary acquisition using computer-directed speech , 2009, SLaTE.

[193]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[194]  Yonghong Yan,et al.  New Machine Scores and Their Combinations for Automatic Mandarin Phonetic Pronunciation Quality Assessment , 2007, KES.

[195]  Maxine Eskénazi,et al.  An overview of spoken language technology for education , 2009, Speech Commun..

[196]  Yoon Kim,et al.  Automatic pronunciation scoring of specific phone segments for language instruction , 1997, EUROSPEECH.

[197]  Robert C. Gardner,et al.  The Effects of Induced Anxiety on Three Stages of Cognitive Processing in Computerized Vocabulary Learning , 1994, Studies in Second Language Acquisition.

[198]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[199]  Fabio Brugnara,et al.  Speaker normalization through constrained MLLR based transforms , 2004, INTERSPEECH.

[200]  Maxine Eskénazi,et al.  Detection of foreign speakers' pronunciation errors for second language training-preliminary results , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[201]  Horacio Franco,et al.  Automatic detection of phone-level mispronunciation for language learning , 1999, EUROSPEECH.

[202]  Jane Kuo,et al.  Assessing the Assessments: The OPI and the SOPI. , 1997 .

[203]  Keikichi Hirose,et al.  Pronunciation Proficiency Estimation Based on Multilayer Regression Analysis Using Speaker-independent Structural Features , 2010 .

[204]  Jack Mostow,et al.  Giving Help and Praise in a Reading Tutor with Imperfect Listening--Because Automated Speech Recognition Means Never Being Able to Say You're Certain , 2013, CALICO Journal.

[205]  D. Nunan Communicative Language Teaching: Making It Work. , 1987 .

[206]  Stacy Marsella,et al.  Tactical Language Training System: An Interim Report , 2004, Intelligent Tutoring Systems.