Dialectal Chinese Speech Recognition: Final Report

†Richard Sproat, University of Illinois (Thomas) Fang Zheng, Tsinghua University Liang Gu, IBM Jing Li, Tsinghua University Yanli Zheng, University of Illinois Yi Su, Johns Hopkins University Haolang Zhou, Johns Hopkins University Philip Bramsen, MIT David Kirsch, Lehigh University Izhak Shafran, Johns Hopkins University Stavros Tsakalidis, Johns Hopkins University Rebecca Starr, Stanford University Dan Jurafsky, Stanford University

[1]  Fernando Pereira,et al.  Weighted finite-state transducers in speech recognition , 2002, Comput. Speech Lang..

[2]  Yunxin Zhao,et al.  Fast model selection based speaker adaptation for nonnative speech , 2003, IEEE Trans. Speech Audio Process..

[3]  Kristin Precoda,et al.  Prosodic features for automatic text-independent evaluation of degree of nativeness for language learners , 2000, INTERSPEECH.

[4]  Wayne H. Ward,et al.  Issues in recognition of spanish-accented spontaneous english , 2003 .

[5]  R. W. King,et al.  Automatic accent classification of foreign accented Australian English speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[6]  Mehryar Mohri,et al.  A Rational Design for a Weighted Finite-State Transducer Library , 1997, Workshop on Implementing Automata.

[7]  Alex Waibel,et al.  Adaptation Methods For Non-Native Speech , 2001 .

[8]  Isabel Trancoso,et al.  Accent identification , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[9]  Tao Chen,et al.  Accent Issues in Large Vocabulary Continuous Speech Recognition , 2004, Int. J. Speech Technol..

[10]  George Karypis,et al.  CLUTO - A Clustering Toolkit , 2002 .

[11]  Pascale Fung,et al.  MLLR-based accent model adaptation without accented data , 2000, INTERSPEECH.

[12]  Laura Mayfield Tomokiyo,et al.  Lexical and acoustic modeling of non-native speech in LVSCR , 2000, INTERSPEECH.

[13]  Chao Huang,et al.  Accent modeling based on pronunciation dictionary adaptation for large vocabulary Mandarin speech recognition , 2000, INTERSPEECH.

[14]  Tanja Schultz,et al.  Comparison of acoustic model adaptation techniques on non-native speech , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[15]  Construction of Large-Scale Shanghai Putonghua Speech Corpus for Chinese Speech Recognition , 2022 .

[16]  Marina Sahakyan,et al.  Is non-native pronunciation modelling necessary ? , 2001, INTERSPEECH.

[17]  Carl de Marcken,et al.  Unsupervised language acquisition , 1996, ArXiv.

[18]  Mehryar Mohri,et al.  Voice signatures , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[19]  Mark Hasegawa-Johnson,et al.  Stop consonant classification by dynamic formant trajectory , 2004, INTERSPEECH.

[20]  James R. Glass,et al.  Lexical modeling of non-native speech for automatic speech recognition , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[21]  Richard Sproat,et al.  Lattice-Based Search for Spoken Utterance Retrieval , 2004, NAACL.

[22]  Pascale Fung,et al.  Fast accent identification and accented speech recognition , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[23]  Alexander H. Waibel,et al.  Class phrase models for language modeling , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[24]  Andrej Ljolje Speech recognition using fundamental frequency and voicing in acoustic modeling , 2002, INTERSPEECH.

[25]  Dietrich Klakow Language-model optimization by mapping of corpora , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[26]  Chao Huang,et al.  Automatic accent identification using Gaussian mixture models , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[27]  Stephen Cox,et al.  A comparison of two unsupervised approaches to accent identification , 1998, ICSLP.

[28]  Richard M. Schwartz,et al.  A compact model for speaker-adaptive training , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[29]  Tao Chen,et al.  Accent issue in large vocabulary continuous speech recognition , 2004 .

[30]  Pascale Fung,et al.  Modeling partial pronunciation variations for spontaneous Mandarin speech recognition , 2002, Comput. Speech Lang..

[31]  Kristin Precoda EVALUATION OF SPEAKER’S DEGREE OF NATIVENESS USING TEXT-INDEPENDENT PROSODIC FEATURES , 2001 .

[32]  Martha Larson,et al.  Compound splitting and lexical unit recombination for improved performance of a speech recognition system for German parliamentary speeches , 2000, INTERSPEECH.

[33]  Helmer Strik,et al.  Modeling pronunciation variation for ASR: A survey of the literature , 1999, Speech Commun..

[34]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[35]  Pascale Fung,et al.  Partial change accent models for accented Mandarin speech recognition , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[36]  Mehryar Mohri Weighted Grammar Tools: The GRM Library , 2001 .

[37]  A. Waibel,et al.  Multilinguality in speech and spoken language systems , 2000, Proceedings of the IEEE.

[38]  Alex Waibel,et al.  Speaker, accent, and language identification using multilingual phone strings , 2002, Proceedings of the second international conference on Human Language Technology Research -.

[39]  Mehryar Mohri,et al.  A weight pushing algorithm for large vocabulary speech recognition , 2001, INTERSPEECH.

[40]  Philip C. Woodland,et al.  Using accent-specific pronunciation modelling for robust speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[41]  Richard Sproat,et al.  Corpus-Based Methods in Chinese Morphology and Phonology , 2001 .

[42]  P. Woodland,et al.  Flexible speaker adaptation using maximum likelihood linear regression , 1995 .

[43]  D. Barnes,et al.  The Languages of China , 1989 .