Language Modeling with Functional Head Constraint for Code Switching Speech Recognition

In this paper, we propose novel structured language modeling methods for code-mixing speech recognition that incorporate a well-known syntactic constraint on code switching, the Functional Head Constraint (FHC). Code-mixing data is not abundantly available for training language models. Our proposed methods alleviate this core problem by using bilingual data to train a structured language model with a syntactic constraint. Linguists and bilingual speakers have found that code switches do not occur between a functional head and its complements. We propose to learn the code-mixing language model from bilingual data under this constraint in a weighted finite-state transducer (WFST) framework. The constrained code-switching language model is obtained by first expanding the search network with a translation model and then using parsing to restrict paths to those permissible under the constraint. We im
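The path-restriction step described above can be illustrated with a toy filter. This is only a minimal sketch under stated assumptions, not the WFST implementation the paper describes: it assumes each candidate path is already a list of (word, language) tokens, and that a parser has supplied index pairs linking each functional head to the head of its complement. The names `violates_fhc` and `filter_paths`, the tag values `en` and `zh`, and the example sentences are all hypothetical.

```python
from typing import List, Tuple

Token = Tuple[str, str]      # (word, language tag), e.g. ("the", "en")
HeadComp = Tuple[int, int]   # (functional head index, complement head index)
Path = Tuple[List[Token], List[HeadComp]]


def violates_fhc(tokens: List[Token], pairs: List[HeadComp]) -> bool:
    """Return True if any functional head differs in language from its complement."""
    for head_idx, comp_idx in pairs:
        if tokens[head_idx][1] != tokens[comp_idx][1]:
            return True
    return False


def filter_paths(paths: List[Path]) -> List[List[Token]]:
    """Keep only the token sequences that are permissible under the constraint."""
    return [tokens for tokens, pairs in paths if not violates_fhc(tokens, pairs)]


# Assumed parser output: the determiner "the" (index 2) is a functional head
# whose complement is the noun at index 3.
switch_inside_dp: Path = (
    [("I", "en"), ("want", "en"), ("the", "en"), ("书", "zh")],
    [(2, 3)],
)
switch_before_dp: Path = (
    [("我", "zh"), ("想要", "zh"), ("the", "en"), ("book", "en")],
    [(2, 3)],
)

permitted = filter_paths([switch_inside_dp, switch_before_dp])
```

In this sketch the first candidate is rejected because the switch falls between the determiner and its noun, while the second is kept because the switch falls after a lexical verb, which the constraint does not restrict. In a WFST setting the same check would presumably be applied to paths of the expanded search network rather than to explicit lists.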
