THE LIMSI2006TC-STAREPPSTRANSCRIPTIONSYSTEMS*
暂无分享,去创建一个
Thispaperdescribes thespeech recognizers developed to transcribe European Parliament Plenary Sessions (EPPS) inEnglish andSpanish inthe2ndTC-STAREvaluation Campaign. Thespeech recognizers arestate-of-the-art systems using multiple decoding passes withmodels (lexicon, acoustic models, language models) trained forthedifferent transcription tasks. Compared totheLIMSITC-STAR2005EPPSsystems, relative worderror rate reductions ofabout 30%havebeenachieved onthe2006development data. Theworderror rates withthe LIMSIsystems onthe2006EPPSevaluation dataare8.2% forEnglish and7.8%forSpanish. Experiments withcross-site adaptation andsystem combination arealso described. Index Terms - Speech recognition
[1] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.
[2] Jean-Luc Gauvain,et al. Multistage speaker diarization of broadcast news , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[3] Jonathan G. Fiscus,et al. A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER) , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.