Microsoft Speech Language Translation (MSLT) Corpus: The IWSLT 2016 release for English, French and German
We describe the Microsoft Speech Language Translation (MSLT) corpus, which was created to evaluate end-to-end conversational speech translation quality. The corpus was built from actual conversations over Skype, and we provide details on the recording setup and the different layers of associated text data. The corpus release includes Test and Dev sets with reference transcripts for speech recognition. Additionally, cleaned-up transcripts and reference translations are available for evaluating machine translation quality. The IWSLT 2016 release described here includes the source audio, raw transcripts, cleaned-up transcripts, and translations to or from English for both French and German.