Paper information - Towards High-Reliability Speech Translation in the Medical Domain

Towards High-Reliability Speech Translation in the Medical Domain

In this paper, we describe the overall design of a speech translation system that aims to reduce the problems caused by language barriers in medical situations. As first steps toward building a system according to this design, we describe the collection of a medical corpus and translation experiments performed on this corpus. In these experiments, we find that the best of three modern translation systems translates 33%-81% of the sentences such that the main content is understandable.
