Slovak Automatic Dictation System for Judicial Domain

This paper describes the design, development, and evaluation of a Slovak dictation system for the judicial domain. The system provides an automatic dictation tool in Slovak for employees of the Ministry of Justice of the Slovak Republic and all courts in Slovakia. Speech is recorded with a close-talk microphone, and the system supports both on-line dictation and off-line transcription of legal texts recorded in the acoustic conditions of a typical office. Details of the technical solution are given, and an evaluation of different versions of the system is presented.
