Written Form Extraction of Spoken Numeric Sequences in Speech-to-Text Conversion for Ukrainian

Automatic speech-to-text conversion outputs a sequence of words drawn from a working dictionary. Producing numbers in written form would therefore require adding every number to the dictionary, which is not feasible. Instead, we introduce a post-processing block that extracts numeric sequences from the speech recognition response. We describe a sequence-to-sequence converter, a finite state transducer initially designed to generate phoneme sequences from words for Ukrainian using expert-specified rules. We then apply this model to extract numeric sequences from the speech recognition response, taking into account the word sequence as well as the time and speaker identity estimates for each word. Finally, we discuss experimental results and identify problems for further research.
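
For illustration only, the post-processing idea can be sketched in Python. This is not the finite state transducer described here but a simple rule-based scan standing in for it. The `Word` record, the `extract_numbers` function, the `max_gap` threshold, and the small numeral lexicon are all hypothetical. The sketch also assumes that a speaker change or a long pause ends a numeric sequence, which is only one possible way of using the time and speaker estimates.

```python
from dataclasses import dataclass

# Illustrative subset of Ukrainian numerals (assumption: not a complete lexicon).
NUMERALS = {
    "один": 1, "одна": 1, "два": 2, "дві": 2, "три": 3, "чотири": 4,
    "десять": 10, "одинадцять": 11, "дванадцять": 12,
    "двадцять": 20, "тридцять": 30, "сорок": 40,
    "сто": 100, "двісті": 200, "триста": 300,
}
THOUSAND = {"тисяча", "тисячі", "тисяч"}


@dataclass
class Word:
    """One recognized word with hypothetical time and speaker estimates."""
    text: str
    start: float
    end: float
    speaker: str


def slot(value):
    """Return (rank needed to enter, rank left after) for a numeral value."""
    if value >= 100:
        return 3, 3   # hundreds
    if value >= 20:
        return 2, 2   # tens
    if value >= 10:
        return 2, 1   # 10..19 fill both the tens and the units slot
    return 1, 1       # units


def extract_numbers(words, max_gap=0.5):
    """Replace runs of numeral words with digit strings."""
    out = []
    total = current = 0
    last_rank = 4          # 4 means "any numeral may start here"
    in_number = seen_thousand = False
    prev = None

    def flush():
        nonlocal total, current, last_rank, in_number, seen_thousand
        if in_number:
            out.append(str(total + current))
        total = current = 0
        last_rank = 4
        in_number = seen_thousand = False

    for w in words:
        # Assumption: a speaker change or a long pause ends the current number.
        if prev is not None and (
            w.speaker != prev.speaker or w.start - prev.end > max_gap
        ):
            flush()
        token = w.text.lower()
        if token in NUMERALS:
            value = NUMERALS[token]
            enter, after = slot(value)
            if enter >= last_rank:   # slot already taken: start a new number
                flush()
            current += value
            last_rank = after
            in_number = True
        elif token in THOUSAND:
            if seen_thousand:
                flush()
            total = (current or 1) * 1000
            current = 0
            last_rank = 4
            in_number = seen_thousand = True
        else:
            flush()
            out.append(w.text)
        prev = w
    flush()
    return out
```

Under these assumptions, the word sequence "двадцять три сорок один" spoken by one speaker without long pauses is split into two numbers, because "сорок" cannot follow a filled units slot, whereas the same words separated by a speaker change are never merged.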