论文信息 - Turkish LVCSR: Database Preparation and Language Modeling for an Agglutinative Language

Turkish LVCSR: Database Preparation and Language Modeling for an Agglutinative Language

Turkish language is an agglutinative language. It is possible to produce a very high number of words from the same root with suffixes [1]. Language modeling for agglutinative languages needs to be different than modeling of languages like English. Such languages also have inflections but not as many as an agglutinative language. Techniques which can be used for modeling agglutinative languages are presented in this work. Turkish is one of the least studied language for speech recognition. For this reason the first step for Turkish speech recognition is preparing a database. The texts to record the database were selected from television programs and newspaper articles. Selection criterion was to cover various subject and to create a phonetically balanced corpus. Additionally it is important to include as many different word as possible. The Speech Training and Recognition Unified Tool (STRUT)1 has been used for training and testing systems for preliminary recognition experiments.

Erhan Mengusoglu | Olivier Deroo | O. Deroo | Erhan Mengusoglu

[1] Andrew J. Viterbi,et al. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[2] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[3] Hervé Bourlard,et al. Connectionist Speech Recognition: A Hybrid Approach , 1993 .

[4] Kemal Oflazer,et al. Two-level Description of Turkish Morphology , 1993, EACL.

[5] Hynek Hermansky,et al. RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[6] Beryl Hoffman,et al. Computational analysis of the syntax and interpretation of "free" word order in Turkish , 1995 .

[7] Andrew Borthwick. Survey Paper on Statistical Language Modeling , 1997 .

[8] Gethin Williams,et al. Knowing What You Don't Know: Roles for Confidence Measures in Automatic Speech Recognition , 1999 .

[9] Johan Carlberger,et al. Implementing an Efficient Part-Of-Speech Tagger , 1999, Softw. Pract. Exp..

[10] Kemal Oflazer,et al. Statistical morphological disambiguation for agglutinative languages , 2000, COLING 2000.

[11] Gökhan Tür,et al. Statistical Morphological Disambiguation for Agglutinative Languages , 2000, COLING.

[12] Christophe Ris,et al. Use of acoustic prior information for confidence measure in ASR applications , 2001, INTERSPEECH.