Automatic Speech Recognition for Human-Robot Interaction Using an Under-Resourced Language

[1] Aren Jansen,et al. Data-driven Posterior Features for Low Resource Speech Recognition Applications , 2012, INTERSPEECH.

[2] Simon King,et al. Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian , 2008, INTERSPEECH.

[3] Vincent Berment,et al. Méthodes pour informatiser les langues et les groupes de langues « peu dotées ». (Methods to computerize "little equipped" languages and groups of languages) , 2004 .

[4] Janne Pylkkönen. Towards Efficient and Robust Automatic Speech Recognition: Decoding Techniques and Discriminative Training , 2013 .

[5] Etienne Barnard,et al. Pooling ASR data for closely related languages , 2010, SLTU.

[6] L. Baum,et al. Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[7] Teemu Hirsimäki,et al. On Growing and Pruning Kneser–Ney Smoothed N-Gram Models , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8] F. Wilcoxon. Individual Comparisons by Ranking Methods , 1945 .

[9] Mikko Kurimo,et al. Importance of High-Order N-Gram Models in Morph-Based Speech Recognition , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[10] Hynek Hermansky,et al. Multilingual MLP features for low-resource LVCSR systems , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11] Mikko Kurimo,et al. Morfessor 2.0: Python Implementation and Extensions for Morfessor Baseline , 2013 .

[12] Péter Mihajlik,et al. On morph-based LVCSR improvements , 2010, SLTU.

[13] Lukás Burget,et al. Morphological random forests for language modeling of inflectional languages , 2008, 2008 IEEE Spoken Language Technology Workshop.

[14] P. Eisenlohr. Language Revitalization and New Technologies: Cultures of Electronic Mediation and the Refiguring of Communities , 2004 .

[15] Ebru Arisoy,et al. Morph-based speech recognition and modeling of out-of-vocabulary words across languages , 2007, TSLP.

[16] Ahmad Emami,et al. Syntactic features for Arabic speech recognition , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[17] Janne Pylkkönen. New pruning criteria for efficient decoding , 2005, INTERSPEECH.

[18] Hermann Ney,et al. Cross-lingual portability of Chinese and English neural network features for French and German LVCSR , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[19] David Gouaillier,et al. Omni-directional closed-loop walk for NAO , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[20] Mark Dredze,et al. Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining , 2012, ACL.

[21] Mikko Kurimo,et al. Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner , 2003, INTERSPEECH.

[22] Joshua Goodman,et al. A bit of progress in language modeling , 2001, Comput. Speech Lang.

[23] Ruhi Sarikaya,et al. Joint Morphological-Lexical Language Modeling for Machine Translation , 2007, NAACL.

[24] Mathias Creutz,et al. Unsupervised models for morpheme segmentation and morphology learning , 2007, TSLP.

[25] Tatsuya Kawahara,et al. Uyghur morpheme-based language models and ASR , 2010, IEEE 10th International Conference on Signal Processing Proceedings.

[26] Mikko Kurimo,et al. Low-Resource Active Learning of North Sámi Morphological Segmentation , 2015 .

[27] L. Baum,et al. A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[28] Tibor Fegyó,et al. Improved Recognition of Spontaneous Hungarian Speech—Morphological and Acoustic Modeling Techniques for a Less Resourced Task , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[29] Ngoc Thang Vu,et al. Rapid Building of an ASR System for Under-Resourced Languages Based on Multilingual Unsupervised Training , 2011, INTERSPEECH.

[30] Mary M. Nelan,et al. Responding to Haiti's Earthquake: International Volunteers’ Health Behaviors and Community Relationships , 2012, International Journal of Mass Emergencies & Disasters.

[31] Kristiina Jokinen,et al. WikiTalk human-robot interactions , 2013, ICMI '13.

[32] Pekka Sammallahti,et al. The Saami languages : an introduction , 1999 .

[33] M. Paul Lewis. How many languages are there in the world , 2012 .

[34] Einar Meister,et al. Methods for Estonian Large Vocabulary Speech Recognition , 2006 .

[35] Stanley F. Chen,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[36] Mathias Creutz,et al. Morphology-aware statistical machine translation based on morphs induced in an unsupervised manner , 2007, MTSUMMIT.

[37] Heiga Zen,et al. Speech Synthesis Based on Hidden Markov Models , 2013, Proceedings of the IEEE.

[38] Mikko Kurimo,et al. Morfessor and variKN machine learning tools for speech and language technology , 2007, INTERSPEECH.

[39] Biing-Hwang Juang,et al. A study on speaker adaptation of the parameters of continuous density hidden Markov models , 1991, IEEE Trans. Signal Process.

[40] C. Yau,et al. Bayesian non‐parametric hidden Markov models with applications in genomics , 2011 .

[41] Mikko Kurimo,et al. Speech retrieval from unsegmented Finnish audio using statistical morpheme-like units for segmentation, recognition, and retrieval , 2008, TSLP.

[42] Teemu Hirsimäki. Advances in Unlimited-Vocabulary Speech Recognition for Morphologically Rich Languages , 2009 .

[43] Mikko Kurimo,et al. Empirical Comparison of Evaluation Methods for Unsupervised Learning of Morphology , 2011, TAL.

[44] Kristiina Jokinen,et al. Multimodal Open-Domain Conversations with the Nao Robot , 2012, Natural Interaction with Robots, Knowbots and Smartphones, Putting Spoken Dialog Systems into Practice.

[45] Etienne Barnard,et al. ASR corpus design for resource-scarce languages , 2009, INTERSPEECH.

[46] Mikko Kurimo,et al. Unlimited vocabulary speech recognition with morph language models applied to Finnish , 2006, Comput. Speech Lang.

[47] Murat Saraclar,et al. Morphology-based and sub-word language modeling for Turkish speech recognition , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[48] S. Levinson,et al. The myth of language universals: language diversity and its importance for cognitive science , 2009, The Behavioral and Brain Sciences.

[49] Hae-Chang Rim,et al. Probabilistic Modeling of Korean Morphology , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[50] Mirjam Sepesy Maucec,et al. Large vocabulary continuous speech recognition of an inflected language using stems and endings , 2007, Speech Commun.

[51] Takahira Yamaguchi,et al. Intelligent Humanoid Robot with Japanese Wikipedia Ontology and Robot Action Ontology , 2011, 2011 6th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[52] Martine Adda-Decker. A corpus-based decompounding algorithm for German lexical modeling in LVCSR , 2003, INTERSPEECH.

[53] Lori Lamel,et al. Comparing SMT Methods for Automatic Generation of Pronunciation Variants , 2010, IceTAL.

[54] Mikko Kurimo,et al. Analysing Recognition Errors in Unlimited-Vocabulary Speech Recognition , 2009, HLT-NAACL.

[55] Mark J. F. Gales,et al. The Application of Hidden Markov Models in Speech Recognition , 2007, Found. Trends Signal Process.

[56] Kristiina Jokinen,et al. Open-domain Interaction and Online Content in the Sami Language , 2014, LREC.

[57] Etienne Barnard,et al. Vowel variation in Southern Sotho: an acoustic investigation , 2008 .

[58] Lin Lawrance Chase. Error-responsive feedback mechanisms for speech recognizers , 1997 .

[59] Andreas Stolcke,et al. Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[60] Robert J. Elliott,et al. Option pricing and Esscher transform under regime switching , 2005 .

[61] Dimitra Anastasiou,et al. Evaluation of WikiTalk - User Studies of Human-Robot Interaction , 2013, HCI.

[62] Tanja Schultz,et al. Automatic speech recognition for under-resourced languages: A survey , 2014, Speech Commun.

[63] Ebru Arisoy,et al. Unlimited vocabulary speech recognition for agglutinative languages , 2006, NAACL.

[64] Lyle Campbell,et al. Ethnologue: Languages of the world (review) , 2008 .

[65] Martin Karafiát,et al. The language-independent bottleneck features , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).

[66] S. Shamsuddin,et al. Initial response of autistic children in human-robot interaction therapy with humanoid robot NAO , 2012, 2012 IEEE 8th International Colloquium on Signal Processing and its Applications.

[67] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[68] Thomas Pellegrini,et al. Automatic Word Decompounding for ASR in a Morphologically Rich Language: Application to Amharic , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[69] Paul Deléglise,et al. Grapheme to phoneme conversion using an SMT system , 2009, INTERSPEECH.

[70] Fabio Tesser,et al. Spoken language processing in a conversational system for child-robot interaction , 2012, WOCCI.

[71] Alex Acero,et al. Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .