Automatic Speech Recognition for Human-Robot Interaction Using an Under-Resourced Language

ii Abstract (in Finnish) iiiin Finnish) iii

[1]  Aren Jansen,et al.  Data-driven Posterior Features for Low Resource Speech Recognition Applications , 2012, INTERSPEECH.

[2]  Simon King,et al.  Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian , 2008, INTERSPEECH.

[3]  Vincent Berment,et al.  Méthodes pour informatiser les langues et les groupes de langues « peu dotées ». (Methods to computerize "little equipped" languages and groups of languages) , 2004 .

[4]  Janne Pylkkönen Towards Efficient and Robust Automatic Speech Recognition: Decoding Techniques and Discriminative Training , 2013 .

[5]  Etienne Barnard,et al.  Pooling ASR data for closely related languages , 2010, SLTU.

[6]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[7]  Teemu Hirsimäki,et al.  On Growing and Pruning Kneser–Ney Smoothed $ N$-Gram Models , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[9]  Mikko Kurimo,et al.  Importance of High-Order N-Gram Models in Morph-Based Speech Recognition , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Hynek Hermansky,et al.  Multilingual MLP features for low-resource LVCSR systems , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Mikko Kurimo,et al.  Morfessor 2.0: Python Implementation and Extensions for Morfessor Baseline , 2013 .

[12]  Péter Mihajlik,et al.  On morph-based LVCSR improvements , 2010, SLTU.

[13]  Lukás Burget,et al.  Morphological random forests for language modeling of inflectional languages , 2008, 2008 IEEE Spoken Language Technology Workshop.

[14]  P. Eisenlohr Language Revitalization and New TECHNOLOGIES: Cultures of Electronic Mediation and the Refiguring of Communities , 2004 .

[15]  Ebru Arisoy,et al.  Morph-based speech recognition and modeling of out-of-vocabulary words across languages , 2007, TSLP.

[16]  Ahmad Emami,et al.  Syntactic features for Arabic speech recognition , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[17]  Janne Pylkkönen New pruning criteria for efficient decoding , 2005, INTERSPEECH.

[18]  Hermann Ney,et al.  Cross-lingual portability of Chinese and english neural network features for French and German LVCSR , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[19]  David Gouaillier,et al.  Omni-directional closed-loop walk for NAO , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[20]  Mark Dredze,et al.  Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining , 2012, ACL.

[21]  Mikko Kurimo,et al.  Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner , 2003, INTERSPEECH.

[22]  Joshua Goodman,et al.  A bit of progress in language modeling , 2001, Comput. Speech Lang..

[23]  Ruhi Sarikaya,et al.  Joint Morphological-Lexical Language Modeling for Machine Translation , 2007, NAACL.

[24]  Mathias Creutz,et al.  Unsupervised models for morpheme segmentation and morphology learning , 2007, TSLP.

[25]  Tatsuya Kawahara,et al.  Uyghur morpheme-based language models and ASR , 2010, IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS.

[26]  Mikko Kurimo,et al.  Low-Resource Active Learning of North Sámi Morphological Segmentation , 2015 .

[27]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[28]  Tibor Fegyó,et al.  Improved Recognition of Spontaneous Hungarian Speech—Morphological and Acoustic Modeling Techniques for a Less Resourced Task , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[29]  Ngoc Thang Vu,et al.  Rapid Building of an ASR System for Under-Resourced Languages Based on Multilingual Unsupervised Training , 2011, INTERSPEECH.

[30]  Mary M. Nelan,et al.  Responding to Haiti's Earthquake: International Volunteers’ Health Behaviors and Community Relationships , 2012, International Journal of Mass Emergencies & Disasters.

[31]  Kristiina Jokinen,et al.  WikiTalk human-robot interactions , 2013, ICMI '13.

[32]  Pekka Sammallahti,et al.  The Saami languages : an introduction , 1999 .

[33]  M. Paul Lewis How many languages are there in the world , 2012 .

[34]  Einar Meister,et al.  Methods for Estonian Large Vocabulary Speech Recognition , 2006 .

[35]  F ChenStanley,et al.  An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[36]  Mathias Creutz,et al.  Morphology-aware statistical machine translation based on morphs induced in an unsupervised manner , 2007, MTSUMMIT.

[37]  Heiga Zen,et al.  Speech Synthesis Based on Hidden Markov Models , 2013, Proceedings of the IEEE.

[38]  Mikko Kurimo,et al.  Morfessor and variKN machine learning tools for speech and language technology , 2007, INTERSPEECH.

[39]  Biing-Hwang Juang,et al.  A study on speaker adaptation of the parameters of continuous density hidden Markov models , 1991, IEEE Trans. Signal Process..

[40]  C. Yau,et al.  Bayesian non‐parametric hidden Markov models with applications in genomics , 2011 .

[41]  Mikko Kurimo,et al.  Speech retrieval from unsegmented finnish audio using statistical morpheme-like units for segmentation, recognition, and retrieval , 2008, TSLP.

[42]  Teemu Hirsimäki ADVANCES IN UNLIMITED-VOCABULARY SPEECH RECOGNITION FOR MORPHOLOGICALLY RICH LANGUAGES , 2009 .

[43]  Mikko Kurimo,et al.  Empirical Comparison of Evaluation Methods for Unsupervised Learning of Morphology , 2011, TAL.

[44]  Kristiina Jokinen,et al.  Multimodal Open-Domain Conversations with the Nao Robot , 2012, Natural Interaction with Robots, Knowbots and Smartphones, Putting Spoken Dialog Systems into Practice.

[45]  Etienne Barnard,et al.  ASR corpus design for resource-scarce languages , 2009, INTERSPEECH.

[46]  Mikko Kurimo,et al.  Unlimited vocabulary speech recognition with morph language models applied to Finnish , 2006, Comput. Speech Lang..

[47]  Murat Saraclar,et al.  Morphology-based and sub-word language modeling for Turkish speech recognition , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[48]  S. Levinson,et al.  The myth of language universals: language diversity and its importance for cognitive science. , 2009, The Behavioral and brain sciences.

[49]  Hae-Chang Rim,et al.  Probabilistic Modeling of Korean Morphology , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[50]  Mirjam Sepesy Maucec,et al.  Large vocabulary continuous speech recognition of an inflected language using stems and endings , 2007, Speech Commun..

[51]  Takahira Yamaguchi,et al.  Intelligent Humanoid Robot with Japanese Wikipedia Ontology and Robot Action Ontology , 2011, 2011 6th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[52]  Martine Adda-Decker A corpus-based decompounding algorithm for German lexical modeling in LVCSR , 2003, INTERSPEECH.

[53]  Lori Lamel,et al.  Comparing SMT Methods for Automatic Generation of Pronunciation Variants , 2010, IceTAL.

[54]  Mikko Kurimo,et al.  Analysing Recognition Errors in Unlimited-Vocabulary Speech Recognition , 2009, HLT-NAACL.

[55]  Mark J. F. Gales,et al.  The Application of Hidden Markov Models in Speech Recognition , 2007, Found. Trends Signal Process..

[56]  Kristiina Jokinen,et al.  Open-domain Interaction and Online Content in the Sami Language , 2014, LREC.

[57]  Etienne Barnard,et al.  Vowel variation in Southern Sotho: an acoustic investigation , 2008 .

[58]  Lin Lawrance Chase Error-responsive feedback mechanisms for speech recognizers , 1997 .

[59]  Andreas Stolcke,et al.  Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[60]  Robert J. Elliott,et al.  Option pricing and Esscher transform under regime switching , 2005 .

[61]  Dimitra Anastasiou,et al.  Evaluation of WikiTalk - User Studies of Human-Robot Interaction , 2013, HCI.

[62]  Tanja Schultz,et al.  Automatic speech recognition for under-resourced languages: A survey , 2014, Speech Commun..

[63]  Ebru Arisoy,et al.  Unlimited vocabulary speech recognition for agglutinative languages , 2006, NAACL.

[64]  Lyle Campbell,et al.  Ethnologue: Languages of the world (review) , 2008 .

[65]  Martin Karafiát,et al.  The language-independent bottleneck features , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).

[66]  S. Shamsuddin,et al.  Initial response of autistic children in human-robot interaction therapy with humanoid robot NAO , 2012, 2012 IEEE 8th International Colloquium on Signal Processing and its Applications.

[67]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[68]  Thomas Pellegrini,et al.  Automatic Word Decompounding for ASR in a Morphologically Rich Language: Application to Amharic , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[69]  Paul Deléglise,et al.  Grapheme to phoneme conversion using an SMT system , 2009, INTERSPEECH.

[70]  Fabio Tesser,et al.  Spoken language processing in a conversational system for child-robot interaction , 2012, WOCCI.

[71]  Alex Acero,et al.  Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .