Towards SamiTalk: A Sami-Speaking Robot Linked to Sami Wikipedia

We describe our work towards developing SamiTalk, a robot application for the North Sami language. With SamiTalk, users will hold spoken dialogues with a humanoid robot that speaks and recognizes North Sami. The robot will access information from the Sami Wikipedia, talk about requested topics using the Wikipedia texts, and make smooth topic shifts to related topics using the Wikipedia links. SamiTalk will be based on the existing WikiTalk system for Wikipedia-based spoken dialogues, with newly developed speech components for North Sami.
