Speech Interaction to Control a Hands-Free Delivery Robot for High-Risk Health Care Scenarios

The COVID-19 pandemic has had a widespread effect across the globe. Its impact on health-care workers and the vulnerable populations they serve has been of particular concern. Near-complete lockdown has been a common strategy for reducing the spread of the virus in environments such as live-in care facilities. Robotics is a promising area of research that can help reduce the spread of COVID-19 while also avoiding the need for complete physical isolation. The research presented in this paper demonstrates a speech-controlled, self-sanitizing robot that delivers items from a visitor to a resident of a care facility. The system is automated to reduce the burden on facility staff, and it is controlled entirely through hands-free audio interaction to reduce transmission of the virus. We demonstrate an end-to-end delivery test and present an in-depth evaluation of the speech interface. We also recorded a speech dataset under two conditions, with the talker wearing a face mask and with the talker not wearing one, and used it to evaluate the speech recognition system. This enabled us to test the effect of face masks on speech recognition interfaces in the context of autonomous systems.
