Assessing Virtual Assistant Capabilities with Italian Dysarthric Speech

The usage of smartphone-based virtual assistants (e.g., Siri or Google Assistant) is growing, and their spread has generally a positive impact on device accessibility, e.g., for people with disabilities. However, people with dysarthria or other speech impairments may be unable to use these virtual assistants with proficiency. This paper investigates to which extent people with ALS-induced dysarthria can be understood and get consistent answers by three widely used smartphone-based assistants, namely Siri, Google Assistant, and Cortana. We focus on the recognition of Italian dysarthric speech, to study the behavior of the virtual assistants with this specific population for which no relevant studies are available. We collected and recorded suitable speech samples from people with dysarthria in a dedicated center of the Molinette hospital, in Turin, Italy. Starting from those recordings, the differences between such assistants, in terms of speech recognition and consistency in answer, are investigated and discussed. Results highlight different performance among the virtual assistants. For speech recognition, Google Assistant is the most promising, with around 25% of word error rate per sentence. Consistency in answer, instead, sees Siri and Google Assistant provide coherent answers around 60% of times.

[1]  Mark Hawley,et al.  Speech Recognition as an Input to Electronic Assistive Technology , 2002 .

[2]  Jeffrey P. Bigham,et al.  On How Deaf People Might Use Speech to Control Devices , 2017, ASSETS.

[3]  Frank Rudzicz,et al.  Using articulatory likelihoods in the recognition of dysarthric speech , 2012, Speech Commun..

[4]  Fulvio Corno,et al.  "Hey Siri, Do You Understand Me?": Virtual Assistants and Dysarthria , 2018, Intelligent Environments.

[5]  Prasad D Polur,et al.  Investigation of an HMM/ANN hybrid structure in pattern recognition application using cepstral analysis of dysarthric (distorted) speech signals. , 2006, Medical engineering & physics.

[6]  Luis A. Guerrero,et al.  Alexa vs. Siri vs. Cortana vs. Google Assistant: A Comparison of Speech-Based Natural User Interfaces , 2017 .

[7]  Raja S. Kushalnagar,et al.  Feasibility of Using Automatic Speech Recognition with Voices of Deaf and Hard-of-Hearing Individuals , 2017, ASSETS.

[8]  Andrew Sears,et al.  Physical disabilities and computing technologies: an analysis of impairments , 2002 .

[9]  R. Guiloff,et al.  Dysarthria in amyotrophic lateral sclerosis: A review , 2010, Amyotrophic lateral sclerosis : official publication of the World Federation of Neurology Research Group on Motor Neuron Diseases.

[10]  Jan Derboven,et al.  Designing voice interaction for people with physical and speech impairments , 2014, NordiCHI.

[11]  J. Cedarbaum,et al.  The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function , 1999, Journal of the Neurological Sciences.