Using Pupil Dilation to Measure Cognitive Load When Listening to Text-to-Speech in Quiet and in Noise

With increased use of text-to-speech (TTS) systems in realworld applications, evaluating how such systems influence the human cognitive processing system becomes important. Particularly in situations where cognitive load is high, there may be negative implications such as fatigue. For example, noisy situations generally require the listener to exert increased mental effort. A better understanding of this could eventually suggest new ways of generating synthetic speech that demands low cognitive load. In our previous study, pupil dilation was used as an index of cognitive effort. Pupil dilation was shown to be sensitive to the quality of synthetic speech, but there were some uncertainties regarding exactly what was being measured. The current study resolves some of those uncertainties. Additionally, we investigate how the pupil dilates when listening to synthetic speech in the presence of speech-shaped noise. Our results show that, in quiet listening conditions, pupil dilation does not reflect listening effort but rather attention and engagement. In noisy conditions, increased pupil dilation indicates that listening effort increases as signal-to-noise ratio decreases, under all conditions tested.

[1]  A. Stewart,et al.  Listening effort and fatigue: What exactly are we measuring? A British Society of Audiology Cognition in Hearing Special Interest Group ‘white paper’ , 2014, International journal of audiology.

[2]  A. Zekveld,et al.  Cognitive processing load across a wide range of listening conditions: insights from pupillometry. , 2014, Psychophysiology.

[3]  Stefanie E. Kuchinsky,et al.  Best Practices and Advice for Using Pupillometry to Measure Listening Effort: An Introduction for Those Who Want to Get Started , 2018, Trends in hearing.

[4]  Simon King,et al.  Using Pupillometry to Measure the Cognitive Load of Synthetic Speech , 2018, INTERSPEECH.

[5]  Yannis Stylianou,et al.  Evaluating the intelligibility benefit of speech modifications in known noise conditions , 2013, Speech Commun..

[6]  E. Hess,et al.  Pupil Size in Relation to Mental Activity during Simple Problem-Solving , 1964, Science.

[7]  A. Zekveld,et al.  Cognitive Load During Speech Perception in Noise: The Influence of Age, Hearing Loss, and Cognition on the Pupil Response , 2011, Ear and hearing.

[8]  Simon King,et al.  Impact of Different Speech Types on Listening Effort , 2018, INTERSPEECH.

[9]  T. Lunner,et al.  The Ease of Language Understanding (ELU) model: theoretical, empirical, and clinical advances , 2013, Front. Syst. Neurosci..

[10]  J. Beatty Task-evoked pupillary responses, processing load, and the structure of processing resources. , 1982 .

[11]  Joshua F. Wiley,et al.  Growth Curve Analysis and Visualization Using R , 2014 .

[12]  S. King,et al.  The Blizzard Challenge 2011 , 2011 .

[13]  A. Zekveld,et al.  Pupil Response as an Indication of Effortful Listening: The Influence of Sentence Intelligibility , 2010, Ear and hearing.

[14]  A. Zekveld,et al.  Pupil Dilation Uncovers Extra Listening Effort in the Presence of a Single-Talker Masker , 2012, Ear and hearing.

[15]  P. Toffanin,et al.  The Timing and Effort of Lexical Access in Natural and Degraded Speech , 2016, Front. Psychol..