Classification of auditory steady-state responses to speech data

This paper presents an auditory steady-state response (ASSR)-based brain-computer interface (BCI) that uses artificial speech data synthesized by a text-to-speech (TTS) system. Many ASSR-based BCI systems that use pure tone (sinusoid) or an abrupt beep as auditory stimuli have been proposed. However, while these systems have achieved high classification accuracy, our group has found that participants find the monotonous stimuli to be hypnotic and annoying. Practical BCI systems should have user-friendly designs. Thus, as a first step, we develop a new experimental BCI system in which we change the type of stimuli from pure tone carrier to artificial speech data, which are clear enough for participants to recognize the meaning of sentences. With eight participants, the average accuracy of the system is 78.6 ± 5.32% for the binary classification problem. This suggests that the proposed system can be used in practical BCI.

[1]  D H Klatt,et al.  Review of text-to-speech conversion for English. , 1987, The Journal of the Acoustical Society of America.

[2]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[3]  D. O'Shaughnessy,et al.  Auditory BCIs for Visually Impaired Users : Should Developers Worry About the Quality of Text-to-Speech Readers ? , 2013 .

[4]  Toshihisa Tanaka,et al.  EEG auditory steady state responses classification for the novel BCI , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[5]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[6]  T. Picton,et al.  Human auditory steady-state responses: Respuestas auditivas de estado estable en humanos , 2003, International journal of audiology.

[7]  William H. Press,et al.  Numerical recipes in C , 2002 .

[8]  G Pfurtscheller,et al.  EEG-based communication: improved accuracy by response verification. , 1998, IEEE transactions on rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[9]  K.-R. Muller,et al.  Optimizing Spatial filters for Robust EEG Single-Trial Analysis , 2008, IEEE Signal Processing Magazine.

[10]  W. Klimesch,et al.  EEG alpha oscillations: The inhibition–timing hypothesis , 2007, Brain Research Reviews.

[11]  John J. Foxe,et al.  Visual spatial attention tracking using high-density SSVEP data for independent brain-computer communication , 2005, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[12]  Chang-Hwan Im,et al.  Classification of selective attention to auditory stimuli: Toward vision-free brain–computer interfacing , 2011, Journal of Neuroscience Methods.

[13]  Shoji Makino,et al.  Auditory steady-state response stimuli based BCI application - the optimization of the stimuli types and lengths , 2012, Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference.