Discrimination of Vowels with a Multi-finger Tactual Display

The present study compared the performance of an ideal observer and a human participant in a vowel discrimination task using a speech-to-touch coding scheme designed for a three finger tactual display. The coding scheme extracted speech features and presented them as high-frequency vibrational and low-frequency motional waveforms. The high-frequency vibrations presented crude spectral information from three distinct speech bands on three fingerpads of the left hand. The same information was presented, redundantly, by the low-frequency waveforms. Performance of the ideal observer, where only high-frequency vibrational signals were considered, was evaluated by a signal detection theory using several tokens of a pair of vowels. Results showed that the acoustic cues corresponding to the first two formants were sufficient for discrimination of a seven vowel stimulus set. The participant was then tested in an absolute identification task with 640 tokens of ten non-diphthong vowels spoken by two female speakers. Both high- and low-frequency waveforms were presented to the participant. Discrimination scores for each pair of vowel were similar to the best scores obtained with the ideal observer indicating that the coding scheme was effective and the participant acted like an ideal observer.

[1]  W. M. Rabinowitz,et al.  Information transmission with a multifinger tactual display , 1999, Perception & psychophysics.

[2]  J M Weisenberger,et al.  Evaluation of a principal-components tactile aid for the hearing-impaired. , 1991, The Journal of the Acoustical Society of America.

[3]  I. Summers Tactile Aids for the Hearing Impaired , 1992 .

[4]  W. R. Garner Uncertainty and structure as psychological concepts , 1975 .

[5]  J. Hillenbrand,et al.  Acoustic characteristics of American English vowels. , 1994, The Journal of the Acoustical Society of America.

[6]  Roger W. Cholewiak,et al.  Sensory and Physiological Bases of Touch , 1991 .

[7]  Robert H. Gault,et al.  Progress in experiments on tactual interpretation of oral speech. , 1924 .

[8]  S. Bolanowski,et al.  Four channels mediate the mechanical aspects of touch. , 1988, The Journal of the Acoustical Society of America.

[9]  Hong Z. Tan,et al.  A Two DOF Controller for a Multi-Finger Tactual Display Using a Loop-Shaping Technique , 2004 .

[10]  Hanfeng Yuan,et al.  Tactual display of consonant voicing to supplement lipreading , 2005, The Journal of the Acoustical Society of America.

[11]  C M Reed,et al.  Research on the Tadoma method of speech communication. , 1983, The Journal of the Acoustical Society of America.

[12]  S. Zahorian Principal‐Components Analysis for Low Redundancy Encoding of Speech Spectra , 1979 .

[13]  L D Braida,et al.  Effects of token variability on our ability to distinguish between vowels , 1998, Perception & psychophysics.

[14]  P K Kuhl,et al.  The contribution of fundamental frequency, amplitude envelope, and voicing duration cues to speechreading in normal-hearing subjects. , 1985, The Journal of the Acoustical Society of America.

[15]  Neil A. Macmillan,et al.  Detection Theory: A User's Guide , 1991 .

[16]  Hong Z. Tan,et al.  A New Mult-Finger Tactual Display , 1996, Dynamic Systems and Control.

[17]  H. Tan,et al.  Frequency and amplitude discrimination along the kinestheticcutaneous continuum in the presence of masking stimuli. , 2006, The Journal of the Acoustical Society of America.

[18]  N I Durlach,et al.  Tactile communication of speech: comparison of two computer-based displays. , 1988, Journal of rehabilitation research and development.

[19]  H Levitt,et al.  The use of tactile supplements in lipreading Swedish and English: a single-subject study. , 2000, Journal of Speech, Language and Hearing Research.

[20]  Nathaniel I. Durlach,et al.  Note on Information Transfer Rates in Human Communication , 1998, Presence.