Tactual display of consonant voicing to supplement lipreading

This research concerns the development and evaluation of a tactual display of consonant voicing designed to supplement the information available through lipreading for persons with profound hearing impairment. The voicing cue selected is the envelope-onset asynchrony derived from two filtered bands of speech (a low-pass band and a high-pass band). The amplitude envelope of each band was used to modulate a different carrier frequency, which in turn was delivered to one of the two fingers of a tactual stimulating device. Perceptual evaluations of speech reception through this tactual display included pairwise discrimination of consonants contrasting in voicing and identification of a set of 16 consonants under three conditions: the tactual cue alone (T), lipreading alone (L), and the two combined (L + T). The tactual display was highly effective for discriminating voicing at the segmental level and provided a substantial benefit to lipreading on the consonant-identification task. No such benefit of the tactual cue was observed, however, for lipreading of words in sentences, perhaps because of difficulties in integrating the tactual and visual cues and insufficient training on the more difficult task of connected-speech reception.
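
The signal path described above (two filtered bands, an amplitude envelope per band, an onset-time difference, and envelope-modulated carriers) can be illustrated with a minimal sketch. It is not the authors' implementation: the one-pole filters, the cutoff frequencies, the smoothing bandwidth, the onset threshold, the carrier frequency passed to `modulate`, and the sign convention (low-band onset minus high-band onset) are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import math


def one_pole_lowpass(x, cutoff_hz, fs):
    """First-order low-pass filter (an assumed stand-in for the real band filters)."""
    a = math.exp(-2.0 * math.pi * cutoff_hz / fs)
    y, prev = [], 0.0
    for s in x:
        prev = (1.0 - a) * s + a * prev
        y.append(prev)
    return y


def envelope(x, fs, smooth_hz=25.0):
    """Amplitude envelope: full-wave rectification followed by smoothing."""
    return one_pole_lowpass([abs(s) for s in x], smooth_hz, fs)


def onset_time(env, fs, threshold):
    """Time in seconds of the first sample at or above threshold, else None."""
    for i, v in enumerate(env):
        if v >= threshold:
            return i / fs
    return None


def envelope_onset_asynchrony(x, fs, low_cut=350.0, high_cut=3000.0, threshold=0.1):
    """Low-band onset minus high-band onset, in seconds (assumed sign convention).

    The cutoffs and threshold are illustrative values, not taken from the text.
    """
    low = one_pole_lowpass(x, low_cut, fs)
    # High-pass band formed as the residual of a low-pass at high_cut.
    high = [s - l for s, l in zip(x, one_pole_lowpass(x, high_cut, fs))]
    t_low = onset_time(envelope(low, fs), fs, threshold)
    t_high = onset_time(envelope(high, fs), fs, threshold)
    if t_low is None or t_high is None:
        return None
    return t_low - t_high


def modulate(env, carrier_hz, fs):
    """Use a band envelope to amplitude-modulate a sinusoidal carrier."""
    return [e * math.sin(2.0 * math.pi * carrier_hz * n / fs)
            for n, e in enumerate(env)]
```

Under these assumptions, a synthetic token whose high-frequency energy starts well before its low-frequency energy yields a large positive asynchrony, while a token whose two bands start together yields a value near zero. Each band's envelope would then be passed to `modulate` with its own carrier frequency to form the signal for one finger.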
