Intelligibility of locally time-reversed speech: A multilingual comparison

A set of experiments was performed to make a cross-language comparison of intelligibility of locally time-reversed speech, employing a total of 117 native listeners of English, German, Japanese, and Mandarin Chinese. The experiments enabled to examine whether the languages of three types of timing—stress-, syllable-, and mora-timed languages—exhibit different trends in intelligibility, depending on the duration of the segments that were temporally reversed. The results showed a strikingly similar trend across languages, especially when the time axis of segment duration was normalised with respect to the deviation of a talker’s speech rate from the average in each language. This similarity is somewhat surprising given the systematic differences in vocalic proportions characterising the languages studied which had been shown in previous research and were largely replicated with the present speech material. These findings suggest that a universal temporal window shorter than 20–40 ms plays a crucial role in perceiving locally time-reversed speech by working as a buffer in which temporal reorganisation can take place with regard to lexical and semantic processing.

[1]  David Poeppel,et al.  The analysis of speech in different temporal integration windows: cerebral lateralization as 'asymmetric sampling in time' , 2003, Speech Commun..

[2]  D. Poeppel,et al.  The cortical organization of speech processing , 2007, Nature Reviews Neuroscience.

[3]  K. Saberi,et al.  Cognitive restoration of reversed speech , 1999, Nature.

[4]  S. Rosen Temporal information in speech: acoustic, auditory and linguistic aspects. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[5]  Charles Darwin,et al.  The Perception of Speech , 1961, Nature.

[6]  Aniruddh D. Patel,et al.  Temporal modulations in speech and music , 2017, Neuroscience & Biobehavioral Reviews.

[7]  F. Ramus,et al.  Correlates of linguistic rhythm in the speech signal , 1999, Cognition.

[8]  Y. Nakajima,et al.  Acoustic Analyses of Speech Sounds and Rhythms in Japanese- and English-Learning Infants , 2013, Front. Psychol..

[9]  Tamara C. Cristescu,et al.  Auditory language comprehension of temporally reversed speech signals in native and non-native speakers. , 2008, Acta Neurobiologiae Experimentalis.

[10]  Steven Greenberg,et al.  The relation between speech intelligibility and the complex modulation spectrum , 2001, INTERSPEECH.

[11]  Wolfgang Ellermeier,et al.  The psychoacoustics of the irrelevant sound effect , 2014 .

[12]  宏明 高橋 逆音声(Reversed Speech) , 1962 .

[13]  Wouter A Dreschler,et al.  Release from informational masking by time reversal of native and non-native interfering speech. , 2005, The Journal of the Acoustical Society of America.

[14]  Michael Kiefte,et al.  Cochlea-scaled spectral entropy predicts rate-invariant intelligibility of temporally distorted sentences. , 2010, The Journal of the Acoustical Society of America.

[15]  Dylan M. Jones,et al.  Disruption of proofreading by irrelevant speech: Effects of attention, arousal or memory? , 1990 .

[16]  David Poeppel,et al.  Testing multi-scale processing in the auditory system , 2016, Scientific Reports.

[17]  A. Samuel,et al.  Some people are “More Lexical” than others , 2016, Cognition.

[18]  Yoshitaka Nakajima,et al.  An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech , 2017, Scientific reports.

[19]  F. Meunier,et al.  Mesure d'intelligibilité de segments de parole à l'envers en français , 2002 .

[20]  W. Meyer‐Eppler Reversed Speech and Repetition Systems as Means of Phonetic Research , 1950 .

[21]  David Poeppel,et al.  Cortical oscillations and speech processing: emerging computational principles and operations , 2012, Nature Neuroscience.

[22]  David Poeppel,et al.  Discrimination of speech stimuli based on neuronal response phase patterns depends on acoustics but not comprehension. , 2010, Journal of neurophysiology.

[23]  Lisa D. Sanders,et al.  Local and global auditory processing: Behavioral and ERP evidence , 2007, Neuropsychologia.

[24]  Paul Boersma,et al.  Praat: doing phonetics by computer , 2003 .

[25]  G. A. Miller The Perception of Speech. , 1951 .

[26]  R. Remez,et al.  Modulation sensitivity in the perceptual organization of speech , 2013, Attention, Perception, & Psychophysics.

[27]  Steven Greenberg,et al.  What are the Essential Cues for Understanding Spoken Language? , 2001, IEICE Trans. Inf. Syst..

[28]  V. Rich Personal communication , 1989, Nature.

[29]  Steven Greenberg,et al.  Multi-time resolution analysis of speech: evidence from psychophysics , 2015, Front. Neurosci..

[30]  Robert Fuchs,et al.  9. A sonority-based account of speech rhythm in Chinese learners of English , 2015 .

[31]  Robert L. Goldstone,et al.  Similarity-Dissimilarity Competition in Disjunctive Classification Tasks , 2013, Front. Psychology.

[32]  Yukari Hirata,et al.  Training native English speakers to perceive Japanese length contrasts in word versus sentence contexts. , 2004, The Journal of the Acoustical Society of America.