Hearing the Moment: Measures and Models of the Perceptual Centre

The perceptual centre (P-centre) is the hypothetical specific moment at which a brief event is perceived to occur. Several P-centre models are described in the literature and the first collective implementation and rigorous evaluation of these models using a common corpus is described in this thesis, thus addressing a significant open question: which model should one use? The results indicate that none of the models reliably handles all sound types. Possibly this is because the data for model development are too sparse, because inconsistent measurement methods have been used, or because the assumptions underlying the measurement methods are untested. To address this, measurement methods are reviewed and two of them, rhythm adjustment and tap asynchrony, are evaluated alongside a new method based on the phase correction response (PCR) in a synchronized tapping task. Rhythm adjustment and the PCR method yielded consistent P-centre estimates and showed no evidence of P-centre context dependence. Moreover, the PCR method appears most time efficient for generating accurate P-centre estimates. Additionally, the magnitude of the PCR is shown to vary systematically with the onset complexity of speech sounds, which presumably reflects the perceived clarity of a sound’s P-centre. The ideal outcome of any P-centre measurement technique is to detect the true moment of perceived event occurrence. To this end a novel P-centre measurement method, based on auditory evoked potentials, is explored as a possible objective alternative to the conventional approaches examined earlier. The results are encouraging and suggest that a neuroelectric correlate of the P-centre does exist, thus opening up a new avenue of P-centre research. Finally, an up to date and comprehensive review of the P-centre is included, integrating recent findings and reappraising previous research. The main open questions are identified, particularly those most relevant to P-centre modelling.

[1]  D H Whalen,et al.  Perceived timing is produced timing: A reply to Howell , 1988, Perception & psychophysics.

[2]  T. Zanto,et al.  Neural correlates of rhythmic expectancy , 2006 .

[3]  Douglas H. Whalen,et al.  The syllable’s rhyme affects its P-center as a unit , 1988 .

[4]  J. Pernier,et al.  Stimulus Specificity of Phase-Locked and Non-Phase-Locked 40 Hz Visual Responses in Human , 1996, The Journal of Neuroscience.

[5]  R Efron,et al.  The minimum duration of a perception. , 1970, Neuropsychologia.

[6]  Guy Madison,et al.  Human sensorimotor tracking of continuous subliminal deviations from isochrony , 2004, Neuroscience Letters.

[7]  A. Gregory Perception of clicks in music. , 1977, Perception & psychophysics.

[8]  B. Repp Sensorimotor synchronization: A review of the tapping literature , 2005, Psychonomic bulletin & review.

[9]  Bruno H. Repp Automaticity and voluntary control of phase correction following event onset shifts in sensorimotor synchronization. , 2002 .

[10]  H. Schütte,et al.  Ein funktionsschema für die wahrnehmung eines gleichmäßigen rhythmus in schallimpulsfolgen , 1978, Biological Cybernetics.

[11]  Karl J. Friston,et al.  Frequency-Specific Coupling in the Cortico-Cerebellar Auditory System , 2008, Journal of neurophysiology.

[12]  Hans Forssberg,et al.  Listening to rhythms activates motor and premotor cortices , 2009, Cortex.

[13]  J. W. Gordon The perceptual attack time of musical tones. , 1987, The Journal of the Acoustical Society of America.

[14]  Theodore P. Zanto,et al.  Gamma-Band Responses to Perturbed Auditory Sequences: Evidence for Synchronization of Perceptual Processes , 2005 .

[15]  Elissa L. Newport,et al.  Segmenting nonsense: an event-related potential index of perceived onsets in continuous speech , 2002, Nature Neuroscience.

[16]  W. Klimesch,et al.  Are event-related potential components generated by phase resetting of brain oscillations? A critical discussion , 2007, Neuroscience.

[17]  Ernst Piippel A hierarchical model of temporal perception , 1997 .

[18]  S. Grondin,et al.  From physical time to the first and second moments of psychological time. , 2001, Psychological bulletin.

[19]  Josep Marco-Pallarés,et al.  Modulation of spectral power and of phase resetting of EEG contributes differentially to the generation of auditory event-related potentials , 2006, NeuroImage.

[20]  K. Tremblay,et al.  Test-Retest Reliability of Cortical Evoked Potentials Using Naturally Produced Speech Sounds , 2003, Ear and hearing.

[21]  M. Alegre,et al.  Cortical gamma activity during auditory tone omission provides evidence for the involvement of oscillatory activity in top-down processing , 2006, Experimental Brain Research.

[22]  Hugo Fastl,et al.  Psychoacoustics Facts and Models. 2nd updated edition , 1999 .

[23]  Julio Artieda,et al.  Activation of Human Cerebral and Cerebellar Cortex by Auditory Stimulation at 40 Hz , 2002, The Journal of Neuroscience.

[24]  G. Allen The Location of Rhythmic Stress Beats in English: an Experimental Study I , 1972, Language and speech.

[25]  W. Klimesch,et al.  Event-related phase reorganization may explain evoked neural dynamics , 2007, Neuroscience & Biobehavioral Reviews.

[26]  Seung Kee Han,et al.  Phase analysis of single-trial EEGs: Phase resetting of alpha and theta rhythms , 2006, Neurocomputing.

[27]  R. Efron,et al.  Effect of stimulus duration on perceptual onset and offset latencies , 1970 .

[28]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[29]  Mark G. Grotefend The Perception Is... , 2009 .

[30]  Hannu Tiitinen,et al.  Auditory event-related responses are generated independently of ongoing brain activity , 2005, NeuroImage.

[31]  Anders Löfqvist,et al.  THE ACOUSTICS AND KINEMATICS OF REGULARLY TIMED SPEECH: A DATABASE AND METHOD FOR THE STUDY OF THE P-CENTER PROBLEM , 1999 .

[32]  J. Morton,et al.  Perceptual centers (P-centers). , 1976 .

[33]  J Mates,et al.  The Perceptual Centre of a Stimulus as the Cue for Synchronization to a Metronome: Evidence from Asynchronies , 1995, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[34]  A.-P. Benguerel,et al.  Time-warping and the perception of rhythm in speech , 1986 .

[35]  Peter E Keller,et al.  Sensorimotor synchronization with chords containing tone-onset asynchronies , 2007, Perception & psychophysics.

[36]  Nick Collins,et al.  Investigating computational models of perceptual attack time , 2006 .

[37]  K. Wernecke,et al.  Objective detection of transiently evoked otoacoustic emissions , 2001, Scandinavian audiology.

[38]  C. Fowler,et al.  P-Center Judgments Are Generally Insensitive to the Instructions Given , 1989, Phonetica.

[39]  P. Fraisse Perception and estimation of time. , 1984, Annual review of psychology.

[41]  F. Varela,et al.  Perception's shadow: long-distance synchronization of human brain activity , 1999, Nature.

[42]  M. Döllinger,et al.  The Influence of Temporal Stimulus Changes on Speech-Evoked Potentials Revealed by Approximations of Tone-Evoked Waveforms , 2009, Ear and hearing.

[43]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[44]  Loudness functions for long and short tones , 2001 .

[45]  J. Eggermont Between sound and perception: reviewing the search for a neural code , 2001, Hearing Research.

[46]  Rudi C. Villing,et al.  Automatic Blind Syllable Segmentation for Continuous Speech , 2004 .

[47]  P Howell,et al.  An Acoustic Determinant of Perceived and Produced Anisochrony , 1984 .

[48]  C. Fowler,et al.  THE CONTRIBUTION OF AMPLITUDE TO THE PERCEPTION OF ISOCHRONY , 2009 .

[49]  C. Fowler Converging sources of evidence on spoken and perceived rhythms of speech: cyclic production of vowels in monosyllabic stress feet. , 1983, Journal of experimental psychology. General.

[50]  J. Eggermont,et al.  The Neurophysiology of Auditory Perception: From Single Units to Evoked Potentials , 2002, Audiology and Neurotology.

[51]  C. Palmer,et al.  Synchronization of Timing and Motion 435 , 2022 .

[52]  Jeffrey M. Zacks,et al.  Event perception: a mind-brain perspective. , 2007, Psychological bulletin.

[53]  S Buus,et al.  Temporal integration of loudness, loudness discrimination, and the form of the loudness function. , 1997, The Journal of the Acoustical Society of America.

[54]  R. Bakeman Recommended effect size statistics for repeated measures designs , 2005, Behavior research methods.

[55]  R. Schlauch,et al.  Duration discrimination and subjective duration for ramped and damped sounds. , 2001, The Journal of the Acoustical Society of America.

[56]  J. Algina,et al.  Generalized eta and omega squared statistics: measures of effect size for some common research designs. , 2003, Psychological methods.

[57]  J. Palva,et al.  Distinct Gamma-Band Evoked Responses to Speech and Non-Speech Sounds in Humans , 2002, The Journal of Neuroscience.

[58]  D. Poeppel,et al.  Phase Patterns of Neuronal Responses Reliably Discriminate Speech in Human Auditory Cortex , 2007, Neuron.

[59]  G. Aschersleben Temporal Control of Movements in Sensorimotor Synchronization , 2002, Brain and Cognition.

[60]  Bernd Pompino-Marschall Segments, syllables, and the perception of speech rate and rhythm , 1987, ECST.

[61]  R. Barry Evoked activity and EEG phase resetting in the genesis of auditory Go/NoGo ERPs , 2009, Biological Psychology.

[62]  Catalin V. Buhusi,et al.  What makes us tick? Functional and neural mechanisms of interval timing , 2005, Nature Reviews Neuroscience.

[63]  JC Seton,et al.  A psychophysical investigation of auditory rhythmic beat perception. , 1989 .

[64]  S. M. Marcus Acoustic determinants of perceptual center (P-center) location , 1981, Perception & psychophysics.

[65]  B H Repp,et al.  Detectability of duration and intensity increments in melody tones: A partial connection between music perception and performance , 1995, Perception & psychophysics.

[66]  R. Rasch,et al.  The perceptual onset of musical tones , 1981, Perception & psychophysics.

[67]  O. Bertrand,et al.  Oscillatory gamma activity in humans and its role in object representation , 1999, Trends in Cognitive Sciences.

[68]  Jeffrey M. Zacks,et al.  Event structure in perception and conception. , 2001, Psychological bulletin.

[69]  J. Snyder,et al.  Gamma-band activity reflects the metric structure of rhythmic tone sequences. , 2005, Brain research. Cognitive brain research.

[70]  C. Fowler,et al.  Some articulatory correlates of perceptual isochrony , 1980, Perception & psychophysics.

[71]  Bernd Pompino-Marschall,et al.  On the Psychoacoustic Nature of the P-Center Phenomenon , 1989 .

[72]  T. Sejnowski,et al.  Dynamic Brain Sources of Visual Evoked Responses , 2002, Science.

[73]  U. Hoppe,et al.  Contribution of Spectrotemporal Features on Auditory Event-Related Potentials Elicited by Consonant-Vowel Syllables , 2009, Ear and hearing.

[74]  Dirk Vorberg,et al.  Linear Phase Correction Models for Synchronization: Parameter Identification and Estimation of Parameters , 2002, Brain and Cognition.

[75]  C A Harsin,et al.  Perceptual-center modeling is affected by including acoustic rate-of-change modulations , 1997, Perception & Psychophysics.

[76]  S. Roux,et al.  Auditory evoked potentials to tones and syllables in adults: evidence of specific influence on N250 wave , 2005, Neuroscience Letters.

[77]  P Howell,et al.  Prediction of P-center location from the distribution of energy in the amplitude envelope: II , 1988, Perception & Psychophysics.

[78]  Kazuyoshi Yoshii,et al.  A robot uses its own microphone to synchronize its steps to musical beats while scatting and singing , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[79]  Carol A. Fowler,et al.  “Perceptual centers” in speech production and perception , 1979 .

[80]  D H Whalen,et al.  P-centers are unaffected by phonetic categorization , 1986, Perception & psychophysics.

[81]  Rudi C. Villing,et al.  P-Centre Extraction from Speech: the need for a more reliable measure , 2003 .

[82]  Guy Madison,et al.  On the limits of anisochrony in pulse attribution , 2002, Psychological research.

[83]  Gisa Aschersleben,et al.  A psychophysical approach to action timing , 2004 .

[84]  B. Moore,et al.  A Model of Loudness Applicable to Time-Varying Sounds , 2002 .

[85]  Stanley A. Gelfand,et al.  Hearing: An Introduction to Psychological and Physiological Acoustics, Fourth Edition , 1998 .

[86]  Ruth Rasch,et al.  Synchronization in performed ensemble music , 1979 .

[87]  Yoshitaka Nakajima,et al.  Auditory Isochrony: Time Shrinking and Temporal Patterns , 1995, Perception.

[88]  Simon Hanslmayr,et al.  Distinguishing the evoked response from phase reset: A comment to Mäkinen et al. , 2006, NeuroImage.

[89]  W. Klimesch,et al.  Alpha phase synchronization predicts P1 and N1 latency and amplitude size. , 2005, Cerebral cortex.

[90]  Manfred R. Schroeder,et al.  Synthesis of low-peak-factor signals and binary sequences with low autocorrelation (Corresp.) , 1970, IEEE Trans. Inf. Theory.

[91]  Albert S. Bregman,et al.  The Auditory Scene. (Book Reviews: Auditory Scene Analysis. The Perceptual Organization of Sound.) , 1990 .

[92]  I. Lehiste Rhythmic units and syntactic units in production and perception. , 1973, The Journal of the Acoustical Society of America.

[93]  J. Snyder,et al.  Tempo dependence of middle- and long-latency auditory responses: power and phase modulation of the EEG at multiple time-scales , 2004, Clinical Neurophysiology.

[94]  P. Heil,et al.  Temporal Integration of Sound Pressure Determines Thresholds of Auditory-Nerve Fibers , 2001, The Journal of Neuroscience.

[95]  K. Tremblay,et al.  Speech Evoked Potentials: From the Laboratory to the Clinic , 2008, Ear and hearing.

[96]  P. Heil,et al.  Auditory cortical onset responses revisited. I. First-spike timing. , 1997, Journal of neurophysiology.

[97]  Bruno H Repp,et al.  Automaticity and voluntary control of phase correction following event onset shifts in sensorimotor synchronization. , 2002, Journal of experimental psychology. Human perception and performance.

[98]  S. M. Mason,et al.  Evoked potentials and their clinical application , 2004 .

[99]  R. Luce,et al.  TESTING A NEW THEORY OF PSYCHOPHYSICAL SCALING : TEMPORAL LOUDNESS INTEGRATION , 2022 .

[100]  Rudi C. Villing,et al.  Performance Limits for Envelope based Automatic Syllable Segmentation , 2006 .

[101]  B. Merker,et al.  On the role and origin of isochrony in human rhythmic entrainment , 2009, Cortex.

[102]  Christo Pantev,et al.  The perception of coherent and non-coherent auditory objects: a signature in gamma frequency band , 2000, Hearing Research.

[103]  J. Marshall,et al.  Perceptual centres for Dutch digits. , 1980, Acta psychologica.

[104]  R Efron,et al.  The relationship between the duration of a stimulus and the duration of a perception. , 1970, Neuropsychologia.

[105]  Sophie K. Scott,et al.  The point of P-centres , 1998 .

[106]  I. Lehiste,et al.  Effect of Unstressed Affixes on Stress-Beat Location in Speech Production and Perception , 1987, Perceptual and motor skills.

[107]  Jeffrey M. Zacks,et al.  Segmentation in the perception and memory of events , 2008, Trends in Cognitive Sciences.

[108]  C A Fowler,et al.  Listeners do hear sounds, not tongues. , 1996, The Journal of the Acoustical Society of America.

[109]  P Howell,et al.  Prediction of P-center location from the distribution of energy in the amplitude envelope: I , 1988, Perception & psychophysics.

[110]  B. Merker Synchronous Chorusing and Human Origins , 2000 .

[111]  Kenneth de Jong,et al.  The correlation of P-center adjustments with articulatory and acoustic events , 1994 .

[112]  C. Hoequist,et al.  The Perceptual Center and Rhythm Categories , 1983, Language and speech.

[113]  Chris Chafe,et al.  The shape of an instant: measuring and modeling perceptual attack time with probability density functions (if a tree falls in the forest, when did 57 people hear it make a sound?) , 2008 .

[114]  C A Fowler,et al.  Perception of syllable timing by prebabbling infants. , 1986, The Journal of the Acoustical Society of America.

[115]  L. V. Noorden Temporal coherence in the perception of tone sequences , 1975 .

[116]  R. Bickford,et al.  Brain stem auditory evoked potentials: the use of noise estimate. , 1980, Electroencephalography and clinical neurophysiology.

[117]  Yin Fen Low,et al.  EEG phase reset due to auditory attention: an inverse time-scale approach , 2009, Physiological measurement.

[118]  C Elberling,et al.  Quality estimation of averaged auditory brainstem responses. , 1984, Scandinavian audiology.

[119]  Johan Sundberg,et al.  TIME DISCRIMINATION IN A MONOTONIC, ISOCHRONOUS SEQUENCE , 1995 .