Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch perturbation

Auditory feedback plays an essential role in the regulation of the fundamental frequency of voiced sounds. The fundamental frequency also responds to auditory stimulation other than the speaker’s voice. We propose to use this response of the fundamental frequency of sustained vowels to frequency-modulated test signals for investigating involuntary control of voice pitch. This involuntary response is difficult to identify and isolate by the conventional paradigm, which uses step-shaped pitch perturbation. We recently developed a versatile measurement method using a mixture of orthogonal sequences made from a set of extended time-stretched pulses (TSP). In this article, we extended our approach and designed a set of test signals using the mixture to modulate the fundamental frequency of artificial signals. For testing the response, the experimenter presents the modulated signal aurally while the subject is voicing sustained vowels. We developed a tool for conducting this test quickly and interactively. We make the tool available as an open-source and also provide executable GUI-based applications. Preliminary tests revealed that the proposed method consistently provides compensatory responses with about 100 ms latency, representing involuntary control. Finally, we discuss future applications of the proposed method for objective and non-invasive auditory response measurements.

[1]  Alain de Cheveigné,et al.  Pitch perception models , 2005 .

[2]  Tomoki Toda,et al.  A New Cosine Series Antialiasing Function and its Application to Aliasing-Free Glottal Source Models for Speech and Singing Synthesis , 2017, INTERSPEECH.

[3]  Hideki Kawahara,et al.  Effects of auditory feedback on F0 trajectory generation , 1996, ICSLP.

[4]  Anders Löfqvist,et al.  Toward a consensus on symbolic notation of harmonics, resonances, and formants in vocalization. , 2015, The Journal of the Acoustical Society of America.

[5]  Robert J. Zatorre,et al.  Neural networks involved in voluntary and involuntary vocal pitch regulation in experienced singers , 2010, Neuropsychologia.

[6]  Angelo Farina,et al.  Simultaneous Measurement of Impulse Response and Distortion with a Swept-Sine Technique , 2000 .

[7]  C. Larson,et al.  ERP correlates of pitch error detection in complex Tone and Voice auditory feedback with missing fundamental , 2012, Brain Research.

[8]  Malcolm J. Hawksford,et al.  Distortion immunity of MLS-derived impulse response measurements , 1993 .

[9]  Ingo R. Titze,et al.  Principles of voice production , 1994 .

[10]  Dario D'Orazio,et al.  Impulse Responses Measured with MLS or Swept-Sine Signals Applied to Architectural Acoustics: An In-depth Analysis of the Two Methods and Some Case Studies of Measurements Inside Theaters , 2015 .

[11]  C. Stepp,et al.  Relationships between vocal pitch perception and production: a developmental perspective , 2020, Scientific Reports.

[12]  E. Owens,et al.  An Introduction to the Psychology of Hearing , 1997 .

[13]  H. Brumm,et al.  The evolution of the Lombard effect: 100 years of psychoacoustic research , 2011 .

[14]  Jeffery A. Jones,et al.  Auditory-motor mapping for pitch control in singers and nonsingers , 2008, Experimental Brain Research.

[15]  Hideki Kawahara,et al.  Cascaded All-Pass Filters with Randomized Center Frequencies and Phase Polarity for Acoustic and Speech Measurement and Data Augmentation , 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Oleg Korzyukov,et al.  Vocal and Neural Responses to Unexpected Changes in Voice Pitch Auditory Feedback During Register Transitions. , 2016, Journal of voice : official journal of the Voice Foundation.

[17]  Donald A. Robin,et al.  Sensory Processing: Advances in Understanding Structure and Function of Pitch-Shifted Auditory Feedback in Voice Control , 2016 .

[18]  Rita R. Patel,et al.  Recommended Protocols for Instrumental Assessment of Voice: American Speech-Language-Hearing Association Expert Panel to Develop a Protocol for Instrumental Assessment of Vocal Function. , 2018, American journal of speech-language pathology.

[19]  Hideki Kawahara,et al.  Interactions between speech production and perception under auditory feedback perturbations on fundamental frequencies , 1994 .

[20]  Ciara Leydon,et al.  The role of auditory feedback in sustaining vocal vibrato. , 2003, The Journal of the Acoustical Society of America.

[21]  C. Larson,et al.  Voice F0 responses to pitch-shifted auditory feedback: a preliminary study. , 1997, Journal of voice : official journal of the Voice Foundation.

[22]  N. Aoshima Computer‐generated pulse signal applied for sound measurement , 1981 .

[23]  M. Schroeder Integrated‐impulse method measuring sound decay without using impulses , 1979 .

[24]  R. Behroozmand,et al.  Modulation of vocal pitch control through high-definition transcranial direct current stimulation of the left ventral motor cortex , 2020, Experimental Brain Research.

[25]  Yi Xu,et al.  Maximum speed of pitch change and how it may relate to speech. , 2002, The Journal of the Acoustical Society of America.

[26]  Guy-Bart Stan,et al.  Comparison of different impulse response measurement techniques , 2002 .

[27]  C. Larson,et al.  Instructing subjects to make a voluntary response reveals the presence of two components to the audio-vocal reflex , 1999, Experimental Brain Research.

[28]  Jeffery A. Jones,et al.  A Causal Role of the Cerebellum in Auditory Feedback Control of Vocal Production , 2021, The Cerebellum.

[29]  R. Patterson,et al.  A pulse ribbon model of monaural phase perception. , 1987, The Journal of the Acoustical Society of America.

[30]  E. Chang,et al.  Human cortical sensorimotor network underlying feedback control of vocal pitch , 2013, Proceedings of the National Academy of Sciences.

[31]  Jason A. Tourville,et al.  Neural mechanisms underlying auditory feedback control of speech , 2008, NeuroImage.

[32]  Hideki Kawahara,et al.  Simultaneous measurement of time-invariant linear and nonlinear, and random and extra responses using frequency domain variant of velvet noise , 2020, ArXiv.

[33]  B. Moore An introduction to the psychology of hearing, 3rd ed. , 1989 .

[34]  Ingo R Titze,et al.  A reflex resonance model of vocal vibrato. , 2002, The Journal of the Acoustical Society of America.

[35]  Jay J Bauer,et al.  Voice responses to changes in pitch of voice or tone auditory feedback. , 2005, The Journal of the Acoustical Society of America.

[36]  Manfred R. Schroeder,et al.  Synthesis of low-peak-factor signals and binary sequences with low autocorrelation (Corresp.) , 1970, IEEE Trans. Inf. Theory.