How to precisely measure the volume velocity transfer function of physical vocal tract models by external excitation

Recently, 3D printing has been increasingly used to create physical models of the vocal tract with geometries obtained from magnetic resonance imaging. These printed models allow the vocal tract transfer function to be measured, which cannot be done reliably in vivo. The transfer functions enable the detailed examination of the acoustic effects of specific articulatory strategies in speaking and singing, and the validation of acoustic plane-wave models for realistic vocal tract geometries in articulatory speech synthesis. To measure the acoustic transfer function of 3D-printed models, two techniques have been described: (1) excitation of the models with a broadband sound source at the glottis and measurement of the sound pressure radiated from the lips, and (2) excitation of the models with an external source in front of the lips and measurement of the sound pressure inside the models at the glottal end. The former method is more frequently used and more intuitive because of its similarity to speech production. However, the latter method avoids the intricate problem of constructing a suitable broadband glottal source and is therefore more effective. It has been shown to yield a transfer function that is similar, but not exactly equal, to the volume velocity transfer function between the glottis and the lips, which is usually used to characterize vocal tract acoustics. Here, we revisit this method and show, both theoretically and experimentally, how it can be extended to yield the precise volume velocity transfer function of the vocal tract.
