Paper information - Large scale data acquisition of simultaneous MRI and speech

We describe an arrangement for simultaneously recording speech and vocal tract geometry in patients undergoing surgery in the vocal tract area. The experimental design is considered from an articulatory phonetic point of view. Speech signals are recorded with an acoustic-electrical arrangement while the vocal tract is imaged with MRI. A MATLAB-based system controls the timing of the speech recording and the MR image acquisition. Acoustic MRI noise is removed from the speech signals by an adaptive signal processing algorithm. Finally, a vowel data set from pilot experiments is compared qualitatively with validation data from an anechoic chamber and with Helmholtz resonances of the vocal tract volume computed using FEM.
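The abstract does not name the adaptive algorithm used for noise removal. As a rough illustration of the general idea only, the sketch below uses a normalized LMS (NLMS) canceller, which is a stand-in and not necessarily the authors' method. It assumes a separate noise-only reference channel is available; the function names, filter length, step size, and the synthetic noise path in `demo` are all hypothetical.

```python
import math
import random


def nlms_cancel(primary, reference, taps=8, mu=0.1, eps=1e-8):
    """Normalized LMS noise canceller (illustrative stand-in).

    primary   : speech plus noise samples (assumed speech channel)
    reference : noise-only samples correlated with the noise in `primary`
    Returns the error signal, which serves as the cleaned speech estimate.
    """
    w = [0.0] * taps    # adaptive FIR weights
    buf = [0.0] * taps  # most recent reference samples, newest first
    cleaned = []
    for d, x in zip(primary, reference):
        buf = [x] + buf[:-1]
        y = sum(wi * bi for wi, bi in zip(w, buf))  # current noise estimate
        e = d - y                                   # cleaned sample
        norm = eps + sum(b * b for b in buf)        # reference power
        w = [wi + mu * e * bi / norm for wi, bi in zip(w, buf)]
        cleaned.append(e)
    return cleaned


def mse(a, b):
    """Mean squared error between two equally long sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def demo(n=4000, seed=0):
    """Synthetic test: a sine 'vowel' buried in filtered white noise."""
    rng = random.Random(seed)
    speech = [0.5 * math.sin(2 * math.pi * 0.01 * k) for k in range(n)]
    noise = [rng.gauss(0.0, 1.0) for _ in range(n)]
    path = [0.8, -0.3, 0.1]  # hypothetical path from noise source to speech channel
    leaked = [
        sum(path[j] * noise[k - j] for j in range(len(path)) if k - j >= 0)
        for k in range(n)
    ]
    primary = [s + v for s, v in zip(speech, leaked)]
    cleaned = nlms_cancel(primary, noise)
    half = n // 2  # evaluate after the filter has had time to adapt
    before = mse(primary[half:], speech[half:])
    after = mse(cleaned[half:], speech[half:])
    return before, after


if __name__ == "__main__":
    before, after = demo()
    print(f"MSE before: {before:.4f}, after: {after:.4f}")
```

On this synthetic signal the residual error in the second half of the recording should fall well below the error of the uncleaned channel. Real MRI noise is strongly periodic and much louder than speech, so a practical system would need tuning that this sketch does not attempt.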
