Database of Volumetric and Real-Time Vocal Tract MRI for Speech Science

We present the USC Speech and Vocal Tract Morphology MRI Database, a 17-speaker magnetic resonance imaging database for speech research. The database consists of real-time magnetic resonance images (rtMRI) of dynamic vocal tract shaping, denoised audio recorded simultaneously with rtMRI, and 3D volumetric MRI of vocal tract shapes during sustained speech sounds. We acquired 2D real-time MRI of vocal tract shaping during consonant-vowel-consonant sequences, vowelconsonant-vowel sequences, read passages, and spontaneous speech. We acquired 3D volumetric MRI of the full set of vowels and continuant consonants of American English. Each 3D volumetric MRI was acquired in one 7-second scan in which the participant sustained the sound. This is the first database to combine rtMRI of dynamic vocal tract shaping and 3D volumetric MRI of the entire vocal tract. The database provides a unique resource with which to examine the relationship between vocal tract morphology and vocal tract function. The USC Speech and Vocal Tract Morphology MRI Database is provided free for research use at http://sail.usc.edu/span/morphdb.

[1]  Viviana Toro-Ibacache,et al.  The relationship between skull morphology, masticatory muscle force and cranial skeletal deformation during biting. , 2016, Annals of anatomy = Anatomischer Anzeiger : official organ of the Anatomische Gesellschaft.

[2]  Shrikanth Narayanan,et al.  Morphological variation in the adult hard palate and posterior pharyngeal wall. , 2013, Journal of speech, language, and hearing research : JSLHR.

[3]  L. Puymérail,et al.  Analysis of Hyoid Bone Using 3D Geometric Morphometrics: An Anatomical Study and Discussion of Potential Clinical Implications , 2013, Dysphagia.

[4]  Marzena Wylezinska,et al.  Speech MRI: morphology and function. , 2014, Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics.

[5]  Shrikanth Narayanan,et al.  Interspeaker variability in hard palate morphology and vowel production. , 2013, Journal of speech, language, and hearing research : JSLHR.

[6]  Raanan Arens,et al.  Identification of upper airway anatomic risk factors for obstructive sleep apnea with volumetric magnetic resonance imaging. , 2003, American journal of respiratory and critical care medicine.

[7]  Shrikanth Narayanan,et al.  Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans. , 2006, The Journal of the Acoustical Society of America.

[8]  Raymond D. Kent,et al.  X‐ray microbeam speech production database , 1990 .

[9]  Shrikanth S. Narayanan,et al.  Accelerated three‐dimensional upper airway MRI using compressed sensing , 2009, Magnetic resonance in medicine.

[10]  Shrikanth S. Narayanan,et al.  Flexible retrospective selection of temporal resolution in real‐time speech MRI using a golden‐ratio spiral view order , 2011, Magnetic resonance in medicine.

[11]  Thomas F. Quatieri,et al.  Relating Estimated Cyclic Spectral Peak Frequency to Measured Epilarynx Length Using Magnetic Resonance Imaging , 2016, INTERSPEECH.

[12]  B. Chrcanovic,et al.  Morphological variation in dentate and edentulous human mandibles , 2011, Surgical and Radiologic Anatomy.

[13]  Shrikanth Narayanan,et al.  An approach to real-time magnetic resonance imaging for speech production. , 2003, The Journal of the Acoustical Society of America.

[14]  Shrikanth S. Narayanan,et al.  Accelerated 3D MRI of vocal tract shaping using compressed sensing and parallel imaging , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Shrikanth S. Narayanan,et al.  State-of-the-Art MRI Protocol for Comprehensive Assessment of Vocal Tract Structure and Function , 2016, INTERSPEECH.

[16]  Shrikanth S. Narayanan,et al.  Automatic Classification of Palatal and Pharyngeal Wall Shape Categories from Speech Acoustics and Inverted Articulatory Signals , 2013 .

[17]  Raymond D. Kent,et al.  Anatomic development of the oral and pharyngeal portions of the vocal tract: an imaging study. , 2009, The Journal of the Acoustical Society of America.

[18]  J.M. Santos,et al.  Flexible real-time magnetic resonance imaging framework , 2004, The 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[19]  Shrikanth Narayanan,et al.  Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research (TC). , 2014, The Journal of the Acoustical Society of America.

[20]  Shrikanth S. Narayanan,et al.  Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research , 2016, APSIPA Transactions on Signal and Information Processing.

[21]  Shrikanth S. Narayanan,et al.  Characterizing Vocal Tract Dynamics Across Speakers Using Real-Time MRI , 2016, INTERSPEECH.