Experimental assessment of the tongue incompressibility hypothesis during speech production

The human tongue is an important organ for speech production. Its deformation and motion control the shape of the vocal tract significantly and thereby the acoustic properties of the speech signal produced. Thus, much effort in the speech research community has been directed towards its biomechanical modeling. A common assumption incorporated into many models of the human tongue is the tissue incompressibility hypothesis: the tongue is considered a muscular hydrostat and therefore its volume should remain constant regardless of its posture. To the best of our knowledge, experimental assessment of the constant volume hypothesis during actual speech production is limited. In this work, the aim is to experimentally assess the incompressibility hypothesis during actual speech production using a dataset of volumetric Magnetic Resonance (MR) images of 17 subjects sustaining contextualized continuants (27 continuants per subject). A seeded region growing based algorithm is used to segment the tongue and calculate its volume. Then, the intra-subject variability of the tongue volume along the different tongue postures is examined. Within the accuracy of our tongue volume measurements, our empirical results seem consistent with the incompressibility hypothesis.

[1]  Shrikanth Narayanan,et al.  Experimental evaluation of the constant tongue volume hypothesis , 2014 .

[2]  Catherine Pelachaud,et al.  Modeling and animating the human tongue during speech production , 1994, Proceedings of Computer Animation '94.

[3]  Yohan Payan,et al.  A control model of human tongue movements in speech , 1997, Biological Cybernetics.

[4]  Rolf Adams,et al.  Seeded Region Growing , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  William S. Levine,et al.  Controlling the shape of a muscular hydrostat : A tongue or tentacle , 2005 .

[6]  Shrikanth Narayanan,et al.  Semi-automatic modeling of tongue surfaces using volumetric structural MRI , 2011 .

[7]  Shrikanth S. Narayanan,et al.  Dynamic 3-D Visualization of Vocal Tract Shaping During Speech , 2013, IEEE Transactions on Medical Imaging.

[8]  J. Dang,et al.  Construction and control of a physiological articulatory model. , 2004, The Journal of the Acoustical Society of America.

[9]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[10]  H. Levene Robust tests for equality of variances , 1961 .

[11]  Yohan Payan,et al.  Synthesis of V-V sequences with a 2D biomechanical tongue model controlled by the Equilibrium Point Hypothesis , 1997, Speech Commun..

[12]  W. Kier,et al.  Tongues, tentacles and trunks: the biomechanics of movement in muscular‐hydrostats , 1985 .

[13]  Gérard Bailly,et al.  Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images , 2002, J. Phonetics.

[14]  Pierre Badin,et al.  Three-dimensional linear modeling of tongue: Articulatory data and models , 2006 .

[15]  Z. Muhl,et al.  Estimation of tongue volume from magnetic resonance imaging. , 1991, The Angle orthodontist.

[16]  Jitendra Malik,et al.  Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  J. Perkell,et al.  Influences of tongue biomechanics on speech movements during the production of velar stop consonants: a modeling study. , 2003, Journal of the Acoustical Society of America.

[18]  H. B. Mann,et al.  On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other , 1947 .

[19]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[20]  A A Lowe,et al.  Three-dimensional CT reconstructions of tongue and airway in adult subjects with obstructive sleep apnea. , 1986, American journal of orthodontics and dentofacial orthopedics : official publication of the American Association of Orthodontists, its constituent societies, and the American Board of Orthodontics.

[21]  Shrikanth S. Narayanan,et al.  Toward automatic vocal tract area function estimation from accelerated three-dimensional magnetic resonance imaging , 2013 .

[22]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .