Functional data analyses of lip motion.

The vocal tract's motion during speech is a complex patterning of the movement of many different articulators according to many different time functions. Understanding this myriad of gestures is important to a number of different disciplines including automatic speech recognition, speech and language pathologies, speech motor control, and experimental phonetics. Central issues are the accurate description of the shape of the vocal tract and determining how each articulator contributes to this shape. A problem facing all of these research areas is how to cope with the multivariate data from speech production experiments. In this paper techniques are described that provide useful tools for describing multivariate functional data such as the measurement of speech movements. The choice of data analysis procedures has been motivated by the need to partition the articulator movement in various ways: end effects separated from shape effects, partitioning of syllable effects, and the splitting of variation within an articulator site from variation from between sites. The techniques of functional data analysis seem admirably suited to the analyses of phenomena such as these. Familiar multivariate procedures such as analysis of variance and principal components analysis have their functional counterparts, and these reveal in a way more suited to the data the important sources of variation in lip motion. Finally, it is found that the analyses of acceleration were especially helpful in suggesting possible control mechanisms. The focus is on using these speech production data to understand the basic principles of coordination. However, it is believed that the tools will have a more general use.

[1]  J. Snow From the National Institute on Deafness and other Communication Disorders , 1994, The American journal of otology.

[2]  David J. Ostry,et al.  Functional data analyses of lip motion , 1995 .

[3]  J A Kelso,et al.  Lip-larynx coordination in speech: effects of mechanical perturbations to the lower lip. , 1994, The Journal of the Acoustical Society of America.

[4]  V L Gracco,et al.  Some organizational characteristics of speech movement control. , 1994, Journal of speech and hearing research.

[5]  B. Silverman,et al.  Nonparametric Regression and Generalized Linear Models: A roughness penalty approach , 1993 .

[6]  J. Ramsay,et al.  Some Tools for Functional Data Analysis , 1991 .

[7]  Shinji Maeda,et al.  Compensatory Articulation During Speech: Evidence from the Analysis and Synthesis of Vocal-Tract Shapes Using an Articulatory Model , 1990 .

[8]  J. Ramsay The analysis of replicated spatial functions of time , 1989 .

[9]  Berthold K. P. Horn,et al.  Closed-form solution of absolute orientation using unit quaternions , 1987 .

[10]  J. Ramsay,et al.  Principal components analysis of sampled functions , 1986 .

[11]  C. Atkeson,et al.  Kinematic features of unrestrained vertical arm movements , 1985, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[12]  J. Ramsay When the data are functions , 1982 .

[13]  P. Ladefoged,et al.  Factor analysis of tongue shapes. , 1971, Journal of the Acoustical Society of America.