Script-based facial gesture and speech animation using a NURBS based face model

Abstract In this paper, we present a technique for simulating different facial gestures and speech. The distinguishing features of this work are 2-fold. First, we adopt a four-level hierarchical, non-uniform rational B-spline (NURBS) based face model. The use of NURBS surface representation of the face has the advantages of increased smoothness and ease of reshape over other forms of geometric representation. Second, the mouth movement animation and sound production in speech are phoneme based and an English text to phoneme parser is used to translate any English text in speech into its phoneme equivalent. As phoneme is the basic unit of mouth movement and sound production, a phoneme based approach of speech animation resembles actual speech and allows arbitrary English text rather than a restricted set of tokens be spoken. A Facial Action Coding System is also adopted to control the modification of the face model as it describes the basis of facial expression. Further, a user interface is developed which allows the user to edit interactively or load in a text script describing the animation sequence in terms of facial gesture names and English text. The system parses the English text in the text script to phoneme strings. The animation sequence described by the ccript can then be generated and played back in a flexible way.

[1]  Frederic I. Parke,et al.  Control Parameterization for Facial Animation , 1991 .

[2]  Colin Yallop,et al.  An Introduction to Phonetics and Phonology , 1990 .

[3]  Mark Steedman,et al.  Animated conversation: rule-based generation of facial expression, gesture & spoken intonation for multiple conversational agents , 1994, SIGGRAPH.

[4]  F. Lavagetto,et al.  Converting speech into lip movements: a multimedia telephone for hard of hearing people , 1995 .

[5]  Rodney W. Johnson,et al.  Automatic translation of english text to phonetics by means of letter-to-sound rules (nrl report 794 , 1976 .

[6]  N. Badler,et al.  Linguistic Issues in Facial Animation , 1991 .

[7]  Demetri Terzopoulos,et al.  Analysis and Synthesis of Facial Image Sequences Using Physical and Anatomical Models , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Vicki Bruce,et al.  Processing Images of Faces , 1992 .

[9]  Brian Wyvill,et al.  Speech and expression: a computer solution to face animation , 1986 .

[10]  W. Smalley Manual of articulatory phonetics , 1961 .

[11]  Demetri Terzopoulos,et al.  Techniques for Realistic Facial Modeling and Animation , 1991 .

[12]  R. Kikinis,et al.  Computer‐Assisted Three‐Dimensional Planning in Craniofacial Surgery , 1993, Plastic and reconstructive surgery.

[13]  Keith Waters,et al.  A muscle model for animation three-dimensional facial expression , 1987, SIGGRAPH.

[14]  G. Boulogne,et al.  The Mechanism of Human Facial Expression , 1990 .

[15]  M. Carter Computer graphics: Principles and practice , 1997 .