Final Report to Nsf of the Standards for Facial Animation Workshop Final Report to Nsf of the Standards for Facial Animation Workshop

Report to the National Science Foundation based on the Standards for Facial Animation Workshop, November 11-12, 1993, sponsored by NSF’s Division of Information, Robotics, and Intelligent Systems, and the Institute for Research in Cognitive Science. Comments University of Pennsylvania Institute for Research in Cognitive Science Technical Report No. IRCS-94-21. This technical report is available at ScholarlyCommons: http://repository.upenn.edu/ircs_reports/167

[1]  Michael M. Cohen,et al.  Modeling Coarticulation in Synthetic Visual Speech , 1993 .

[2]  W. Larrabee,et al.  A finite element model of skin deformation. II. An experimental model of skin deformation , 1986, The Laryngoscope.

[3]  Thomas W. Sederberg,et al.  Free-form deformation of solid geometric models , 1986, SIGGRAPH.

[4]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[5]  C. Pelachaud Communication and coarticulation in facial animation , 1992 .

[6]  E. Petajan,et al.  An improved automatic lipreading system to enhance speech recognition , 1988, CHI '88.

[7]  D. Pisoni,et al.  Training Japanese listeners to identify English /r/ and /l/: a first report. , 1991, The Journal of the Acoustical Society of America.

[8]  Anders Löfqvist,et al.  Speech as Audible Gestures , 1990 .

[9]  Daniel Thalmann,et al.  Sculpting with the `ball and mouse' metaphor , 1991 .

[10]  Yehezkel Yeshurun,et al.  Robust detection of facial features by generalized symmetry , 1992, [1992] Proceedings. 11th IAPR International Conference on Pattern Recognition.

[11]  Peter Goldenthal,et al.  Posing and Judging Facial Expressions of Emotion: The Effects of Social Skills , 1985 .

[12]  Demetri Terzopoulos,et al.  Modelling and animating faces using scanned data , 1991, Comput. Animat. Virtual Worlds.

[13]  David B. Pisoni,et al.  Text-to-speech: the mitalk system , 1987 .

[14]  B. Lindblom,et al.  Acoustical consequences of lip, tongue, jaw, and larynx movement. , 1970, The Journal of the Acoustical Society of America.

[15]  David R. Forsey,et al.  Hierarchical B-spline refinement , 1988, SIGGRAPH.

[16]  Thomas K. Pilgram,et al.  Facial surface scanner , 1991, IEEE Computer Graphics and Applications.

[17]  Yochai Konig,et al.  "Eigenlips" for robust speech recognition , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  Manjula Patel,et al.  FACES: Facial Animation, Construction and Editing System , 1991, Eurographics.

[19]  A. Montgomery,et al.  Physical characteristics of the lips underlying vowel lipreading performance. , 1983, The Journal of the Acoustical Society of America.

[20]  A. Scheflen THE SIGNIFICANCE OF POSTURE IN COMMUNICATION SYSTEMS. , 1964, Psychiatry.

[21]  Catherine Pelachaud,et al.  Rule-Structured Facial Animation System , 1993, IJCAI.

[22]  K. D. Kryter,et al.  ARTICULATION-TESTING METHODS: CONSONANTAL DIFFERENTIATION WITH A CLOSED-RESPONSE SET. , 1965, The Journal of the Acoustical Society of America.

[23]  Daniel Thalmann,et al.  SMILE: A Multilayered Facial Animation System , 1991, Modeling in Computer Graphics.

[24]  S. Kaiser,et al.  Automated coding of facial behavior in human-computer interactions with facs , 1992 .

[25]  Hiroshi Harashima,et al.  A Media Conversion from Speech to Facial Image for Intelligent Man-Machine Interface , 1991, IEEE J. Sel. Areas Commun..

[26]  Alexander H. Waibel,et al.  Improving connected letter recognition by lipreading , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[27]  Alex Pentland,et al.  A vision system for observing and extracting facial action parameters , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Prof. Dr. Nadia Magnenat Thalmann,et al.  Synthetic Actors , 1990, Computer Science Workbench.

[29]  Mark Steedman,et al.  Animated conversation: rule-based generation of facial expression, gesture & spoken intonation for multiple conversational agents , 1994, SIGGRAPH.

[30]  Eric David Petajan,et al.  Automatic Lipreading to Enhance Speech Recognition (Speech Reading) , 1984 .

[31]  Martine Grice,et al.  Multilingual synthesiser assessment using semantically unpredictable sentences , 1989, EUROSPEECH.

[32]  M E Demorest,et al.  A computational approach to analyzing sentential speech perception: phoneme-to-phoneme stimulus-response alignment. , 1994, The Journal of the Acoustical Society of America.

[33]  Igor Zlokarnik Experiments with an articulatory speech recognizer , 1993, EUROSPEECH.

[34]  Frederick I. Parke,et al.  Computer generated animation of faces , 1972, ACM Annual Conference.

[35]  N. Badler,et al.  Linguistic Issues in Facial Animation , 1991 .

[36]  N Magnenat Thalmann,et al.  Creating Realistic Three-Dimensional Human Shape Characters for Computer-Generated Films , 1991 .

[37]  T. Evans,et al.  Effect of Context on Perception of Emotion among Psychiatric Patients , 1986, Perceptual and motor skills.

[38]  W. Larrabee A finite element model of skin deformation. I. Biomechanics of skin and soft tissue: A review , 1986, The Laryngoscope.

[39]  A. Yuille Deformable Templates for Face Recognition , 1991, Journal of Cognitive Neuroscience.

[40]  Harold Knudsen,et al.  The effects of verbal statements of context on facial expressions of emotion , 1983 .

[41]  Frank Biocca,et al.  A Survey of Position Trackers , 1992, Presence: Teleoperators & Virtual Environments.

[42]  Louis C. W. Pols Multi-lingual synthesis evaluation methods , 1992, ICSLP.

[43]  Marie-Luce Viaud,et al.  Facial animation with wrinkles , 1992 .

[44]  Steven Donald Pieper,et al.  More than skin deep : physical modeling of facial tissue , 1989 .

[45]  Frederic I. Parke,et al.  A model for human faces that allows speech synchronized animation , 1974, SIGGRAPH '74.

[46]  Lance Williams,et al.  Performance-driven facial animation , 1990, SIGGRAPH.

[47]  John P. Lewis,et al.  Automated lip-synch and speech synthesis for character animation , 1987, CHI 1987.

[48]  Thomas S. Huang,et al.  Final Report To NSF of the Planning Workshop on Facial Expression Understanding , 1992 .

[49]  Irfan Essa,et al.  Tracking facial motion , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[50]  Kenji Kurosu,et al.  Speech Recognition by Image Processing of Lip Movements , 1986 .

[51]  Daniel Thalmann,et al.  A Multimedia Testbed for Facial Animation Control , 1993, MMM.

[52]  Larry S. Davis,et al.  Computing spatio-temporal representations of human faces , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[53]  Demetri Terzopoulos,et al.  Constructing Physics-Based Facial Models of Individuals , 1993 .

[54]  Shigeo Morishima,et al.  Speech-to-image media conversion based on VQ and neural network , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[55]  Keith Waters,et al.  A muscle model for animation three-dimensional facial expression , 1987, SIGGRAPH.

[56]  Christian Benoît,et al.  A 3-d model of the lips for visual speech synthesis , 1994, SSW.

[57]  Jean-Pierre Zerling Aspects articulatoires de la labialite vocalique en francais. Contribution a la modelisation a partir de labiophotographies, labiofilms et films radiologiques. Etude statique, dynamique et contrastive , 1990 .

[58]  Demetri Terzopoulos,et al.  Analysis and Synthesis of Facial Image Sequences Using Physical and Anatomical Models , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[59]  Pertti Roivainen,et al.  3-D Motion Estimation in Model-Based Facial Image Coding , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[60]  Raymond D. Kent,et al.  Coarticulation in recent speech production models , 1977 .

[61]  Brian Wyvill,et al.  Speech and expression: a computer solution to face animation , 1986 .

[62]  C. Benoît,et al.  A set of French visemes for visual speech synthesis , 1994 .

[63]  Peter Bull,et al.  Body movement and emphasis in speech , 1985 .

[64]  Demetri Terzopoulos,et al.  Techniques for Realistic Facial Modeling and Animation , 1991 .

[65]  Alex Pentland,et al.  Automatic lipreading by optical-flow analysis , 1989 .

[66]  Peter M. Will,et al.  Grid Coding: A Preprocessing Technique for Robot and Machine Vision , 1971, IJCAI.

[67]  S. Zietz,et al.  Bioengineering approach to non-invasive measurement of body composition. , 1994, Biomedical sciences instrumentation.

[68]  Alex Pentland,et al.  Visually Controlled Graphics , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[69]  Keith Waters,et al.  The computer synthesis of expressive three-dimensional facial character animation , 1988 .

[70]  Pat Hanrahan,et al.  Reflection from layered surfaces due to subsurface scattering , 1993, SIGGRAPH.

[71]  Stephane Cotin,et al.  Craniofacial surgery simulation testbed , 1994, Other Conferences.

[72]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[73]  Allen A. Montgomery,et al.  Automatic optically-based recognition of speech , 1988, Pattern Recognit. Lett..

[74]  Demetri Terzopoulos,et al.  A physical model of facial tissue and muscle articulation , 1990, [1990] Proceedings of the First Conference on Visualization in Biomedical Computing.

[75]  Yukio Kobayashi,et al.  Method Of Detecting Face Direction Using Image Processing For Human Interface , 1988, Other Conferences.

[76]  Gregory J. Wolff,et al.  Neural network lipreading system for improved speech recognition , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[77]  P. Ekman,et al.  Unmasking the face : a guide to recognizing emotions from facial clues , 1975 .

[78]  Peter C. Litwinowicz,et al.  Facial Animation by Spatial Mapping , 1991 .

[79]  Stephen Michael Platt,et al.  A structural model of the human face (graphics, animation, object representation) , 1985 .

[80]  D W Massaro,et al.  Discovery and expository methods in teaching visual consonant and word identification. , 1992, Journal of speech and hearing research.

[81]  Janet E. Cahn Generating expression in synthesized speech , 1989 .

[82]  C. von der Malsburg,et al.  Distortion invariant object recognition by matching hierarchically labeled graphs , 1989, International 1989 Joint Conference on Neural Networks.

[83]  B. Walden,et al.  Effects of training on the visual recognition of consonants. , 1977, Journal of speech and hearing research.

[84]  Daniel Thalmann,et al.  The Direction of Synthetic Actors in the Film Rendez-Vous a Montreal , 1987, IEEE Computer Graphics and Applications.

[85]  Steven D. Pieper,et al.  CAPS: computer-aided plastic surgery , 1992 .

[86]  Stephen M. Omohundro,et al.  Surface Learning with Applications to Lipreading , 1993, NIPS.

[87]  D H Klatt,et al.  Review of text-to-speech conversion for English. , 1987, The Journal of the Acoustical Society of America.

[88]  Frederic I. Parke,et al.  A parametric model for human faces. , 1974 .

[89]  C. Benoit,et al.  On the assessment of synthetic speech , 1992 .

[90]  Tsuneya Kurihara,et al.  A Transformation Method for Modeling and Animation of the Human Face from Photographs , 1991 .

[91]  Parke,et al.  Parameterized Models for Facial Animation , 1982, IEEE Computer Graphics and Applications.

[92]  Dominic W. Massaro,et al.  Synthesis of visible speech , 1990 .

[93]  Carol Wang,et al.  Langwidere: A Hierarchical Spline Based Facial Animation System with Simulated Muscles , 1993 .

[94]  A. Kendon Movement coordination in social interaction: some examples described. , 1970, Acta psychologica.

[95]  Clea T. Waite,et al.  The facial action control editor, face : a parametric facial expression editor for computer generated animation , 1989 .

[96]  Shigeru Muraki,et al.  Volumetric shape description of range data using “Blobby Model” , 1991, SIGGRAPH.

[97]  Ken-ichi Anjyo,et al.  A simple method for extracting the natural beauty of hair , 1992, SIGGRAPH.

[98]  Norman I. Badler,et al.  Animating facial expressions , 1981, SIGGRAPH '81.

[99]  Ian Craw,et al.  Automatic extraction of face-features , 1987, Pattern Recognit. Lett..

[100]  Christian Abry,et al.  "Laws" for lips , 1986, Speech Commun..

[101]  S A Duffy,et al.  Comprehension of Synthetic Speech Produced by Rule: A Review and Theoretical Interpretation , 1992, Language and speech.

[102]  Eric D. Petajan Automatic lipreading to enhance speech recognition , 1984 .

[103]  William D. Voiers PERFORMANCE EVALUATION OF SPEECH PROCESSING DEVICES. III. DIAGNOSTIC EVALUATION OF SPEECH INTELLIGIBILITY , 1967 .

[104]  Demetri Terzopoulos,et al.  Physically-based facial modelling, analysis, and animation , 1990, Comput. Animat. Virtual Worlds.

[105]  D Terzopoulos,et al.  The computer synthesis of expressive faces. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[106]  A. Montgomery,et al.  Perceptual dimensions underlying vowellipreading performance. , 1976, Journal of speech and hearing research.

[107]  V. Fromkin Lip Positions in American English Vowels , 1964 .

[108]  Alan Jeffrey Goldschen,et al.  Continuous automatic speech recognition by lipreading , 1993 .

[109]  Irfan Essa,et al.  Analysis, interpretation and synthesis of facial expressions , 1995 .

[110]  G. Fairbanks Test of Phonemic Differentiation: The Rhyme Test , 1958 .

[111]  Michael M. Cohen,et al.  Real-time analysis-synthesis and intelligibility of talking faces , 1994, SSW.

[112]  J. P. Lewis,et al.  Automated lip-synch and speech synthesis for character animation , 1987, CHI '87.

[113]  Martin A. Fischler,et al.  The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[114]  Roberto Brunelli,et al.  Face Recognition: Features Versus Templates , 1993, IEEE Trans. Pattern Anal. Mach. Intell..