论文信息 - M Ami a Taxonomy of Multimodal Interaction in the Human Information Processing System

M Ami a Taxonomy of Multimodal Interaction in the Human Information Processing System

Editors Abstract This document has been prepared in the ESPRIT BRA No. 8579, Multimodal Integration for Advanced Multimedia Interfaces | in the following referred to as MIAMI | in order to serve as a basis for future work. The basic terms which will be used in MIAMI will be deened and an overview on man-machine-interfaces will be given. The term \taxonomy" is used in the following sense, adapted from 217]: \1: the study of the general principles of scientiic classiication: SYSTEMATICS; 2: CLASSIFICATION; specif: orderly clas-siication of plants and animals according to their presumed natural relationships"; but instead of plants and animals, we attempt to classify input and output modalities.

[1] Joel S. Warm,et al. Psychology of Perception , 1957 .

[2] W. Lindemann. Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals. , 1986, The Journal of the Acoustical Society of America.

[3] Michael M. Cohen,et al. Modeling Coarticulation in Synthetic Visual Speech , 1993 .

[4] Alexander H. Waibel,et al. Improving connected letter recognition by lipreading , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5] Parke,et al. Parameterized Models for Facial Animation , 1982, IEEE Computer Graphics and Applications.

[6] W. Gaik,et al. Combined evaluation of interaural time and intensity differences: psychoacoustic results and computer modeling. , 1993, The Journal of the Acoustical Society of America.

[7] Maurizio Gentilucci,et al. On gesture and speech , 2015 .

[8] E. Reed. The Ecological Approach to Visual Perception , 1989 .

[9] R. Plamondon,et al. The relation between pen force and pen-point kinematics in handwriting , 1990, Biological Cybernetics.

[10] Mark T. Maybury,et al. Intelligent multimedia interfaces , 1994, CHI Conference Companion.

[11] D H Warren,et al. Spatial Localization under Conflict Conditions: Is There a Single Explanation? , 1979, Perception.

[12] P. Ekman,et al. Facial action coding system , 2019 .

[13] P. Ladefoged. WHAT ARE LINGUISTIC SOUNDS MADE OF , 1980 .

[14] S. McAdams,et al. Auditory Cognition. (Book Reviews: Thinking in Sound. The Cognitive Psychology of Human Audition.) , 1993 .

[15] Stuart K. Card,et al. Evaluation of mouse, rate-controlled isometric joystick, step keys, and text keys, for text selection on a CRT , 1987 .

[16] Lindsay William Macdonald,et al. Interacting with Virtual Environments , 1993 .

[17] S. S. Stevens. On the psychophysical law. , 1957, Psychological review.

[18] Rae A. Earnshaw,et al. Virtual Reality Systems , 1993 .

[19] Palmer Morrel-Samuels,et al. Clarifying the Distinction Between Lexical and Gestural Commands , 1990, Int. J. Man Mach. Stud..

[20] Ken Pimentel,et al. Virtual reality - through the new looking glass , 1993 .

[21] Louis D. Braida,et al. Evaluating the articulation index for auditory-visual input. , 1987, The Journal of the Acoustical Society of America.

[22] Innes A. Ferguson. TouringMachines: an architecture for dynamic, rational, mobile agents , 1992 .

[23] Joseph S. Perkell,et al. On the Use of Feedback in Speech Production , 1981 .

[24] Monique Nahas,et al. Animation of a B-Spline figure , 1988, The Visual Computer.

[25] M. Halle,et al. Preliminaries to Speech Analysis: The Distinctive Features and Their Correlates , 1961 .

[26] Giancarlo Ferrigno,et al. Automatic analysis of lips and jaw kinematics in VCV sequences , 1989, EUROSPEECH.

[27] H. Hudde,et al. Estimation of the area function of human ear canals by sound pressure measurements. , 1983, The Journal of the Acoustical Society of America.

[28] Abigail Sellen,et al. A comparison of input devices in element pointing and dragging tasks , 1991, CHI.

[29] C. Pelachaud. Communication and coarticulation in facial animation , 1992 .

[30] J. Vroomen,et al. Hearing Voices and Seeing Lips. Investigations in the Psychology of Lipreading , 1992 .

[31] Karun B. Shimoga,et al. A survey of perceptual feedback issues in dexterous telemanipulation. I. Finger force feedback , 1993, Proceedings of IEEE Virtual Reality Annual International Symposium.

[32] B.M. Jau,et al. Anthropomorhic Exoskeleton dual arm/hand telerobot controller , 2002, IEEE International Workshop on Intelligent Robots.

[33] David Wessel,et al. Improvisation with Highly Interactive Real-Time Performance Systems , 1991, ICMC.

[34] Abigail Sellen,et al. A study in interactive 3-D rotation using 2-D control devices , 1988, SIGGRAPH.

[35] Dominic W. Massaro,et al. Synthesis of visible speech , 1990 .

[36] Noam Chomsky,et al. The Sound Pattern of English , 1968 .

[37] Hiroshi Harashima,et al. A Media Conversion from Speech to Facial Image for Intelligent Man-Machine Interface , 1991, IEEE J. Sel. Areas Commun..

[38] Wolfgang Felger,et al. Die Virtuelle Umgebung - Eine neue Epoche in der Mensch-Maschine-Kommunikation, Teil I: Einordnung, Begriffe und Geräte , 1994, Inform. Spektrum.

[39] Stephen Brewster,et al. A Detailed Investigation into the Effectiveness of Earcons , 1997 .

[40] P Bertelson,et al. Auditory-visual interaction and the timing of inputs , 1987, Psychological research.

[41] E. Bizzi,et al. Mechanisms underlying achievement of final head position. , 1976, Journal of neurophysiology.

[42] Allen Newell,et al. The keystroke-level model for user performance time with interactive systems , 1980, CACM.

[43] Hewitt D. Crane,et al. Pen and voice unite , 1993 .

[44] Demetri Terzopoulos,et al. Techniques for Realistic Facial Modeling and Animation , 1991 .

[45] M. Bodden. Modeling human sound-source localization and the cocktail-party-effect , 1993 .

[46] L. Braida. Crossmodal Integration in the Identification of Consonant Segments , 1991, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[47] Donald Laming,et al. Information theory of choice-reaction times , 1968 .

[48] Klaus Genuit,et al. Evaluating sound environments with binaural technology-Some basic consideration , 1993 .

[49] Biing-Hwang Juang,et al. Hidden Markov Models for Speech Recognition , 1991 .

[50] A. King,et al. Auditory function: Neurobiological bases of hearing G.M. Edelman W.E. , 1990, Neuroscience.

[51] Raymond D. Kent,et al. Coarticulation in recent speech production models , 1977 .

[52] Christian Abry,et al. Plateaus, catastrophes and the structuring of vowel systems , 1989 .

[53] Gary M. Olson,et al. The growth of cognitive modeling in human-computer interaction since GOMS , 1990 .

[54] Daniel Thalmann,et al. Abstract muscle action procedures for human face animation , 1988, The Visual Computer.

[55] Pietro Morasso,et al. Self-organizing topographic maps and motor planning , 1994 .

[56] Joëlle Coutaz,et al. A design space for multimodal systems: concurrent processing and data fusion , 1993, INTERCHI.

[57] Giancarlo Ferrigno,et al. Articulatory dynamics of lips in Italian /'vpv/ and /'vbv/ sequences , 1993, EUROSPEECH.

[58] P. Ladefoged. A course in phonetics , 1975 .

[59] Wilhelm Burger,et al. Digital Image Processing - An Algorithmic Introduction using Java , 2008, Texts in Computer Science.

[60] Abigail Sellen,et al. Two-handed input in a compound task , 1994, CHI 1994.

[61] N. Badler,et al. Linguistic Issues in Facial Animation , 1991 .

[62] K. Lashley. The problem of serial order in behavior , 1951 .

[63] D. H. Warren,et al. The role of visual-auditory “compellingness” in the ventriloquism effect: Implications for transitivity among the spatial senses , 1981, Perception & psychophysics.

[64] Sieb G. Nooteboom,et al. The target theory of speech production , 1970 .

[65] Tayeb Mohamadi,et al. Synthèse à partir du texte de visages parlants : réalisation d'un prototype et mesures d'intelligibilité bimodale , 1993 .

[66] Gillian Rhodes,et al. Cross-modal effects on visual and auditory object perception , 1984, Perception & psychophysics.

[67] Michael Good,et al. Participatory design of a portable torque-feedback device , 1992, CHI.

[68] C D Marsden,et al. Latency measurements compatible with a cortical pathway for the stretch reflex in man. , 1973, The Journal of physiology.

[69] Garner Wr. An informational analysis of absolute judgments of loudness. , 1953 .

[70] Axel Mulder. Virtual Musical Instruments: Accessing the Sound Synthesis Universe as a Performer , 2007 .

[71] Ravin Balakrishnan,et al. Virtual hand tool with force feedback , 1994, CHI '94.

[72] James H. Abbs,et al. chapter 5 – Peripheral Mechanisms of Speech Motor Control , 1976 .

[73] Keith Waters,et al. A muscle model for animation three-dimensional facial expression , 1987, SIGGRAPH.

[74] Wayne A. Wickelgran. Context-sensitive coding, associative memory, and serial order in (speech) behavior. , 1969 .

[75] Makoto Shimojo,et al. Edge tracing of virtual shape using input device with force feedback , 1992, Systems and Computers in Japan.

[76] K. K. Neely. Effect of Visual Factors on the Intelligibility of Speech , 1956 .

[77] P. Fitts. The information capacity of the human motor system in controlling the amplitude of movement. , 1954, Journal of experimental psychology.

[78] Caroline Henton,et al. Saying and seeing it with feeling: techniques for synthesizing visible, emotional speech , 1994, SSW.

[79] A. Mlcoch,et al. Speech Production Models as Related to the Concept of Apraxia of Speech , 1980 .

[80] A A Montgomery,et al. Auditory and visual contributions to the perception of consonants. , 1974, Journal of speech and hearing research.

[81] A.,et al. Cognitive Engineering , 2008, Encyclopedia of GIS.

[82] F. Lavagetto,et al. Lipreadable frame animation driven by speech parameters , 1994, Proceedings of ICSIPNN '94. International Conference on Speech, Image Processing and Neural Networks.

[83] Mohamed Tahar Lallouache,et al. Un poste "visage-parole" couleur : acquisition et traitement automatique des contours des lèvres , 1991 .

[84] B. Stein,et al. Interactions among converging sensory inputs in the superior colliculus. , 1983, Science.

[85] N. P. Erber,et al. Auditory, visual, and auditory-visual recognition of consonants by children with normal and impaired hearing. , 1972, Journal of speech and hearing research.

[86] A. Benguerel,et al. Coarticulation of Upper Lip Protrusion in French , 1974, Phonetica.

[87] David Frohlich,et al. The Design Space of Interfaces , 1992 .

[88] M. Radeau. Auditory-visual spatial interaction and modularity , 1994, Current psychology of cognition = Cahiers de psychologie cognitive : CPC.

[89] Ning Xiang,et al. A miniature dummy head for binaural evaluation of tenth-scale acoustic models , 1991 .

[90] Jean R. Ward,et al. Digitizer Technology: Performance Characteristics and the Effects on the User Interface , 1987, IEEE Computer Graphics and Applications.

[91] E. H. Dooijes. Analysis of handwriting movements , 1983 .

[92] P. Falzon,et al. Architecture of a Multimodal Dialogue Interface for Knowledge-Based Systems , 1990 .

[93] Kerry P. Green,et al. Exploring the basis of the “McGurk effect”: Can perceivers combine information from a female face and a male voice? , 1990 .

[94] R. Fox. Modularity and the Motor Theory of Speech Perception , 1994 .

[95] L. Kaufman,et al. Handbook of perception and human performance , 1986 .

[96] Ruth Campbell,et al. Tracing Lip Movements: Making Speech Visible , 1988 .

[97] Shuji Hashimoto,et al. A computer music system that follows a human conductor , 1991, Computer.

[98] Wolfgang Felger,et al. How interactive visualization can benefit from multidimensional input devices , 1992, Electronic Imaging.

[99] P. Johnson-Laird. Mental models , 1989 .

[100] Ching Y. Suen,et al. The State of the Art in Online Handwriting Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[101] Leonello Tarabella. Guest Editor's introduction to the special Issue on Man‐Machine interaction in live performance , 1993 .

[102] Kristinn R. Thórisson,et al. Integrating Simultaneous Input from Speech, Gaze, and Hand Gestures , 1991, AAAI Workshop on Intelligent Multimedia Interfaces.

[103] Marc Leman,et al. AI-based Music Signal Applications - A Hybrid Approach , 1997 .

[104] Q Summerfield,et al. Use of Visual Information for Phonetic Perception , 1979, Phonetica.

[105] G. A. Miller. THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[106] M. Pichora-Fuller,et al. Coarticulation effects in lipreading. , 1982, Journal of speech and hearing research.

[107] William Colgrove. Intelligent user interfaces and the Internet , 1995 .

[108] Gregory J. Wolff,et al. Neural network lipreading system for improved speech recognition , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[109] Joëlle Coutaz,et al. Applying the Wizard of Oz Technique to the Study of Multimodal Systems , 1993, EWHCI.

[110] Pattie Maes,et al. Designing autonomous agents: Theory and practice from biology to engineering and back , 1990, Robotics Auton. Syst..

[111] Joseph T. Chung,et al. Hyperinstruments: Musically Intelligent and Interactive Performance and Creativity Systems , 1989, ICMC.

[112] David A. Wroblewski,et al. Architectural qualities and principles for multimodal and multimedia interfaces , 1992 .

[113] S. Nishida. Speech recognition enhancement by lip information , 1986, CHI '86.

[114] D. Massaro,et al. Models of integration given multiple sources of information. , 1990, Psychological review.

[115] Christian Benoît,et al. A 3-d model of the lips for visual speech synthesis , 1994, SSW.

[116] Thomas H. Massie,et al. The PHANToM Haptic Interface: A Device for Probing Virtual Objects , 1994 .

[117] Lance Williams,et al. Performance-driven facial animation , 1990, SIGGRAPH Courses.

[118] Hans-Leo Teulings,et al. Digital recording and processing of handwriting movements , 1984 .

[119] N. P. Erber. Interaction of audition and vision in the recognition of oral speech stimuli. , 1969, Journal of speech and hearing research.

[120] Robert J. Beaton,et al. User Evaluation Of Cursor-Positioning Devices For 3-D Display Workstations , 1988, Photonics West - Lasers and Applications in Science and Engineering.

[121] Michael A. Gigante,et al. Virtual Reality: Definitions, History and Applications , 1993, Virtual Reality Systems.

[122] John S. Lew,et al. Optimal Accelerometer Layouts for Data Recovery in Signature Verification , 1980, IBM J. Res. Dev..

[123] Thecla Schiphorst,et al. Tools for interaction with the creative process of composition , 1990, CHI '90.

[124] LAMBERT R. B. SCHOMAKER,et al. A computational model of cursive handwriting , 1987 .

[125] J. Hannerz,et al. Contraction time and voluntary discharge properties of individual short toe extensor motor units in man. , 1979, The Journal of physiology.

[126] L. E. Murphy. Absolute judgments of duration. , 1966 .

[127] H. W. Campbell,et al. Phoneme recognition by ear and by eye: a distinctive feature analysis , 1974 .

[128] J Schroeter. The use of acoustical test fixtures for the measurement of hearing protector attenuation. Part I: Review of previous work and the design of an improved test fixture. , 1986, The Journal of the Acoustical Society of America.

[129] D W Massaro,et al. American Psychological Association, Inc. Evaluation and Integration of Visual and Auditory Information in Speech Perception , 2022 .

[130] I. Scott MacKenzie,et al. Extending Fitts' law to two-dimensional tasks , 1992, CHI.

[131] Osamu Fujimura. Elementary Gestures and Temporal Organization — What Does an Articulatory Constraint Mean? , 1981 .

[132] A. Meltzoff,et al. The bimodal perception of speech in infancy. , 1982, Science.

[133] Motoyuki Akamatsu,et al. A multi-modal mouse with tactile and force feedback , 1994, Int. J. Hum. Comput. Stud..

[134] J. Bortz. Lehrbuch der Statistik , 1979 .

[135] B.P. Yuhas,et al. Integration of acoustic and visual speech signals using neural networks , 1989, IEEE Communications Magazine.

[136] C. Benoît,et al. A set of French visemes for visual speech synthesis , 1994 .

[137] Peter Ladefoged,et al. PHONETIC PREREQUISITES FOR A DISTINCTIVE FEATURE THEORY , 1972 .

[138] D Goodman,et al. On the nature of human interlimb coordination. , 1979, Science.

[139] Wulf Pompetzki. Psychoakustische Verifikation von Computermodellen zur binauralen Raumsimulation , 1993 .

[140] R. Campbell,et al. Hearing by Eye , 1980, The Quarterly journal of experimental psychology.

[141] Roger B. Dannenberg,et al. Panel I-computer-generated music and multimedia computing , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[142] Neil P. McAngus Todd,et al. The auditory “Primal Sketch”: A multiscale model of rhythmic grouping , 1994 .

[143] Yves Demazeau,et al. Principles and techniques for sensor data fusion , 1993, Signal Process..

[144] Tsuneya Kurihara,et al. A Transformation Method for Modeling and Animation of the Human Face from Photographs , 1991 .

[145] N. P. Erber. Auditory-visual perception of speech. , 1975, The Journal of speech and hearing disorders.

[146] Peter C. Litwinowicz,et al. Facial Animation by Spatial Mapping , 1991 .

[147] S. R. Ellis. Nature and origins of virtual environments: a bibliographical essay , 1991 .

[148] Sidney S. Simon,et al. Merging of the Senses , 2008, Front. Neurosci..

[149] C M Reed,et al. Research on the Tadoma method of speech communication. , 1983, The Journal of the Acoustical Society of America.

[150] Kiyoharu Aizawa,et al. Model-based analysis synthesis image coding (MBASIC) system for a person's face , 1989, Signal Process. Image Commun..

[151] P. Viviani,et al. Motor-perceptual interactions , 1992 .

[152] Marc Leman. Schema-based tone center recognition of musical signals , 1994 .

[153] J. Gibson. The Senses Considered As Perceptual Systems , 1967 .

[154] N. F. Dixon,et al. The Detection of Auditory Visual Desynchrony , 1980, Perception.

[155] D. Papadias,et al. Computational Imagery , 1992, Cogn. Sci..

[156] W. Lindemann. Extension of a binaural cross-correlation model by contralateral inhibition. II. The law of the first wave front. , 1986, The Journal of the Acoustical Society of America.

[157] P. MacNeilage. Motor control of serial ordering of speech. , 1970, Psychological review.

[158] E. Owens,et al. Visemes observed by hearing-impaired and normal-hearing adult viewers. , 1985, Journal of speech and hearing research.

[159] Lambert Schomaker,et al. Using stroke- or character-based self-organizing maps in the recognition of on-line, connected cursive script , 1993, Pattern Recognit..

[160] Hans J. Charwat. Lexikon der Mensch-Maschine-Kommunikation , 1992 .

[161] Luc Steels,et al. Emergent Frame Recognition and Its Use in Artificial Creatures , 1991, IJCAI.

[162] R. Wurtz,et al. Organization of monkey superior colliculus: enhanced visual response of superficial layer cells. , 1976, Journal of neurophysiology.

[163] Jock D. Mackinlay,et al. The cognitive coprocessor architecture for interactive user interfaces , 1989, UIST '89.

[164] Ivan E. Sutherland,et al. The Ultimate Display , 1965 .

[165] A. Macleod,et al. Quantifying the contribution of vision to speech perception in noise. , 1987, British journal of audiology.

[166] Allen A. Montgomery,et al. ANIMAT: A set of programs to generate, edit, and display sequences of vector-based images , 1982 .

[167] Allen Newell,et al. The psychology of human-computer interaction , 1983 .

[168] R. M. Halsey,et al. Absolute Judgments of Spectrum Colors , 1956 .

[169] James Lubker. Representation and Context Sensitivity , 1981 .

[170] Paula M. T. Smeele,et al. The contribution of vision to speech perception , 1991, EUROSPEECH.

[171] Réjean Plamondon,et al. An evaluation of motor models of handwriting , 1989, IEEE Trans. Syst. Man Cybern..

[172] P. L. Adams. THE ORIGINS OF INTELLIGENCE IN CHILDREN , 1976 .

[173] E. Robinson. Cybernetics, or Control and Communication in the Animal and the Machine , 1963 .

[174] Georg Geiser. Mensch-Maschine-Kommunikation , 1990 .

[175] Paul O'Rorke,et al. Explaining Emotions , 1994, Cogn. Sci..

[176] Gerd Hirzinger. Multisensory shared autonomy and tele-sensor programming - Key issues in space robotics , 1993, Robotics Auton. Syst..

[177] D C Shepherd,et al. Visual-neural correlate of speechreading ability in normal-hearing adults: reliability. , 1982, Journal of speech and hearing research.

[178] Marc Leman,et al. Introduction to auditory models in music research , 1994 .

[179] Allen Newell,et al. SOAR: An Architecture for General Intelligence , 1987, Artif. Intell..

[180] Denis Baggi,et al. Readings in computer-generated music , 1992 .

[181] Stephen Michael Platt,et al. A structural model of the human face (graphics, animation, object representation) , 1985 .

[182] H W HAKE,et al. Multidimensional stimulus differences and accuracy of discrimination. , 1955, Journal of experimental psychology.

[183] Michael M. Cohen,et al. Real-time analysis-synthesis and intelligibility of talking faces , 1994, SSW.

[184] W. H. Sumby,et al. Visual contribution to speech intelligibility in noise , 1954 .

[185] Tamas Ungvary,et al. NUNTIUS: A Computer System for the Interactive Composition and Analysis of Music and Dance , 1992 .

[186] A. Risberg,et al. Speech , Music and Hearing Quarterly Progress and Status Report Prosody and speechreading , 2007 .

[187] Brenda Laurel,et al. PLACEHOLDER: landscape and narrative in virtual environments , 1994, MULTIMEDIA '94.

[188] Ronald J. Brachman,et al. An overview of the KL-ONE Knowledge Representation System , 1985 .

[189] Daniel Thalmann,et al. The Direction of Synthetic Actors in the Film Rendez-Vous a Montreal , 1987, IEEE Computer Graphics and Applications.

[190] P. Bertelson,et al. Cross-modal bias and perceptual fusion with auditory-visual spatial discordance , 1981, Perception & psychophysics.

[191] P. Smolensky. On the proper treatment of connectionism , 1988, Behavioral and Brain Sciences.

[192] Novia Weiman,et al. An Evaluation Of Input Devices For 3-D Computer Display Workstations , 1987, Photonics West - Lasers and Applications in Science and Engineering.

[193] Carol A. Fowler,et al. Coarticulation and theories of extrinsic timing , 1980 .

[194] J. Abbs,et al. Lip and Jaw Motor Control during Speech: Motor Reorganization Responses to External Interference , 1974 .

[195] David Taylor. Hearing by Eye: The Psychology of Lip-Reading , 1988 .

[196] D. Massaro,et al. Perception of Synthesized Audible and Visible Speech , 1990 .

[197] R. Hammarberg. The metaphysics of coarticulation , 1976 .

[198] M. L. Meeks,et al. MEASUREMENT OF DYNAMIC DIGITIZER PERFORMANCE , 1990 .

[199] Y. Hatwell,et al. Toucher l'espace : la main et la perception tactile de l'espace , 1986 .

[200] Eric David Petajan,et al. Automatic Lipreading to Enhance Speech Recognition (Speech Reading) , 1984 .

[201] N Magnenat Thalmann,et al. Creating Realistic Three-Dimensional Human Shape Characters for Computer-Generated Films , 1991 .

[202] B. Wyvill,et al. Animating speech: an automated approach using speech synthesised by rules , 1988, The Visual Computer.

[203] Dennis H. Klatt,et al. Software for a cascade/parallel formant synthesizer , 1980 .

[204] I. Pollack. The Information of Elementary Auditory Displays , 1952 .

[205] H. McGurk,et al. Hearing lips and seeing voices , 1976, Nature.

[206] J. P. Lewis,et al. Automated lip-synch and speech synthesis for character animation , 1987, CHI '87.

[207] Doug Riecken. Intelligent agents , 1994, CACM.

[208] R. L. Elliott. Computer-generated movies as an analytic tool , 1978 .

[209] E.R. Brocklehurst. The NPL Electronic Paper Project , 1991, Int. J. Man Mach. Stud..

[210] Harry Hollien,et al. A Neural Model for Language and Speech. , 1978 .

[211] A.. Temporal Backpropagation for FIR Neural Networks , 2004 .

[212] David Wessel,et al. Real-Time Neural Network Processing of Gestural and Acoustic Signals , 1991, ICMC.

[213] Shen Lin,et al. A Advanced Telerobotic Control System for a Mobile Robot with Multisensor Feedback , 1995 .

[214] Hiroo Iwata,et al. Artificial reality with force-feedback: development of desktop virtual space with compact master manipulator , 1990, SIGGRAPH.

[215] Ning Xiang,et al. Binaural scale modelling for auralisation and prediction of acoustics in auditoria , 1993 .

[216] Jens Blauert,et al. Principles of binaural room simulation , 1992 .

[217] Catherine G. Wolf,et al. The Use of Hand-Drawn Gestures for Text Editing , 1987, Int. J. Man Mach. Stud..

[218] C. Faure. Pen and voice interface for incremental design of graphic documents , 1994 .

[219] Kurzfassung,et al. Calibrating the Active Stereo Vision System Kastor 1 for Real-time Robot Navigation Karlsruhe Active Stereo Real-time Vision System , 1994 .

[220] David Goldberg,et al. Touch-typing with a stylus , 1993, INTERCHI.

[221] Kim J. Vicente,et al. Ecological interface design: theoretical foundations , 1992, IEEE Trans. Syst. Man Cybern..

[222] C. Fowler. An event approach to the study of speech perception from a direct realist perspective , 1986 .

[223] David E. Kieras,et al. An Approach to the Formal Analysis of User Complexity , 1999, Int. J. Man Mach. Stud..

[224] J. Schroeter,et al. The use of acoustical test fixtures for the measurement of hearing protector attenuation. Part II: Modeling the external ear, simulating bone conduction, and comparing test fixture and real-ear data. , 1986, The Journal of the Acoustical Society of America.

[225] Brian Wyvill,et al. Speech and expression: a computer solution to face animation , 1986 .

[226] V J Samar,et al. Visual evoked-response components related to speechreading and spatial skills in hearing and hearing-impaired adults. , 1984, Journal of speech and hearing research.

[227] Norman I. Badler,et al. Animating facial expressions , 1981, SIGGRAPH '81.

[228] Jens Rasmussen,et al. Information Processing and Human-Machine Interaction: An Approach to Cognitive Engineering , 1986 .

[229] Antonio Camurri,et al. Music and Multimedia Knowledge Representation and Reasoning: The HARP System , 1995 .

[230] Alan Jeffrey Goldschen,et al. Continuous automatic speech recognition by lipreading , 1993 .

[231] Roger B. Dannenberg,et al. Multimedia interface design , 1992 .

[232] P L Olson,et al. Perception-Response Time to Unexpected Roadway Hazards , 1986, Human factors.

[233] David Gradwell,et al. Human Information Processing , 2017 .

[234] Dennis H. Klatt,et al. Speech perception: a model of acoustic–phonetic analysis and lexical access , 1979 .

[235] J Rhyne. Dialogue management for gestural interfaces , 1987, COMG.

[236] L. Steels. The Artiicial Life Roots of Artiicial Intelligence , 1993 .

[237] P. N. Kugler,et al. 1 On the Concept of Coordinative Structures as Dissipative Structures: I. Theoretical Lines of Convergence* , 1980 .

[238] E. Bizzi. 7 Central and Peripheral Mechanisms in Motor Control , 1980 .

[239] Marie-Luce Viaud. Animation faciale avec rides d'expression vieillissement et parole , 1992 .

[240] Dominic W. Massaro,et al. The motor theory of speech perception revisited , 2008, Psychonomic bulletin & review.

[241] Joachim Grollmann,et al. On the Software Structure of User Interface Management Systems , 1989, Eurographics.

[242] Kiyoharu Aizawa,et al. Real-time facial action image synthesis system driven by speech and text , 1990, Other Conferences.

[243] D C Donderi,et al. The effect of sound on visual apparent movement. , 1983, The American journal of psychology.

[244] Ben Shneiderman,et al. Designing the user interface - strategies for effective human-computer interaction, 2nd Edition , 1992 .

[245] N. P. Erber,et al. Voice/mouth synthesis and tactual/visual perception of /pa, ba, ma/. , 1978, The Journal of the Acoustical Society of America.

[246] Cynthia H. Null,et al. A white paper: NASA virtual environment research, applications, and technology , 1993 .

[247] H. Hudde,et al. Measurement of the eardrum impedance of human ears. , 1983, The Journal of the Acoustical Society of America.

[248] F. I. Parke June,et al. Computer Generated Animation of Faces , 1972 .

[249] Kaisa Väänänen,et al. Gesture Driven Interaction as a Human Factor in Virtual Environments - An Approach with Neural Networks , 1993, Virtual Reality Systems.

[250] Ben Pinkowski. LPC spectral moments for clustering acoustic transients , 1993, IEEE Trans. Speech Audio Process..