Animated Interface Agent Applying ATMS-Based Multimodal Input Interpretation

Two requirements must be met to develop a practical multimodal interface system: (1) integration of data that arrive with a delay and (2) elimination of ambiguity in the recognition results of each modality. This paper presents an efficient and generic methodology for interpreting multimodal input that satisfies both requirements. The proposed methodology can integrate delayed-arrival data satisfactorily and efficiently interpret multimodal input that contains ambiguity. The multimodal interpretation process is regarded as hypothetical reasoning, and the control mechanism of interpretation is formalized by applying the assumption-based truth maintenance system (ATMS). The proposed method is applied to an interface agent system that accepts multimodal input consisting of voice and direct indication gestures on a touch display. The system communicates with the user through a human-like interface agent's three-dimensional motion image with facial express...
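The idea of treating interpretation as hypothetical reasoning can be illustrated with a minimal sketch. This is not the paper's implementation: the class `TinyATMS`, its methods, and the node names such as `speech:move` are hypothetical, and the bookkeeping is far simpler than a full ATMS (in particular, it does not re-propagate labels when an antecedent gains new support later). Each recognition candidate is an assumption, each candidate interpretation is justified by a set of assumptions, and inconsistent combinations are recorded as nogoods, so late-arriving input only adds assumptions and justifications instead of forcing earlier work to be redone.

```python
from itertools import product


class TinyATMS:
    """A much-simplified, ATMS-style bookkeeping structure (illustrative only)."""

    def __init__(self):
        self.nogoods = []  # assumption sets known to be inconsistent
        self.labels = {}   # node -> set of environments (frozensets) supporting it

    def assume(self, name):
        # An assumption is supported by the environment containing only itself.
        self.labels[name] = {frozenset([name])}

    def consistent(self, env):
        # An environment is consistent if it contains no recorded nogood.
        return not any(ng <= env for ng in self.nogoods)

    def add_nogood(self, *assumptions):
        # Record the inconsistency and drop every environment that contains it.
        ng = frozenset(assumptions)
        self.nogoods.append(ng)
        for node in self.labels:
            self.labels[node] = {e for e in self.labels[node] if not ng <= e}

    def justify(self, node, antecedents):
        # Combine one environment per antecedent and keep the consistent unions.
        envs = set()
        for combo in product(*(self.labels[a] for a in antecedents)):
            env = frozenset().union(*combo)
            if self.consistent(env):
                envs.add(env)
        merged = self.labels.get(node, set()) | envs
        # Keep only minimal environments (no proper subset also present).
        self.labels[node] = {e for e in merged if not any(o < e for o in merged)}

    def believed(self, node):
        # A node is believed if at least one consistent environment supports it.
        return bool(self.labels.get(node))


atms = TinyATMS()

# Speech arrives first with two competing recognition candidates.
atms.assume("speech:move")
atms.assume("speech:remove")
atms.add_nogood("speech:move", "speech:remove")  # the candidates are mutually exclusive

# The touch gesture arrives later; earlier assumptions are reused as they are.
atms.assume("touch:icon_A")
atms.justify("cmd:move(icon_A)", ["speech:move", "touch:icon_A"])
atms.justify("cmd:remove(icon_A)", ["speech:remove", "touch:icon_A"])

# Further evidence rules out the "remove" candidate.
atms.add_nogood("speech:remove")

surviving = [n for n in atms.labels if n.startswith("cmd:") and atms.believed(n)]
```

In this sketch both command interpretations coexist until the final nogood is recorded, at which point only the interpretation that rests on the surviving speech candidate keeps a supporting environment.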
