Referent Identification Requests in Multi-Modal Dialogs

This paper describes an empirical study on what kinds of information are appropriate for referent identification requests in multi-modal dialogs, and how that information should be communicated in order to achieve the request desired. We conduct experiments in which experts explain the installation of a telephone in four situations: spoken-mode monolog; spoken-mode dialog; multi-modal monolog; and multi-modal dialog. Referent identification requests could be well analyzed from two perspectives: information communicated and the style of goal achievement. We find that there is a close relationship between the information conveyed via different communicative modes, and sketch a model that explains these results. In the model, information cannot be divided into the semantic content conveyed and the communicative modes employed, and is treated as the primitive unit for consideration. Pointing is considered as information in this sense. We also find that in dialogs, especially in spoken-mode dialogs, the speakers realize identification requests as series of fine-grained steps, and try to achieve them step by step.

[1]  H. H. Clark,et al.  Referring as a collaborative process , 1986, Cognition.

[2]  Marilyn A. Walker,et al.  Redundancy in Collaborative Dialogue , 1992, COLING.

[3]  Marilyn A. Walker,et al.  Experimentally Evaluating Communicative Strategies: The Effect of the Task , 1994, AAAI.

[4]  Alex Lascarides,et al.  Abducing Temporal Discourse , 1992, NLG.

[5]  Hiyan Alshawi,et al.  Memory and context for language interpretation , 1987 .

[6]  Yukiko Ishikawa Communicative mode dependent contribution from the recipient in information providing dialogue , 1994, ICSLP.

[7]  Wolfgang Wahlster,et al.  Designing Illustrated Texts: How Language Production Is Influenced by Graphics Generation , 1991, EACL.

[8]  Douglas E. Appelt,et al.  Planning English Referring Expressions , 1985, Artif. Intell..

[9]  Candace L. Sidner,et al.  Attention, Intentions, and the Structure of Discourse , 1986, CL.

[10]  Wim Claassen,et al.  Generating Referring Expressions in a Multimodal Environment , 1992, NLG.

[11]  Philip R. Cohen The Pragmatics of Referring and the Modality of Communication , 1984, Comput. Linguistics.

[12]  Mark T. Maybury,et al.  Planning Multimedia Explanations Using Communicative Acts , 1991, AAAI Workshop on Intelligent Multimedia Interfaces.

[13]  Thomas Rist,et al.  Referring To World Objects With Text And Pictures , 1994, COLING.

[14]  Philip R. Cohen,et al.  Discourse structure and performance efficiency in interactive and non-interactive spoken modalities☆ , 1991 .

[15]  W. Levelt,et al.  Monitoring and self-repair in speech , 1983, Cognition.

[16]  Steven K. Feiner,et al.  Coordinating Text and Graphics in Explanation Generation , 1989, HLT.