Multi-Modal Question-Answering: Questions without Keyboards

This paper describes our work to let players in a virtual world pose questions without relying on textual input. Our approach is to create enhanced virtual photographs by annotating them with semantic information drawn from the 3D environment's scene graph. The player can then use these annotated photos to interact with inhabitants of the world through automatically generated queries that are guaranteed to be relevant, grammatical, and unambiguous. While the range of queries is more limited than a text-input system would permit, in the gaming environment we are exploring these limitations are offset by the practical concerns that make text input inappropriate.
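The pipeline sketched above can be illustrated in miniature. The sketch below is a minimal, hypothetical illustration (none of these names come from the paper): a region of a virtual photograph is tagged with a scene-graph node, and fixed question templates are instantiated from that node, which is what makes the resulting queries relevant, grammatical, and unambiguous by construction.

```python
from dataclasses import dataclass

@dataclass
class SceneNode:
    # Hypothetical scene-graph entry for an object visible in the photo.
    name: str      # display name, e.g. "dragon gate"
    category: str  # semantic type, e.g. "landmark"

@dataclass
class PhotoAnnotation:
    # Links a 2D region of the virtual photograph to a scene-graph node.
    bbox: tuple[int, int, int, int]  # (x, y, w, h) in image coordinates
    node: SceneNode

def generate_queries(annotation: PhotoAnnotation) -> list[str]:
    """Instantiate fixed question templates from the annotated object.

    Relevance comes from the object appearing in the photo, grammaticality
    from the templates, and unambiguity from naming the exact entity.
    """
    n = annotation.node
    return [
        f"What is the {n.name}?",
        f"Where can I find another {n.category} like the {n.name}?",
    ]

# Example: the player photographed a landmark and taps its region.
tag = PhotoAnnotation(bbox=(120, 40, 200, 160),
                      node=SceneNode(name="dragon gate", category="landmark"))
for q in generate_queries(tag):
    print(q)
```

A real system would select among many templates per object category; the point of the sketch is only that query generation is a lookup plus template fill, requiring no keyboard input from the player.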