Natural Conversational Interfaces to Geospatial Databases

Natural (spoken) language, combined with gestures and other human modalities, provides a promising alternative for interacting with computers, but this benefit has not yet been explored for interactions with geographical information systems (GIS). This paper presents a conceptual framework for enabling conversational human-GIS interactions. Conversations with a GIS are modeled as human-computer collaborative activities within a task domain. We adopt a mental-state view of collaboration and discourse and propose a plan-based computational model for conversational grounding and dialogue generation. At the implementation level, our approach is to introduce a dialogue agent, GeoDialogue, between a user and a geographical information server. GeoDialogue actively recognizes the user's information needs, reasons about detailed cartographic and database procedures, and acts cooperatively to assist the user's problem solving. GeoDialogue serves as a semantic 'bridge' between human language and the formal language that a GIS understands. The behavior of such dialogue-assisted human-GIS interfaces is illustrated through a scenario simulating an emergency response session during a hurricane event.
