Domain knowledge acquisition and plan recognition by probabilistic reasoning

This paper introduces a statistical framework for extracting medical domain knowledge from heterogeneous corpora. The acquired information is incorporated into a natural language understanding agent and applied to DIKTIS, an existing web-based educational dialogue system for the chemotherapy of nosocomial and community acquired pneumonia, aiming at providing a more intelligent natural language interaction. Unlike the majority of existing dialogue understanding engines, the presented system automatically encodes semantic representation of a user's query using Bayesian networks. The structure of the networks is determined from annotated dialogue corpora using the Bayesian scoring method, thus eliminating the tedious and costly process of manually coding domain knowledge. The conditional probability distributions are estimated during a training phase using data obtained from the same set of dialogue acts. In order to cope with words absent from our restricted dialogue corpus, a separate offline module was incorporated, which estimates their semantic role from both medical and general raw text corpora, correlating them with known lexical-semantically similar words or predefined topics. Lexical similarity is identified on the basis of both contextual similarity and co-occurrence in conjunctive expressions. The evaluation of the platform was performed against the existing language natural understanding module of DIKTIS, the architecture of which is based on manually embedded domain knowledge.

[1]  Jennifer Chu-Carroll,et al.  Constructing and Utilizing a Model of User Preferences in Collaborative Consultation Dialogues , 1999, Comput. Intell..

[2]  Robert P. Goldman,et al.  A Bayesian Model of Plan Recognition , 1993, Artif. Intell..

[3]  Philip R. Cohen,et al.  Plans for Discourse , 2003 .

[4]  Ingrid Zukerman,et al.  Natural Language Processing and User Modeling: Synergies and Limitations , 2001, User Modeling and User-Adapted Interaction.

[5]  Alfred Kobsa User Modeling and User-Adapted Interaction , 2005, User Modeling and User-Adapted Interaction.

[6]  Henry A. Kautz,et al.  Generalized Plan Recognition , 1986, AAAI.

[7]  J. Giesecke,et al.  Risk of pneumonia in patients previously treated in hospital for pneumonia , 1992, The Lancet.

[8]  George K. Kokkinakis,et al.  Automatic Stochastic Tagging of Natural Language Texts , 1995, Comput. Linguistics.

[9]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[10]  Peter Haddawy,et al.  BANTER: a Bayesian network tutoring shell , 1997, Artif. Intell. Medicine.

[11]  Krzysztof J. Cios,et al.  Bayesian learning for cardiac SPECT image interpretation , 2002, Artif. Intell. Medicine.

[12]  Ingrid Zukerman,et al.  Bayesian Models for Keyhole Plan Recognition in an Adventure Game , 2004, User Modeling and User-Adapted Interaction.

[13]  Donald Michie High-Road and Low-Road Programs , 1981, AI Mag..

[14]  Anthony Jameson,et al.  Numerical uncertainty management in user and student modeling: An overview of systems and issues , 2005, User Modeling and User-Adapted Interaction.

[15]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[16]  James F. Allen,et al.  Toward Conversational Human-Computer Interaction , 2001, AI Mag..

[17]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[18]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[19]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[20]  B M Farr Prognosis and decisions in pneumonia. , 1997, The New England journal of medicine.

[21]  P. Montravers,et al.  Nosocomial pneumonia in ventilated patients: a cohort study evaluating attributable mortality and hospital stay. , 1993, The American journal of medicine.

[22]  Eric Horvitz,et al.  Decision theory in expert systems and artificial intelligenc , 1988, Int. J. Approx. Reason..

[23]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[24]  A. Hasman,et al.  Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[25]  Sandra Carberry Modeling the User's Plans and Goals , 1988, Comput. Linguistics.

[26]  Yuval Shahar,et al.  A Framework for Knowledge-Based Temporal Abstraction , 1997, Artif. Intell..

[27]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[28]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[29]  Ingrid Zukerman,et al.  # 2001 Kluwer Academic Publishers. Printed in the Netherlands. Predictive Statistical Models for User Modeling , 1999 .

[30]  Johanna D. Moore,et al.  An intelligent interactive system for delivering individualized information to patients , 1995, Artif. Intell. Medicine.

[31]  J. Azarewicz,et al.  Template-based multi-agent plan recognition for tactical situation assessment , 1989, [1989] Proceedings. The Fifth Conference on Artificial Intelligence Applications.

[32]  Sandra Carberry,et al.  Incorporating Default Inferences Into Plan Recognition , 1990, AAAI.

[33]  Edmund H. Durfee,et al.  Observational Uncertainty in Plan Recognition Among Interacting Robots , 1993 .

[34]  William B. Gevarter,et al.  Standards for evaluating expert system tools , 1991 .

[35]  Kyriakos Sgarbas,et al.  Natural Language in Dialogue Systems . A Case Study on a Medical Application , 2001 .