Understanding Messages in a Diagnostic Domain

The problem of coping with subject-matter sublanguages in text processing is well known in the natural language processing field. The main problem is to balance the use of generic knowledge sources and the specific needs of the sublanguages. This paper introduces the characteristics of the sublanguage found in diagnostic messages about automotive equipment failures, and discusses an architecture to analyse those messages. The model is based on a two-level partial parsing approach that uses a syntax-driven strategy to parse fragments of a sentence. A set of semantics-driven strategies is used to collapse the fragments. General knowledge sources are proposed for use as an independent syntax, a knowledge-based semantics, a pragmatic module and a two-level lexicon. The problem of balancing accuracy, robustness and efficiency in the message analysis is addressed. Finally some applied results are shown.

[1]  Norman K. Sondheimer,et al.  Meta-Rules as a Basis for Processing III-Formed Input , 1983, Am. J. Comput. Linguistics.

[2]  Udo Hahn,et al.  Making understanders out of parsers: Semantically driven parsing as a key concept for realistic text understanding applications , 1989, Int. J. Intell. Syst..

[3]  Paul S. Jacobs,et al.  To Parse or Not to Parse: Relation-Driven Text Skimming , 1990, COLING.

[4]  Ralph Grishman,et al.  Analyzing language in restricted domains : sublanguage description and processing , 1986 .

[5]  Lisa F. Rau,et al.  Integrating Top-Down And Bottom-Up Strategies In A Text Processing System , 1988, ANLP.

[6]  Philip J. Hayes,et al.  Automatic Extraction of Facts from Press Releases to Generate News Stories , 1992, ANLP.

[7]  Christine A. Montgomery,et al.  Language Systems, Inc.: description of the DBG system as used for MUC-3 , 1991, MUC.

[8]  Ralph M. Weischedel,et al.  BBN: description of the PLUM system as used for MUC-3 , 1991, MUC.

[9]  Chris Mellish,et al.  Some Chart-Based Techniques for Parsing Ill-Formed Input , 1989, ACL.

[10]  Lisa F. Rau,et al.  Innovations in Text Interpretation , 1993, Artif. Intell..

[11]  David D. McDonald An Efficient Chart-based Algorithm for Partial-Parsing of Unrestricted Texts , 1992, ANLP.

[12]  Steven L. Lytinen,et al.  ULINK: A Semantics-Driven Approach to Understanding Ungrammatical Input , 1991, AAAI.

[13]  Fabio Ciravegna,et al.  Coping with Modifiers in a Restricted Domain , 1993, AI*IA.

[14]  Jaime G. Carbonell,et al.  Recovery Strategies for Parsing Extragrammatical Language , 1983, CL.

[15]  Herbert Gish,et al.  BBN: Description of the PLUM System as Used for MUC-5 , 2005, MUC.

[16]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[17]  Douglas E. Appelt,et al.  FASTUS: A Finite-state Processor for Information Extraction from Real-world Text , 1993, IJCAI.

[18]  Douglas E. Appelt,et al.  Robust Processing of Real-World Natural-Language Texts , 1992, ANLP.

[19]  Philip J. Hayes,et al.  Entity-Oriented Parsing , 1984, ACL.

[20]  Fabio Ciravegna,et al.  Syntax and semantics in a text interpretation system , 1991, RIAO.

[21]  Lisa F. Rau,et al.  Information extraction and text summarization using linguistic knowledge acquisition , 1989, Inf. Process. Manag..