Yet Another Chart-Based Technique for Parsing Ill-Formed Input

A new chart-based technique for parsing ill-formed input is proposed. This can process sentences with unknown/misspelled words, omitted words or extraneous words. This generalized parsing strategy is, similar to Mellish's, based on an active chart parser, and shares the many advantages of Mellish's technique. It is based on pure syntactic knowledge, it is independent of all grammars, and it does not slow down the original parsing operation if there is no ill-formedness. However, unlike Mellish's technique, it doesn't employ any complicated heuristic parameters. There are two key points. First, instead of using a unified or interleaved process for finding errors and correcting them, we separate the initial error detection stage from the other stages and adopt a version of bi-directional parsing. This effectively prunes the search space. Second, it employs normal top-down parsing, in which each parsing state reflects the global context, instead of topdown chart parsing. This enables the technique to determine the global plausibility of candidates easily, based on an admissible A search. The proposed strategy could enumerate all possible minimal-penalty solutions in just 4 times the time taken to parse the correct sentences.

[1]  Martin Kay,et al.  Algorithm schemata and data structures in syntactic processing , 1986 .

[2]  Chris Mellish,et al.  Natural Language Processing in Pop-11: An Introduction to Computational Linguistics , 1989 .

[3]  Chris Mellish,et al.  Some Chart-Based Techniques for Parsing Ill-Formed Input , 1989, ACL.

[4]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[5]  Donald Hindle,et al.  Deterministic Parsing of Syntactic Non-fluencies , 1983, ACL.

[6]  M.McGee Wood,et al.  Natural language processing in LISP , 1990 .

[7]  John Bear,et al.  Integrating Multiple Knowledge Sources for Detection and Correction of Repairs in Human-Computer Dialog , 1992, ACL.

[8]  Norman K. Sondheimer,et al.  Meta-Rules as a Basis for Processing III-Formed Input , 1983, Am. J. Comput. Linguistics.

[9]  Jaime G. Carbonell,et al.  Recovery Strategies for Parsing Extragrammatical Language , 1983, CL.

[10]  Giorgio Satta,et al.  Formal Properties and Implementation of Bidirectional Charts , 1989, IJCAI.

[11]  Bernard Lang,et al.  Parsing Incomplete Sentences , 1988, COLING.

[12]  David M. Magerman,et al.  Efficiency, Robustness and Accuracy in Picky Chart Parsing , 1992, ACL.

[13]  Masaru Tomita,et al.  Parsing noisy sentences , 1988, COLING.

[14]  Patrick Henry Winston,et al.  Artificial intelligence (2nd ed.) , 1984 .

[15]  N. Curteanu Book Reviews: Lecture on Contemporary Syntactic Theories: An Introduction to Unification-Based Approaches to Grammar , 1987, CL.

[16]  David H. D. Warren,et al.  Definite Clause Grammars for Language Analysis - A Survey of the Formalism and a Comparison with Augmented Transition Networks , 1980, Artif. Intell..