Syntactic natural language parsers have shown themselves to be inadequate for processing highly-ambiguous large-vocabulary text, as is evidenced by their poor performance on domains like the Wall Street Journal, and by the movement away from parsing-based approaches to text-processing in general. In this paper, I describe SPATTER, a statistical parser based on decision-tree learning techniques which constructs a complete parse for every sentence and achieves accuracy rates far better than any published result. This work is based on the following premises: (1) grammars are too complex and detailed to develop manually for most interesting domains; (2) parsing models must rely heavily on lexical and contextual information to analyze sentences accurately; and (3) existing n-gram modeling techniques are inadequate for parsing models. In experiments comparing SPATTER with IBM's computer manuals parser, SPATTER significantly outperforms the grammar-based parser. Evaluating SPATTER against the Penn Treebank Wall Street Journal corpus using the PARSEVAL measures, SPATTER achieves 86% precision, 86% recall, and 1.3 crossing brackets per sentence for sentences of 40 words or less, and 91% precision, 90% recall, and 0.5 crossing brackets for sentences between 10 and 20 words in length.
[1]
L. Baum,et al.
An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process
,
1972
.
[2]
Leo Breiman,et al.
Classification and Regression Trees
,
1984
.
[3]
Lalit R. Bahl,et al.
A tree-based statistical language model for natural language speech recognition
,
1989,
IEEE Trans. Acoust. Speech Signal Process..
[4]
Ralph Grishman,et al.
A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars
,
1991,
HLT.
[5]
Robert L. Mercer,et al.
Class-Based n-gram Models of Natural Language
,
1992,
CL.
[6]
G. Leech,et al.
Statistically-driven computer grammars of English : the IBM/LANCASTER approach
,
1993
.
[7]
David M. Magerman.
Natural Language Parsing as Statistical Pattern Recognition
,
1994,
ArXiv.