Learning grammatical structure using statistical decision-trees

In this paper, I describe SPATTER, a statistical parser based on decision-tree learning techniques. It avoids the difficulties of grammar development simply by having no grammar: the parser is instead driven by statistical pattern recognizers, in the form of decision trees, trained on correctly parsed sentences. This approach to grammatical inference yields a parser that constructs a complete parse for every sentence and achieves accuracy rates far better than any previously published result.
