Distributional phrase structure induction

Unsupervised grammar induction systems commonly judge potential constituents on the basis of their effects on the likelihood of the data. Linguistic justifications of constituency, on the other hand, rely on notions such as substitutability and varying external contexts. We describe two systems for distributional grammar induction which operate on such principles, using part-of-speech tags as the contextual features. The advantages and disadvantages of these systems are examined, including precision/recall trade-offs, error analysis, and extensibility.

[1]  Steve Young,et al.  Applications of stochastic context-free grammars using the Inside-Outside algorithm , 1990 .

[2]  Noam Chomsky,et al.  The Sound Pattern of English , 1968 .

[3]  Vladimir Solmon,et al.  The estimation of stochastic context-free grammars using the Inside-Outside algorithm , 2003 .

[4]  P. Resnik Selection and information: a class-based approach to lexical relationships , 1993 .

[5]  Glenn Carroll,et al.  Two Experiments on Learning Probabilistic Dependency Grammars from Corpora , 1992 .

[6]  Eugene Charniak,et al.  A Maximum-Entropy-Inspired Parser , 2000, ANLP.

[7]  Zellig S. Harris,et al.  Methods in structural linguistics. , 1952 .

[8]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[9]  Michael Halliday,et al.  An Introduction to Functional Grammar , 1985 .

[10]  Z. Harris,et al.  Methods in structural linguistics. , 1952 .

[11]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[12]  Steven Abney,et al.  The English Noun Phrase in its Sentential Aspect , 1972 .

[13]  Eugene Charniak,et al.  Statistical language learning , 1997 .

[14]  Radford,et al.  转换生成语法教程 = Transformational Grammar , 2000 .

[15]  Eric Brill,et al.  Automatic Grammar Induction and Parsing Free Text: A Transformation-Based Approach , 1993, ACL.

[16]  Steven P. Abney Stochastic Attribute-Value Grammars , 1996, CL.

[17]  Andreas Stolcke,et al.  Inducing Probabilistic Grammars by Bayesian Model Merging , 1994, ICGI.