Statistical Grammar Models and Lexicon Acquisition 12.1 Introduction

This paper presents a framework for developing and training statistical grammar models for the acquisition of lexicon information. Util-ising a robust parsing environment and mathematically well-deened unsupervised training methods, the framework enables us to induce lexicon information from text corpora. Particular strengths of the approach concern (i) the fact that no extensive manual work is required to set up the framework, and (ii) that the framework is applicable to any desired language. It has already been applied to English and Manual work within the framework is reduced to a minimum, since the necessary grammars need not go into detailed structures for the relevant grammar aspects to be trained suuciently. The automatic training process utilises a shallow parser embedded in the mathematically well-deened Expectation-Maximisation algorithm. The training approach enforces the lexicalised parameters in the statistical grammar to obtain linguistic reliability. A basic assumption thereby expects that the linguistically correct analyses of text correspond to those analyses which 59 Linguistic Form and its Computation.

[1]  Frank Keller,et al.  Verb Frame Frequency as a Predictor of Verb Bias , 2001, Journal of psycholinguistic research.

[2]  Mark Johnson,et al.  Lexicalized Stochastic Modeling of Constraint-Based Grammars using Log-Linear Measures and EM Training , 2000, ACL.

[3]  Sabine Schulte im Walde Clustering Verbs Semantically According to their Alternation Behaviour , 2000, COLING.

[4]  Mats Rooth,et al.  Using a Probabilistic Class-Based Lexicon for Lexical Ambiguity Resolution , 2000, COLING.

[5]  Sabine Schulte im Walde,et al.  Robust German Noun Chunking With a Probabilistic Context-Free Grammar , 2000, COLING.

[6]  Mark Johnson,et al.  Exploiting auxiliary distributions in stochastic unification-based grammars , 2000, ANLP.

[7]  Mats Rooth,et al.  Inducing a Semantically Annotated Lexicon via EM-Based Clustering , 1999, ACL.

[8]  Mats Rooth,et al.  Inside-Outside Estimation of a Lexicalized PCFG for German , 1999, ACL.

[9]  Mats Rooth,et al.  Valence Induction with a Head-Lexicalized PCFG , 1998, EMNLP.

[10]  Fernando Pereira,et al.  Aggregate and mixed-order Markov models for statistical language processing , 1997, EMNLP.

[11]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[12]  Glenn Carroll,et al.  Learning probabilistic grammars for language modeling , 1996 .

[13]  Eugene Charniak,et al.  Tree-Bank Grammars , 1996, AAAI/IAAI, Vol. 2.

[14]  Beth Levin,et al.  English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[15]  Naftali Tishby,et al.  Distributional Clustering of English Words , 1993, ACL.

[16]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[17]  Mats Rooth,et al.  Structural Ambiguity and Lexical Relations , 1991, ACL.

[18]  J. Baker Trainable grammars for speech recognition , 1979 .

[19]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[20]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[21]  Vladimir Solmon,et al.  The estimation of stochastic context-free grammars using the Inside-Outside algorithm , 2003 .

[22]  Helmut Schmid,et al.  LoPar: Design and Implementation , 2000 .

[23]  Helmut Schmid,et al.  YAP: parsing and disambiguation with feature based grammars , 1999 .

[24]  Michael I. Jordan,et al.  Unsupervised Learning from Dyadic Data , 1998 .

[25]  Murat Kural,et al.  Verb incorporation and elementary predicates , 1996 .

[26]  Mats Rooth,et al.  Two-dimensional clusters in grammatical relations , 1995 .

[27]  Hermann Ney,et al.  On structuring probabilistic dependences in stochastic language modelling , 1994, Comput. Speech Lang..

[28]  Barbara B. Levin,et al.  English verb classes and alternations , 1993 .

[29]  Ken Hale,et al.  On Argument Structure and the Lexical Expression of Syntactic Relations , 1993 .

[30]  H. Schumacher Verben in Feldern : Valenzwörterbuch zur Syntax und Semantik deutscher Verben , 1986 .

[31]  L. Baum,et al.  An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .