Learning Stochastic Lexicalized Tree Grammars from HPSG

We present a method for automatically extracting a Stochastic Lexicalized Tree Grammar (SLTG) from an HPSG source grammar and a given corpus. An SLTG is processed by a specialized fast parser. The approach has been tested on a large English grammar and has been shown to achieve a speed-up of more than a factor of 10 compared to parsing with a highly tuned HPSG parser. Our approach is simple and transparent, and comes with no magic tuning strategies. The extracted grammars are declaratively represented and have a high degree of practical applicability.
