Towards Probabilistic Unification-Based Parsing

This thesis concerns natural language parsing with corpus-based grammars enriched with statistics. Parsing is the process of analysing the syntactic structure of an utterance; the statistics serve to improve that process. The research was carried out within the SCHISMA project, which develops a natural language dialogue system for theatre information and booking services.
