An O(n^3) Agenda-Based Chart Parser for Arbitrary Probabilistic Context-Free Grammars

While $O(n^3)$ methods for parsing probabilistic context-free grammars (PCFGs) are well known, a tabular parsing framework for arbitrary PCFGs which naturally allows top-down, bottom-up, and other strategies, corresponding to active chart parsing for CFGs, has not yet been provided. This paper presents such an algorithm, and shows its correctness and advantages over prior work. The paper finishes by bringing out the connections between the algorithm and work on hypergraphs, which permits us to extend the presented Viterbi (best parse) algorithm to an inside (total probability) algorithm.

[1]  David H. D. Warren,et al.  Parsing as Deduction , 1983, ACL.

[2]  J. Baker Trainable grammars for speech recognition , 1979 .

[3]  Joshua Goodman,et al.  Parsing Inside-Out , 1998, ArXiv.

[4]  Martin Kay,et al.  Algorithm schemata and data structures in syntactic processing , 1986 .

[5]  Giorgio Gallo,et al.  Directed Hypergraphs and Applications , 1993, Discret. Appl. Math..

[6]  M.McGee Wood,et al.  Natural language processing in Prolog , 1990 .

[7]  Jean-Cédric Chappelier,et al.  A Generalized CYK Algorithm for Parsing Stochastic CFG , 1998, TAPD.

[8]  Walter L. Ruzzo,et al.  An Improved Context-Free Recognizer , 1980, ACM Trans. Program. Lang. Syst..

[9]  Tadao Kasami,et al.  An Efficient Recognition and Syntax-Analysis Algorithm for Context-Free Languages , 1965 .

[10]  Andreas Stolcke,et al.  An Efficient Probabilistic Context-Free Parsing Algorithm that Computes Prefix Probabilities , 1994, CL.

[11]  Jay Earley,et al.  An efficient context-free parsing algorithm , 1970, Commun. ACM.

[12]  Eugene Charniak,et al.  Figures of Merit for Best-First Probabilistic Chart Parsing , 1998, Comput. Linguistics.

[13]  Daniel H. Younger,et al.  Recognition and Parsing of Context-Free Languages in Time n^3 , 1967, Inf. Control..

[14]  Stuart M. Shieber,et al.  Prolog and Natural-Language Analysis , 1987 .

[15]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[16]  Stuart M. Shieber,et al.  Principles and Implementation of Deductive Parsing , 1994, J. Log. Program..

[17]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[18]  Claude Berge,et al.  Graphs and Hypergraphs , 2021, Clustering.

[19]  Martin Kay,et al.  The MIND System , 1970 .

[20]  Frederick Jelinek,et al.  Basic Methods of Probabilistic Context Free Grammars , 1992 .