Preserving Ambiguities in Generation via Automata Intersection

We address the problem of generating text that preserves certain ambiguities, a capability that is useful in applications such as machine translation. We show that a hybrid symbolic/statistical generator can be extended to preserve ambiguities with relatively little effort. The paper presents algorithms and examples, and it discusses practical linguistic difficulties that arise in ambiguity preservation.
