Paper Information - A Data-Oriented Approach to Semantic Interpretation

A Data-Oriented Approach to Semantic Interpretation

In Data-Oriented Parsing (DOP), an annotated language corpus is used as a stochastic grammar. The most probable analysis of a new input sentence is constructed by combining sub-analyses from the corpus in the most probable way. This approach has been successfully used for syntactic analysis, using corpora with syntactic annotations such as the Penn Treebank. If a corpus with semantically annotated sentences is used, the same approach can also generate the most probable semantic interpretation of an input sentence. The present paper explains this semantic interpretation method and summarizes the results of a preliminary experiment. Semantic annotations were added to the syntactic annotations of most of the sentences of the ATIS corpus. A data-oriented semantic interpretation algorithm was successfully tested on this semantically enriched corpus.

Rens Bod | Remko Scha | Remko Bonnema

[1] Ralph Grishman, et al. A Corpus-based Probabilistic Grammar with Only Two Non-terminals, 1995, IWPT.

[2] Rens Bod. Using an Annotated Corpus as a Stochastic Grammar, 1993, EACL.

[3] Maryellen C. MacDonald, et al. The lexical nature of syntactic ambiguity resolution, 1994.

[4] Rens Bod, et al. Two Questions about Data-Oriented Parsing, 1996, VLC@COLING.

[5] Eugene Charniak, et al. Tree-Bank Grammars, 1996, AAAI/IAAI, Vol. 2.

[6] Rens Bod. Monte Carlo Parsing, 1993, IWPT.

[7] Khalil Sima’an, et al. An optimised algorithm for data oriented parsing, 1997.

[8] Remko Scha, et al. A Corpus-based Approach to Semantic Interpretation, 1994.

[9] Mitchell P. Marcus, et al. Building a large annotated corpus of English, 1993.

[10] Beatrice Santorini, et al. Building a Large Annotated Corpus of English: The Penn Treebank, 1993, CL.

[11] Fernando Pereira, et al. Inside-Outside Reestimation From Partially Bracketed Corpora, 1992, HLT.

[12] Remko Scha, et al. The Interpretation of Relational Nouns, 1988, ACL.