论文信息 - A Modular and Flexible Architecture for an Integrated Corpus Query System

A Modular and Flexible Architecture for an Integrated Corpus Query System

The paper describes the architecture of an integrated and extensible corpus query system developed at the University of Stuttgart and gives examples of some of the modules realized within this architecture. The modules form the core of a corpus workbench. Within the proposed architecture, information required for the evaluation of queries may be derived from different knowledge sources (the corpus text, databases, on-line thesauri) and by different means: either through direct lookup in a database or by calling external tools which may infer the necessary information at the time of query evaluation. The information available and the method of information access can be stated declaratively and individually for each corpus, leading to a flexible, extensible and modular corpus workbench.

Oliver Christ | O. Christ

[1] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[2] George A. Miller,et al. Introduction to WordNet: An On-line Lexical Database , 1990 .

[3] R. H. Baayen,et al. The CELEX Lexical Database (CD-ROM) , 1996 .