Paper information - Towards a Workbench for Acquisition of Domain Knowledge from Natural Language

In this paper we describe the architecture and functionality of the main components of a workbench for acquiring domain knowledge from large text corpora. The workbench supports an incremental process of corpus analysis, starting from a rough automatic extraction and organization of lexico-semantic regularities and ending with a computer-supported analysis of the extracted data and a semiautomatic refinement of the obtained hypotheses. To do this, the workbench employs methods from computational linguistics, information retrieval, and knowledge engineering. Although the workbench is still under implementation, some of its components are already implemented, and their performance is illustrated with samples from engineering for a medical domain.

Andrei Mikheev | Steven Finch

[1] Chris Buckley, et al. OHSUMED: an interactive retrieval evaluation and new large test collection for research, 1994, SIGIR '94.

[2] Ralph Grishman, et al. Analyzing language in restricted domains: sublanguage description and processing, 1986.

[3] Steven Finch, et al. Finding structure in language, 1995.

[4] Charles F. Goldfarb, et al. SGML handbook, 1990.

[5] H. Ross. Principles of Numerical Taxonomy, 1964.