Covering the vocabulary of technical abstracts using standard and specialized dictionaries

Natural language applications such as information retrieval systems will increasingly rely on standard dictionaries (in machine-readable form) as a source of lexical information. It is therefore important to determine how well the dictionar ies cover the text that such systems are likely to encounter. Data gathered from an examination of computer science and library and information science abstracts show that the techni cal terms from the domains are not well covered by standard dictionaries Coverage improves with the use of specialized computer science and library and information science dictio naries, although there is great variation in their performance However, these dictionaries were not designed for automatic text processing, and so this paper concludes with a discussion of the difficulties of incorporating such dictionaries into natu ral language processing systems.