A System for Automating Concordance Line Selection.

This paper argues that, as corpora grow and the number of concordance lines presented to researchers grows with them, automatic selection of the most central lines will become increasingly important. It presents a method for selecting the most representative members of a set of concordance lines on the basis of repeated lexical features. The method follows from previous work on lexical cohesion that was used in a system for automatic abridgement generation. Such a system would keep the corpus material fully accessible while avoiding presenting so much data that researchers become overloaded. (Contains 7 references.) (Author/CK)
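
The idea of choosing representative lines by repeated lexical features can be illustrated with a minimal sketch. This is not the paper's own algorithm, which the abstract does not specify. It assumes that a "repeated lexical feature" is simply a word shared with at least one other line, and that a line's centrality is its number of such links. The names `tokenize` and `select_central_lines`, the stopword handling, and the parameter `k` are all hypothetical.

```python
from collections import Counter


def tokenize(line):
    """Lowercase a line and strip surrounding punctuation from each word."""
    return [w.strip(".,;:!?\"'()").lower() for w in line.split()]


def select_central_lines(lines, node, k=2, stopwords=frozenset()):
    """Return the k lines that share the most words with other lines.

    Assumption: a line is 'central' if its words recur in many other
    lines. The node word itself and any stopwords are ignored.
    """
    # Distinct candidate words per line.
    features = []
    for line in lines:
        words = {
            w for w in tokenize(line)
            if w and w != node and w not in stopwords
        }
        features.append(words)

    # For each word, the number of lines in which it occurs.
    line_freq = Counter(w for words in features for w in words)

    # Score: for every word in the line, count the other lines containing it.
    scores = [sum(line_freq[w] - 1 for w in words) for words in features]

    # Highest score first; ties keep the original line order.
    ranked = sorted(range(len(lines)), key=lambda i: (-scores[i], i))
    return [lines[i] for i in ranked[:k]]
```

Given a handful of lines for the node word "bank", a call such as `select_central_lines(lines, "bank", k=2, stopwords={"the", "on", "at"})` would favour lines whose vocabulary (for example "interest" and "rates") recurs across the set, and rank a line with no shared vocabulary last. A fuller system would presumably use richer features, such as lemmas or collocates, than this word-overlap count.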