Experiments in Automatic Library of Congress Classification

This article presents the results of research into the automatic selection of Library of Congress Classification numbers based on the titles and subject headings in MARC records. The method used in this study was based on partial match retrieval techniques using various elements of new records (i.e., those to be classified) as “queries,” and a test database of classification clusters generated from previously classified MARC records. Sixty individual methods for automatic classification were tested on a set of 283 new records, using all combinations of four different partial match methods, five query types, and three representations of search terms. The results indicate that if the best method for a particular case can be determined, then up to 88% of the new records may be correctly classified. The single method with the best accuracy was able to select the correct classification for about 46% of the new records.

[1]  N. J. Williamson,et al.  The Library of Congress Classification: problems and prospects in online retrieval , 1986 .

[2]  Elaine Broadbent The online catalog: dictionary, classified, or both? , 1989 .

[3]  Karen Markey,et al.  Dewey decimal classification online project : evaluation of a library schedule and index integrated into the subject searching capabilities of an onlin catalog , 1986 .

[4]  Karen Markey,et al.  Searching and Browsing the Dewey Decimal Classification in an Online Catalog , 1987 .

[5]  Diane Vizine-Goetz,et al.  Characteristics of subject heading records in the machine-readable Library of Congress subject headings , 1989 .

[6]  W. Bruce Croft,et al.  Retrieving documents by plausible inference: An experimental study , 1989, Inf. Process. Manag..

[7]  Chris Buckley,et al.  Implementation of the SMART Information Retrieval System , 1985 .

[8]  Martha L. Manheimer,et al.  Classified Library of Congress subject headings , 1972 .

[9]  Karen Markey,et al.  Preparing for the use of classification in online cataloging systems and in online catalogs , 1985 .

[10]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[11]  L M Chan,et al.  Library of Congress classification as an online retrieval tool: potentials and limitations , 1986 .

[12]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[13]  Barbara C. Dean Reclassification in an Automated Environment , 1985 .

[14]  Elaine Svenonius,et al.  Use of Classification in Online Retrieval , 1983 .

[15]  M. E. Maron,et al.  Automatic Indexing: An Experimental Inquiry , 1961, JACM.

[16]  Ray R. Larson Evaluation of advanced retrieval techniques in an experimental online catalog , 1992 .

[17]  K. Markey,et al.  Findings of the Dewey Decimal Classification on-line project , 1986 .

[18]  W. G. Hoyle Automatic indexing and generation of classification systems by algorithm , 1973, Inf. Storage Retr..

[19]  Elaine Svenonius,et al.  Automatic derivation of name access points in cataloging , 1990, J. Am. Soc. Inf. Sci..

[20]  Diane Vizine-Goetz,et al.  Class dispersion between the Library of Congress Classification and the Dewey Decimal Classification , 1987, J. Am. Soc. Inf. Sci..

[21]  Jeffrey Huestis,et al.  Clustering LC classification numbers in an online catalog for improved browsability , 1988 .

[22]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[23]  Lois Mai Chan,et al.  Inter-Indexer Consistency in Subject Cataloging. , 1989 .

[24]  N. J. Williamson,et al.  Classification in online systems: research and the North American perspective , 1985 .

[25]  Micheline Hancock,et al.  Subject Searching behaviour at the Library Catalogue and at the shelves: Implications for Online Interactive Catalogues , 1987, J. Documentation.

[26]  W. Bruce Croft A model of cluster searching bases on classification , 1980, Inf. Syst..

[27]  Harold Borko,et al.  Automatic Document Classification , 1963, JACM.

[28]  Karen Markey,et al.  Subject-Searching Experiences and Needs of Online Catalog Users: Implications for Library Classification. , 1985 .

[29]  Peter G. B. Enser,et al.  Automatic Classification of Book material Represented by Back‐of‐the‐Book Index , 1985, J. Documentation.

[30]  Nicholas J. Belkin,et al.  Retrieval techniques , 1987 .

[31]  Nancy J. Williamson,et al.  The Role of Classification in Online Systems , 1989 .

[32]  Harold Borko,et al.  Measuring the reliability of subject classification by men and machines , 1964 .

[33]  Lee J. White,et al.  A distance measure for automatic document classification by sequential analysis , 1978, Inf. Process. Manag..

[34]  Carol A. Mandel,et al.  Classification Schedules as Subject Enhancement in Online Catalogs. A Review of a Conference Sponsored by Forest Press, the OCLC Online Computer Library Center, and the Council on Library Resources. , 1986 .

[35]  Ray R. Larson,et al.  Classification Clustering, Probabilistic Information Retrieval, and the Online Catalog , 1991, The Library Quarterly.