Semantic Clustering of Index Terms

A computer procedure to reorganize indexing vocabularies is described. Index terms are drawn from the vocabulary of a structured indexing system and may consist of single words, collections of words, or syntactic phrases. The basic idea is that a measure of the semantic association between index terms can be determined from the structural relationships which the terms exhibit by their relative positions in the system. The association measure, which is based on a priori (preassigned) semantic relationships between terms, rather than their co-occurrence in a document corpus, is then used for grouping index terms into clusters or concepts. Some results of an experimental investigation are presented. K E Y WORDS AND PHRASES: information, retrieval, clustering,

[1]  M. E. Maron,et al.  On Relevance, Probabilistic Indexing and Information Retrieval , 1960, JACM.

[2]  H. Edmund Stiles,et al.  The Association Factor in Information Retrieval , 1961, JACM.

[3]  Journal of the Association for Computing Machinery , 1961, Nature.

[4]  C. Cleverdon Report on the testing and analysis of an investigation into comparative efficiency of indexing systems , 1962 .

[5]  P. E. Jones,et al.  LINEAR ASSOCIATIVE INFORMATION RETRIEVAL , 1962 .

[6]  Cyril W. Cleverdon,et al.  Aslib Cranfield research project: report on the testing and analysis of an investigation into the comparative efficiency of indexing systems , 1962 .

[7]  Council , 1954, The Aeronautical Journal (1968).

[8]  Claude Berge,et al.  The theory of graphs and its applications , 1962 .

[9]  Gerard Salton,et al.  Associative Document Retrieval Techniques Using Bibliographic Information , 1963, JACM.

[10]  F. Harary,et al.  The theory of graphs and its applications , 1963 .

[11]  Raymond E. Bonner,et al.  On Some Clustering Techniques , 1964, IBM J. Res. Dev..

[12]  Karen Spärck Jones Experiments in semantic classification , 1965, Mech. Transl. Comput. Linguistics.

[13]  Roger M. Needham,et al.  Applications of the theory of clumps , 1965, Mech. Transl. Comput. Linguistics.

[14]  Graph separability and word grouping , 1966, CACM.

[15]  A. R. Meetham Graph separability and word grouping , 1966, ACM '66.

[16]  Peter A. W. Lewis,et al.  Statistical Discrimination of the Synonymy/Antonymy Relationship Between Words , 1967, JACM.