Performance Standards and Evaluations in IR Test Collections: Cluster-Based Retrieval Models

Low performance standards for the group of queries in 13 retrieval test collections have been computed. Derived from the random graph hypothesis, these standards represent the highest levels of retrieval effectiveness that can be obtained from meaningless clustering structures. Operational levels of cluster-based performance reported in selected sources during the past 20 years have been compared to the standards. Comparisons show that typical levels of operational cluster-based retrieval can be explained on the basis of chance. Indeed, most operational results in retrieval test collections are lower than those predicted by random graph theory. A tentative explanation for the poor performance of cluster-based retrieval reveals weaknesses in both fundamental assumptions and operational implementations. The cluster hypothesis offers no guarantee that relevant documents are naturally grouped together, clustering algorithms may not reveal the inherent structure in a set of documents, and retrieval strategies do not reliably retrieve the most effective cluster or clusters of documents. That most cluster-based retrieval implementations implicitly rely on topical relatedness to be equivalent to a relevance relationship contributes to the poor performance. Clustering strategies capable of adapting to relevance information may succeed where static clustering techniques have failed.

[1]  Peter Willett,et al.  Recent trends in hierarchic document clustering: A critical review , 1988, Inf. Process. Manag..

[2]  D. S. Mitrinovic,et al.  Means and Their Inequalities , 1989, Int. J. Math. Math. Sci..

[3]  Robert Burgin The retrieval effectiveness of five clustering algorithms as a function of indexing exhaustivity , 1995 .

[4]  W. Bruce Croft,et al.  Document clustering: An evaluation of some experiments with the cranfield 1400 collection , 1975, Inf. Process. Manag..

[5]  W. Bruce Croft A model of cluster searching bases on classification , 1980, Inf. Syst..

[6]  Anil K. Jain,et al.  Validity studies in clustering methodologies , 1979, Pattern Recognit..

[7]  K. Sparck Jones,et al.  INFORMATION RETRIEVAL TEST COLLECTIONS , 1976 .

[8]  Gerard Salton,et al.  A Comparison Between Manual and Automatic Indexing Methods , 1968 .

[9]  William M. Shaw,et al.  An investigation of document structures , 1990, Inf. Process. Manag..

[10]  Edward A. Fox,et al.  Characterization of Two New Experimental Collections in Computer and Information Science Containing Textual and Bibliographic Concepts , 1983 .

[11]  William M. Shaw,et al.  Termrelevance Computations and Perfect Retrieval Performance , 1995, Inf. Process. Manag..

[12]  William M. Shaw,et al.  Retrieval Expectations, Cluster-Based Effectiveness, and Performance Standards in the CF Database , 1994, Inf. Process. Manag..

[13]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[14]  Frank Harary,et al.  Graph Theory , 2016 .

[15]  Peter Willett,et al.  On the Non-Random Nature of the Nearest-Neighbour Document Clusters , 1993, Inf. Process. Manag..

[16]  Peter Willett,et al.  Hierarchic Agglomerative Clustering Methods for Automatic Document Classification , 1984, J. Documentation.

[17]  Robert Burgin,et al.  Performance Standards and Evaluations in IR Test Collections: Vector-Space and Other Retrieval Models , 1997, Inf. Process. Manag..

[18]  Jr. W. M. Shaw Subject and citation indexing. Part II: The optimal, cluster‐based retrieval performance of composite representations , 1991 .

[19]  William M. Shaw,et al.  Subject indexing and citation indexing--part I: Clustering structure in the cystic fibrosis document collection , 1990, Inf. Process. Manag..

[20]  W. Robertson,et al.  A neural algorithm for document clustering , 1991, Inf. Process. Manag..

[21]  Gerard Salton,et al.  A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART) , 1972, J. Am. Soc. Inf. Sci..

[22]  J. Kenney,et al.  Mathematics of statistics , 1940 .

[23]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[24]  R. F. Ling The Expected Number of Components in Random Linear Graphs , 1973 .

[25]  K. Sparck Jones,et al.  A TEST FOR THE SEPARATION OF RELEVANT AND NON‐RELEVANT DOCUMENTS IN EXPERIMENTAL RETRIEVAL COLLECTIONS , 1973 .

[26]  R. F. Ling An exact probability distribution on the connectivity of random graphs , 1975 .

[27]  Michael D. Gordon User‐based document clustering by redescribing subject descriptions with a genetic algorithm , 1991 .

[28]  C. J. van Rijsbergen Further experiments with hierarchic clustering in document retrieval , 1974, Inf. Storage Retr..

[29]  W. John Wilbur,et al.  The Effectiveness of Document Neighboring in Search Enhancement , 1994, Inf. Process. Manag..

[30]  Ellen M. Vdorhees The cluster hypothesis revisited , 1985, SIGIR 1985.

[31]  Carol L. Barry User-defined relevance criteria: an exploratory study , 1994 .

[32]  W. M. Shaw On the Foundation of Evaluation. , 1986 .

[33]  Helen R. Tibbo,et al.  The Cystic Fibrosis Database: Content and Research Opportunities. , 1991 .

[34]  Peter Willett,et al.  Comparison of Hierarchie Agglomerative Clustering Methods for Document Retrieval , 1989, Comput. J..

[35]  E. Michael Keen,et al.  Presenting Results of Experimental Retrieval Comparisons , 1997, Inf. Process. Manag..

[36]  W. M. Shaw Subject and Citation Indexing. Part I: The Clustering Structure of Composite Representations in the Cystic Fibrosis Document Collection. Part II: The Optimal, Cluster-Based Retrieval Performance of Composite Representations. , 1991 .

[37]  Anil K. Jain,et al.  Clustering Methodologies in Exploratory Data Analysis , 1980, Adv. Comput..

[38]  R. F. Ling,et al.  Probability Tables for Cluster Analysis Based on a Theory of Random Graphs , 1976 .

[39]  William M. Shaw,et al.  Subject indexing and citation indexing-- part II: An evaluation and comparison , 1990, Inf. Process. Manag..

[40]  Peter Willett,et al.  Using interdocument similarity information in document retrieval systems , 1997 .

[41]  Peter Willett,et al.  Techniques for the measurement of clustering tendency in document retrieval systems , 1987, J. Inf. Sci..

[42]  Kui-Lam Kwok,et al.  Experiments with a component theory of probabilistic information retrieval based on single terms as document components , 1990, TOIS.

[43]  William M. Shaw,et al.  Controlled and Uncontrolled Subject Descriptions in the CF Database: A Comparison of Optimal Cluster-Based Retrieval Results , 1993, Inf. Process. Manag..