Information Space Gets Normal

Experiments based on unofficial results for TREC-7 are presented. Eigensystems analysis of a term cooccurrence matrix is compared with eigensystems analysis of a term correlation matrix. For each matrix type, the effects of term weighting and document length normalization are assessed. Recall-precision curves and other TREC statistics indicate that using the correlation matrix improves performance regardless of which term weighting or document length normalization is applied.
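
To make the comparison concrete, the following is a minimal sketch of the two matrix types and their eigendecompositions. It rests on assumptions the text above does not state: the cooccurrence matrix is taken to be the product of a terms-by-documents matrix with its transpose, the correlation matrix is taken to be the Pearson correlation between term rows, and the toy data and function names are hypothetical. It is an illustration only, not the procedure used in the experiments.

```python
import numpy as np


def cooccurrence_matrix(A):
    """Term-by-term cooccurrence, assumed here to be A @ A.T
    for a terms x documents matrix A."""
    return A @ A.T


def correlation_matrix(A):
    """Pearson correlation between the term rows of A (assumed definition).
    np.corrcoef treats each row as one variable by default."""
    return np.corrcoef(A)


def top_eigenvectors(M, k):
    """Return the k largest eigenvalues of a symmetric matrix M
    and the matching eigenvectors as columns."""
    vals, vecs = np.linalg.eigh(M)        # eigh returns ascending eigenvalues
    order = np.argsort(vals)[::-1][:k]    # indices of the k largest
    return vals[order], vecs[:, order]


# Hypothetical toy counts: 4 terms (rows) x 5 documents (columns).
A = np.array([
    [2.0, 0.0, 1.0, 0.0, 3.0],
    [1.0, 0.0, 2.0, 0.0, 1.0],
    [0.0, 3.0, 0.0, 1.0, 0.0],
    [0.0, 1.0, 1.0, 2.0, 0.0],
])

cooc = cooccurrence_matrix(A)
corr = correlation_matrix(A)

# Keep the two leading eigenvectors of each matrix as a reduced term space.
cooc_vals, cooc_vecs = top_eigenvectors(cooc, 2)
corr_vals, corr_vecs = top_eigenvectors(corr, 2)
```

Under these assumptions the correlation matrix centers and scales each term row before comparison, so terms that are merely frequent do not dominate the leading eigenvectors in the way they can for raw cooccurrence counts. Term weighting or document length normalization would be applied to `A` before either matrix is formed.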