Information retrieval using hierarchical dirichlet processes
暂无分享,去创建一个
An information retrieval method is proposed using a hierarchical Dirichlet process as a prior on the parameters of a set of multinomial distributions. The resulting method naturally includes a number of features found in other popular methods. Specifically, tf.idf-like term weighting and document length normalisation are recovered. The new method is compared with Okapi BM-25 [3] and the Twenty-One model [1] on TREC data and is shown to give better performance.
[1] Stephen E. Robertson,et al. Okapi at TREC-3 , 1994, TREC.
[2] Michael I. Jordan,et al. Hierarchical Dirichlet Processes , 2006 .
[3] Djoerd Hiemstra,et al. Twenty-One at TREC7: Ad-hoc and Cross-Language Track , 1998, TREC.