Replicability and Reproducibility of Automatic Routing Runs

This paper reports our participation in CENTRE@CLEF19. We focus on reimplementing submissions by Grossman and Cormack to the TREC 2017 Common Core Track. Our contributions are twofold. Reimplementations are used to study the replicability as well as the reproducibility of WCRobust04 and WCRobust0405. Our results show that the replicability and reproducibility of transferring relevance judgments across different corpora are limited. It is not possible to replicate or reproduce the baseline. However, improvements in evaluation measures by enriching training data are achievable. Further experiments examine general relevance transfer and the augmentation of tfidf-features.