Tilburg University Reliability of test scores in nonparametric item response theory