Confidence intervals for test scores and significance tests for test score differences: a comparison of methods.