Designs for evaluating the validity and accuracy of performance ratings