Assuring the reliability of resident performance appraisals: more items or more observations?