Comparison of the validity of bookmark and Angoff standard setting methods in medical performance tests