Evaluation is necessary to produce stereotype threat performance effects