How and how well do workplace assessments work? Using contextual variations in a theory-based evaluation with a large N