On the generalizability of school-level performance assessment scores