Which covariates should be controlled in propensity score matching? Evidence from a simulation study