Traditional difference-score analyses of reasoning are flawed