A closer look at the training dynamics of knowledge distillation