Efficient Knowledge Distillation from Model Checkpoints