Swift: Expedited Failure Recovery for Large-Scale DNN Training