Rethinking Confidence Calibration for Failure Prediction