Calibrating Ensembles for Scalable Uncertainty Quantification in Deep Learning-based Medical Segmentation