Surrogate-guided sampling designs for classification of rare outcomes from electronic medical records data