Prediction in the Era of Massive Datasets