A two-phase machine learning approach for predicting student outcomes