Should Colleges Invest in Machine Learning? Comparing the Predictive Powers of Early Momentum Metrics and Machine Learning for Community College Credential Completion. CCRC Working Paper No. 118.

Among community college leaders and others interested in reforms to improve student success, there is growing interest in adopting machine learning (ML) techniques to predict credential completion. However, ML algorithms are often complex and are not readily accessible to practitioners for whom a simpler set of near-term measures may serve as sufficient predictors. This study compares the out-of-sample predictive power of early momentum metrics (EMMs)—13 near-term success measures suggested by the literature—with that of metrics from ML-based models that employ approximately 500 predictors for community college credential completion. Using transcript data from approximately 50,000 students at more than 30 community colleges in two states, I find that the EMMs that were modeled by logistic regression accurately predict completion for approximately 80% of students. This classification performance is comparable to that of the ML-based models. The EMMs even outperform the ML-based models in probability estimation. These findings suggest that EMMs are useful predictors for credential completion and that the marginal gain from using an ML-based model over EMMs is small for credential completion prediction when additional predictors do not have strong rationales to be included in an ML-based model, no matter how large the number of those predictors may be.

[1]  William R. Doyle Effect of increased academic momentum on transfer rates: An application of the generalized propensity score , 2011 .

[2]  Thomas R. Bailey,et al.  Early Momentum Metrics: Why They Matter for College Improvement , 2017 .

[3]  C. Adelman Answers In The Tool Box: Academic Intensity, Attendance Patterns, And Bachelor's Degree Attainment , 1999 .

[4]  Stephen L. DesJardins,et al.  The Use of Matching Methods in Higher Education Research: Answering Whether Attendance at a 2-Year Institution Results in Differences in Educational Attainment , 2009 .

[5]  Xueli Wang,et al.  Modeling Entrance into STEM Fields of Study Among Students Beginning at Community Colleges and Four-Year Institutions , 2013 .

[6]  Juan Carlos Calcagno,et al.  Stepping Stones to a Degree: The Impact of Enrollment Pathways and Milestones on Community College Student Outcomes , 2006 .

[7]  Amy E. Brown,et al.  Building Guided Pathways to Community College Student Success: Promising Practices and Early Evidence From Tennessee , 2018 .

[8]  Brian Rowan,et al.  The Diverted Dream: Community Colleges and the Promise of Educational Opportunity in America, 1900-1985. , 1990 .

[9]  Dheeraj Raju,et al.  Exploring Student Characteristics of Retention that Lead to Graduation in Higher Education Using Data Mining Models , 2015 .

[10]  A. H. Murphy,et al.  Reliability of Subjective Probability Forecasts of Precipitation and Temperature , 1977 .

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Geoff Holmes,et al.  Probability Calibration Trees , 2017, ACML.

[13]  Serge Herzog,et al.  Measuring Determinants of Student Return VS. Dropout/Stopout VS. Transfer: A First-to-Second Year Analysis of New Freshmen , 2005 .

[14]  Sharad Goel,et al.  The Measure and Mismeasure of Fairness: A Critical Review of Fair Machine Learning , 2018, ArXiv.

[15]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[16]  Rich Caruana,et al.  Obtaining Calibrated Probabilities from Boosting , 2005, UAI.

[17]  Rich Caruana,et al.  Predicting good probabilities with supervised learning , 2005, ICML.

[18]  David B. Monaghan,et al.  How Many Credits Should an Undergraduate Take? , 2016 .

[19]  Tim Menzies,et al.  Learning patterns of university student retention , 2011, Expert Syst. Appl..

[20]  Paul Attewell,et al.  What Is Academic Momentum? And Does It Matter? , 2012 .

[21]  Davis Jenkins,et al.  Using Longitudinal Data to Increase Community College Student Success: A Guide to Measuring Milestone and Momentum Point Attainment. CCRC Research Tools No. 2. , 2008 .

[22]  Xueli Wang Toward a Holistic Theoretical Model of Momentum for Community College Student Success , 2017 .

[23]  Dursun Delen,et al.  A comparative analysis of machine learning techniques for student retention management , 2010, Decis. Support Syst..

[24]  Barbara K. Townsend The Contradictory College: The Conflicting Origins, Impacts, and Futures of the Community College , 1996 .

[25]  Davis Jenkins,et al.  Early Momentum Metrics: Leading Indicators for Community College Improvement , 2019 .

[26]  Jevin D. West,et al.  Predicting Student Dropout in Higher Education , 2016, ArXiv.

[27]  C. Adelman Executive Summary: The Toolbox Revisited--Paths to Degree Completion from High School through College. , 2006 .

[28]  Davis Jenkins,et al.  Get With the Program ... and Finish It: Building Guided Pathways to Accelerate Student Completion , 2013 .

[29]  Stephen L. DesJardins,et al.  Artificial Neural Networks: A New Approach to Predicting Application Behavior , 2002 .

[30]  I. Maqsood,et al.  Random Forests and Decision Trees , 2012 .

[31]  M. Pepe The Statistical Evaluation of Medical Tests for Classification and Prediction , 2003 .

[32]  Klaus Nordhausen,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition by Trevor Hastie, Robert Tibshirani, Jerome Friedman , 2009 .

[33]  Xueli Wang Pathway to a Baccalaureate in STEM Fields , 2015 .

[34]  Dursun Delen,et al.  Predicting Student Attrition with Data Mining Methods , 2011 .

[35]  D. Fike,et al.  Predictors of First-Year Student Retention in the Community College , 2008 .

[36]  Davis Jenkins,et al.  Momentum: The Academic and Economic Value of a 15-Credit First-Semester Course Load for College Students in Tennessee , 2016 .