Diagnosing University Student Subject Proficiency and Predicting Degree Completion in Vector Space

Author(s): Luo, Y; Pardos, ZA | Editor(s): McIlraith, Sheila A; Weinberger, Kilian Q | Abstract: Copyright © 2018, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. We investigate the issues of undergraduate on-time graduation with respect to subject proficiencies through the lens of representation learning, training a student vector embeddings from a dataset of 8 years of course enrollments. We compare the per-semester student representations of a cohort of undergraduate Integrative Biology majors to those of graduated students in subject areas involved in their degree requirements. The result is an embedding rich in information about the relationships between majors and pathways taken by students which encoded enough information to improve prediction accuracy of on-time graduation to 95%, up from a baseline of 87.3%. Challenges to preparation of the data for student vectorization and sourcing of validation sets for optimization are discussed.

[1]  Nemanja Djuric,et al.  E-commerce in Your Inbox: Product Recommendations at Scale , 2015, KDD.

[2]  Zachary A. Pardos,et al.  The School of Information and its relationship to computer science at UC Berkeley , 2017 .

[3]  William J. Clancey,et al.  Classification Problem Solving , 1984, AAAI.

[4]  Serge Herzog,et al.  Estimating Student Retention and Degree-Completion Time: Decision Trees and Neural Networks Vis-a-Vis Regression. , 2006 .

[5]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[6]  Eitel J. M. Lauría,et al.  Early Alert of Academically At-Risk Students: An Open Source Analytics Initiative , 2014, J. Learn. Anal..

[7]  Lubos Popelínský,et al.  Predicting drop-out from social behaviour of students , 2012, EDM.

[8]  J. J. Lin,et al.  Student Retention Modelling : An Evaluation of Different Methods and their Impact on Prediction Results , 2009 .

[9]  Mykola Pechenizkiy,et al.  Predicting Students Drop Out: A Case Study , 2009, EDM.

[10]  John P. Bean Dropouts and turnover: The synthesis and test of a causal model of student attrition , 1980 .

[11]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[12]  Jacob Whitehill,et al.  Delving Deeper into MOOC Student Dropout Prediction , 2017, ArXiv.

[13]  Zachary A. Pardos,et al.  Predictive Modelling of Student Behavior Using Granular Large-Scale Action Data , 2017 .

[14]  Leonidas J. Guibas,et al.  Deep Knowledge Tracing , 2015, NIPS.

[15]  Carolyn Penstein Rosé,et al.  “ Turn on , Tune in , Drop out ” : Anticipating student dropouts in Massive Open Online Courses , 2013 .

[16]  Jevin D. West,et al.  Predicting Student Dropout in Higher Education , 2016, ArXiv.

[17]  Zachary A. Pardos,et al.  Imputing KCs with Representations of Problem Content and Context , 2017, UMAP.

[18]  J. J. Lin,et al.  Artificial Intelligence Methods To Forecast Engineering Students' Retention Based On Cognitive And Non Cognitive Factors , 2008 .

[19]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[20]  Doug Shapiro,et al.  Time to Degree: A National View of the Time Enrolled and Elapsed for Associate and Bachelor's Degree Earners. (Signature Report No. 11). , 2016 .

[21]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[22]  Vincent Tinto Dropout from Higher Education: A Theoretical Synthesis of Recent Research , 1975 .