Predicting Students’ GPA and Developing Intervention Strategies Based on Self-Regulatory Learning Behaviors

Predicting students’ grades has emerged as a major area of investigation in education due to the desire to identify the underlying factors that influence academic performance. Because of limited success in predicting the grade point average (GPA), most of the prior research has focused on predicting grades in a specific set of classes based on students’ prior performances. The issues associated with data-driven models of GPA prediction are further amplified by a small sample size and a relatively large dimensionality of observations in an experiment. In this paper, we utilize the state-of-the-art machine learning techniques to construct and validate a predictive model of GPA solely based on a set of self-regulatory learning behaviors determined in a relatively small-sample experiment. We quantify the predictability of each constituents of the constructed model and discuss its relevance. Ultimately, the goal of grade prediction in similar experiments is to use the constructed models for the design of intervention strategies aimed at helping students at risk of academic failure. In this regard, we lay the mathematical groundwork for defining and detecting probably helpful interventions using a probabilistic predictive model of GPA. We demonstrate the application of this framework by defining basic interventions and detecting those interventions that are probably helpful to students with a low GPA. The use of self-regulatory behaviors is warranted, because the proposed interventions can be easily practiced by students.

[1]  Sanja Fidler,et al.  Human-Machine CRFs for Identifying Bottlenecks in Scene Understanding , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  David E. Pritchard,et al.  Data from a Web-based Homework Tutor can predict Student’s Final Exam Score , 2005 .

[3]  B. Zimmerman Self-Regulated Learning and Academic Achievement: An Overview , 1990 .

[4]  Vincent Y. F. Tan,et al.  Learning Graphical Models for Hypothesis Testing and Classification , 2010, IEEE Transactions on Signal Processing.

[5]  J. Vitale,et al.  Explaining Student Performance in an Undergraduate Agricultural Economics Classroom , 2010 .

[6]  Robert E. Slavin,et al.  Effective Programs in Middle and High School Mathematics: A Best-Evidence Synthesis , 2008 .

[7]  Wael R. Elwasif,et al.  Predicting performance from test scores using backpropagation and counterpropagation , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[8]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[9]  Richard E. Zartman,et al.  Variables that Influence Student Performance in an Introductory Soils Class. , 2006 .

[10]  Amin Zollanvari,et al.  The Illusion of Distribution-Free Small-Sample Classification in Genomics , 2011, Current genomics.

[11]  Michael G. Madden,et al.  On the classification performance of TAN and general Bayesian networks , 2008, Knowl. Based Syst..

[12]  G. McLachlan The bias of the apparent error rate in discriminant analysis , 1976 .

[13]  Alfred Joseph Lizzio,et al.  University Students' Perceptions of the Learning Environment and Academic Outcomes: Implications for theory and practice , 2002 .

[14]  Timothy Wang,et al.  Using neural networks to predict student's performance , 2002, International Conference on Computers in Education, 2002. Proceedings..

[15]  Tom Gedeon,et al.  Explaining student grades predicted by a neural network , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[16]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[17]  Robert E. Slavin,et al.  Effective Programs in Elementary Mathematics: A Best-Evidence Synthesis , 2007 .

[18]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[19]  Analía Amandi,et al.  Evaluating Bayesian networks' precision for detecting students' learning styles , 2007, Comput. Educ..

[20]  Daniel Martinez,et al.  Predicting Student Outcomes Using Discriminant Function Analysis. , 2001 .

[21]  Ronald H. Stevens,et al.  Developing a framework for integrating prior problem solving and knowledge sharing histories of a group to predict future group performance , 2005, 2005 International Conference on Collaborative Computing: Networking, Applications and Worksharing.

[22]  Amy L. Dent,et al.  The Relation Between Self-Regulated Learning and Academic Achievement Across Childhood and Adolescence: A Meta-Analysis , 2016 .

[23]  Roy D. Goldman,et al.  Why College Grade Point Average is Difficult to Predict. , 1976 .

[24]  Erkki Sutinen,et al.  Using data mining for improving web-based course design , 2002, International Conference on Computers in Education, 2002. Proceedings..

[25]  Chien-Chung Chan,et al.  A Framework for Assessing Usage of Web-Based e-Learning Systems , 2007, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007).

[26]  A. Furnham,et al.  Personality, intelligence and approaches to learning as predictors of academic performance , 2008 .

[27]  Gabriele Oettingen,et al.  Planning promotes goal striving. , 2011 .

[28]  C. Abraham,et al.  Psychological correlates of university students' academic performance: a systematic review and meta-analysis. , 2012, Psychological bulletin.

[29]  Sebastian Tschiatschek,et al.  On Bayesian Network Classifiers with Reduced Precision Parameters , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Ulisses Braga-Neto,et al.  Bolstered error estimation , 2004, Pattern Recognit..

[31]  Eva Lucrecia Gibaja Galindo,et al.  Predicting students' marks from Moodle logs using neural network models , 2006 .

[32]  Rina Dechter,et al.  Bucket elimination: A unifying framework for probabilistic inference , 1996, UAI.

[33]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Zachary A. Pardos,et al.  The Effect of Model Granularity on Student Performance Prediction Using Bayesian Networks , 2007, User Modeling.

[35]  Stuart A. Karabenick,et al.  Relationship of Academic Help Seeking to the Use of Learning Strategies and Other Instrumental Achievement Behavior in College Students. , 1991 .

[36]  B. Zimmerman Becoming a Self-Regulated Learner: An Overview , 2002 .

[37]  E. Gehan,et al.  The properties of high-dimensional data spaces: implications for exploring gene and protein expression data , 2008, Nature Reviews Cancer.

[38]  T. Kadir,et al.  Bayesian Networks for Clinical Decision Support in Lung Cancer Care , 2013, PloS one.

[39]  Edward R. Dougherty,et al.  Coefficient of determination in nonlinear signal processing , 2000, Signal Process..

[40]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[41]  Robert E. Slavin,et al.  The Relationship Between Sample Sizes and Effect Sizes in Systematic Reviews in Education , 2009 .

[42]  Geoffrey J McLachlan,et al.  Selection bias in gene extraction on the basis of microarray gene-expression data , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[43]  Theofanis Sapatinas,et al.  Discriminant Analysis and Statistical Pattern Recognition , 2005 .

[44]  B. Zimmerman,et al.  Development of a Structured Interview for Assessing Student Use of Self-Regulated Learning Strategies , 1986 .

[45]  B. Junker,et al.  Do Skills Combine Additively to Predict Task Di culty in Eighth-grade Mathematics ? , 2006 .

[46]  C. Midden,et al.  To plan or not to plan? Goal achievement or interrupting the performance of mundane behaviors , 1999 .

[47]  Lynn Hasher,et al.  Time of day, Intellectual Performance, and Behavioral Problems in Morning Versus Evening type Adolescents: Is there a Synchrony Effect? , 2007, Personality and individual differences.

[48]  Bart Baesens,et al.  Toward Comprehensible Software Fault Prediction Models Using Bayesian Network Classifiers , 2013, IEEE Transactions on Software Engineering.

[49]  A. Furnham,et al.  Personality, cognitive ability, and beliefs about intelligence as predictors of academic performance , 2003 .

[50]  Chih-Ming Chen,et al.  Mining Key Formative Assessment Rules based on Learner Profiles for Web-based Learning Systems , 2007, Seventh IEEE International Conference on Advanced Learning Technologies (ICALT 2007).

[51]  Susan D. Voyer,et al.  Gender differences in scholastic achievement: a meta-analysis. , 2014, Psychological bulletin.

[52]  V. O. Oladokun,et al.  Predicting Students' Academic Performance using Artificial Neural Network: A Case Study of an Engineering Course. , 2008 .

[53]  P. Golding,et al.  Predicting Academic Performance , 2006, Proceedings. Frontiers in Education. 36th Annual Conference.

[54]  Edward R. Dougherty,et al.  Is cross-validation valid for small-sample microarray classification? , 2004, Bioinform..

[55]  Anil K. Jain,et al.  On the optimal number of features in the classification of multivariate Gaussian data , 1978, Pattern Recognit..

[56]  Brian W. Junker,et al.  Predicting end-of-year accountability assessment scores from monthly student records in an online tutoring system , 2006 .

[57]  Sotiris B. Kotsiantis,et al.  Predicting students marks in Hellenic Open University , 2005, Fifth IEEE International Conference on Advanced Learning Technologies (ICALT'05).

[58]  Amin Zollanvari,et al.  High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries , 2015, Cancer informatics.

[59]  P. Haddawy,et al.  A decision support system for evaluating international student applications , 2007, 2007 37th Annual Frontiers In Education Conference - Global Engineering: Knowledge Without Borders, Opportunities Without Passports.

[60]  Carolina Ruiz,et al.  The Composition Effect: Conjuntive or Compensatory? An Analysis of Multi-Skill Math Questions in ITS , 2008, EDM.