Learning analytics: Collaborative filtering or regression with experts?

An intelligent learning analytics based on an enormous amount of education data is a key enabler of the next generation of education; among many tasks of the intelligent learning analytics, personalized prediction of test responses based on the record of each individual learner is of the utmost importance. In recent years, a variety of machine learning algorithms for predicting test outcomes have been proposed, and two of the most prominent approaches are collaborative filtering and logistic regression. Collaborative filtering is fully data-driven since it does not require any extra information other than test outcomes while logistic regression is applicable only when questions can be independently analyzed by experts. In this work, we first propose a new model for test responses, and propose a collaborative filtering algorithm with enhanced human-interpretability based on the new model. Then, we evaluate the prediction performance of these approaches using a large education data set, collected via mobile applications for English Language Learning. Our experimental results show that the fully data-driven collaborative filtering approach can predict test outcomes better than the logistic regression approach.

[1]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[2]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2009, Found. Comput. Math..

[3]  Christopher Ré,et al.  Parallel stochastic gradient algorithms for large-scale matrix completion , 2013, Mathematical Programming Computation.

[4]  Richard G. Baraniuk,et al.  SPRITE: A Response Model For Multiple Choice Testing , 2015 .

[5]  Yehuda Koren,et al.  Lessons from the Netflix prize challenge , 2007, SKDD.

[6]  J. Daniel,et al.  Making Sense of MOOCs : Musings in a Maze of Myth , Paradox and Possibility Author : , 2013 .

[7]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[8]  Richard G. Baraniuk,et al.  Tag-Aware Ordinal Sparse Factor Analysis for Learning and Content Analytics , 2014, EDM.

[9]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[10]  Emmanuel J. Candès,et al.  Matrix Completion With Noise , 2009, Proceedings of the IEEE.

[11]  Richard G. Baraniuk,et al.  Time-varying learning and content analytics via sparse factor analysis , 2013, KDD.

[12]  Alexander Shapiro,et al.  Stochastic Approximation approach to Stochastic Programming , 2013 .

[13]  Pablo A. Parrilo,et al.  Guaranteed Minimum-Rank Solutions of Linear Matrix Equations via Nuclear Norm Minimization , 2007, SIAM Rev..

[14]  Austin Gibbons,et al.  Learning Analytics , 2014, Encyclopedia of Social Network Analysis and Mining.

[15]  Mark D. Reckase,et al.  MAXLOG: A computer program for the estimation of the parameters of a multidimensional logistic model , 1983 .

[16]  Yoav Bergner,et al.  Model-Based Collaborative Filtering Analysis of Student Response Data: Machine-Learning Item Response Theory , 2012, EDM.

[17]  Emmanuel J. Candès,et al.  The Power of Convex Relaxation: Near-Optimal Matrix Completion , 2009, IEEE Transactions on Information Theory.

[18]  Prateek Jain,et al.  Low-rank matrix completion using alternating minimization , 2012, STOC '13.

[19]  Ioannis Konstas,et al.  On social networks and collaborative recommendation , 2009, SIGIR.