Abstract Educational data mining (EDM) involves the application of data mining, machine learning, and statistics to information generated from an educational setting. In most school education, one teacher teaches many students. A periodic examination is used as a method to confirm that students have acquired skills. However, it is difficult to grasp the status of the student from each lesson, since examinations cannot be carried out easily. On the other hand, in programming classes, the students’ history of UNIX commands and source-code editing can be easily and automatically stored as log-data. Therefore, attempts have been made to estimate the student’s performance from this log-data, although their estimation accuracy is not high. In this research, we aim to extract those students who cannot keep up with programming lessons, rather than estimating the student’s performance from the log-data. Specifically, we propose a method for predicting dropouts using outlier detection to cluster data with unsupervised learning.
[1]
Toon Calders,et al.
Introduction to the special section on educational data mining
,
2012,
SKDD.
[2]
Mykola Pechenizkiy,et al.
Predicting Students Drop Out: A Case Study
,
2009,
EDM.
[3]
Sergei Vassilvitskii,et al.
k-means++: the advantages of careful seeding
,
2007,
SODA '07.
[4]
Kusumoto Shinji,et al.
Programming Process Visualization for Supporting Students in Programming Exercise
,
2012
.
[5]
Eamonn Keogh.
Exact Indexing of Dynamic Time Warping
,
2002,
VLDB.
[6]
Hua Li,et al.
Dropout prediction in MOOCs using behavior features and multi-view semi-supervised learning
,
2016,
2016 International Joint Conference on Neural Networks (IJCNN).
[7]
Sherif A. Halawa,et al.
Dropout Prediction in MOOCs using Learner Activity Features
,
2014
.
[8]
E. Emanuel.
Online education: MOOCs taken by educated few
,
2013,
Nature.