Classifiers for educational data mining

The idea of classification is to place an object into one class or category, based on its other characteristics. In education, teachers and instructors are all the time classifying their students for their knowledge, motivation, and behaviour. Assessing exam answers is also a classification task, where a mark is determined according to certain evaluation criteria. Automatic classification is an inevitable part of intelligent tutoring systems and adaptive learning environments. Before the system can select any adaptation action like selecting tasks, learning material, or advice, it should first classify the learner’s current situation. For this purpose, we need a classifier – a model, which predicts the class value from other explanatory attributes. For example, one can derive the student’s motivation level from her/his actions in the tutoring system or predict the students who are likely to fail or drop out from their task scores. Such predictions are equally useful in the traditional teaching, but computerized learning systems often serve larger classes and collect more data for deriving classifiers. Classifiers can be designed manually, based on expert’s knowledge, but nowadays it is more common to learn them from real data. The basic idea is the following: First, we have to choose the classification method, like decision

[1]  W. F. Punch,et al.  Predicting student performance: an application of data mining methods with an educational Web-based system , 2003, 33rd Annual Frontiers in Education, 2003. FIE 2003..

[2]  Peter J. Huber,et al.  Robust Statistics , 2005, Wiley Series in Probability and Statistics.

[3]  Thanh Ha Dang,et al.  Fuzzy Decision Tree for User Modeling from Human-Computer Interactions , 2005 .

[4]  Mykola Pechenizkiy,et al.  Predicting Students Drop Out: A Case Study , 2009, EDM.

[5]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[6]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[7]  Michel C. Desmarais,et al.  A Bayesian Student Model without Hidden Nodes and its Comparison with Item Response Theory , 2005, Int. J. Artif. Intell. Educ..

[8]  Wilhelmiina Hämäläinen,et al.  Comparison of Machine Learning Methods for Intelligent Tutoring Systems , 2006, Intelligent Tutoring Systems.

[9]  Hasmik Mehranian,et al.  Evaluating the Feasibility of Learning Student Models from Data , 2005 .

[10]  Jirí Vomlel,et al.  Bayesian Networks In Educational Testing , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[11]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[12]  Hecht-Nielsen Theory of the backpropagation neural network , 1989 .

[13]  Sotiris B. Kotsiantis,et al.  Preventing Student Dropout in Distance Learning Using Machine Learning Techniques , 2003, KES.

[14]  Stephan Weibelzahl,et al.  Eliciting Adaptation Knowledge from On-Line Tutors to Increase Motivation , 2007, User Modeling.

[15]  Ronald L. Rivest,et al.  Constructing Optimal Binary Decision Trees is NP-Complete , 1976, Inf. Process. Lett..

[16]  Ronald L. Rivest,et al.  Training a 3-node neural network is NP-complete , 1988, COLT '88.

[17]  César Hervás-Martínez,et al.  Data Mining Algorithms to Classify Students , 2008, EDM.

[18]  Philip S. Yu,et al.  Targeting the right students using data mining , 2000, KDD '00.

[19]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[20]  Fuzong Lin,et al.  Investigation of Web-based teaching and learning by boosting algorithms , 2003, International Conference on Information Technology: Research and Education, 2003. Proceedings. ITRE2003..

[21]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[22]  Marc Eisenstadt,et al.  Tools for creating intelligent computer tutors , 1984 .

[23]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[24]  Vasile Paul Bresfelean,et al.  Determining students’ academic failure profile founded on data mining methods , 2008, ITI 2008 - 30th International Conference on Information Technology Interfaces.

[25]  Martin Muehlenbrock Automatic Action Analysis in an Interactive Learning Environment , 2005 .

[26]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[27]  Robert P. W. Duin,et al.  Learned from Neural Networks , 2000 .

[28]  T.R. Rhoads,et al.  Learning from student data , 2004, Proceedings of the 2004 IEEE Systems and Information Engineering Design Symposium, 2004..

[29]  E. Sutinen,et al.  Data Mining In Personalizing DistanceEducation Courses , 2006 .

[30]  Christian Jutten,et al.  Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture , 1991, Signal Process..

[31]  David G. Stork,et al.  Pattern Classification , 1973 .

[32]  Serge Herzog,et al.  Estimating Student Retention and Degree-Completion Time: Decision Trees and Neural Networks Vis-a-Vis Regression. , 2006 .

[33]  Ryan Shaun Joazeiro de Baker,et al.  Detecting Student Misuse of Intelligent Tutoring Systems , 2004, Intelligent Tutoring Systems.

[34]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[36]  Giorgio Valentini,et al.  Ensembles of Learning Machines , 2002, WIRN.

[37]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[38]  Nadine Meskens,et al.  Determination of factors influencing the achievement of the first-year university students using data mining methods , 2006 .

[39]  I. Jolliffe Principal Component Analysis , 2002 .

[40]  Mihaela Cocea,et al.  Can Log Files Analysis Estimate Learners' Level of Motivation? , 2006, LWA.

[41]  Nguyen Thai Nghe,et al.  A comparative analysis of techniques for predicting academic performance , 2007, 2007 37th Annual Frontiers In Education Conference - Global Engineering: Knowledge Without Borders, Opportunities Without Passports.