The State of Educational Data Mining in 2009: A Review and Future Visions.

We review the history and current trends in the field of Educational Data Mining (EDM). We consider the methodological profile of research in the early years of EDM, compared to in 2008 and 2009, and discuss trends and shifts in the research conducted by this community. In particular, we discuss the increased emphasis on prediction, the emergence of work using existing models to make scientific discoveries (“discovery with models”), and the reduction in the frequency of relationship mining within the EDM community. We discuss two ways that researchers have attempted to categorize the diversity of research in educational data mining research, and review the types of research problems that these methods have been used to address. The mostcited papers in EDM between 1995 and 2005 are listed, and their influence on the EDM community (and beyond the EDM community) is discussed.

[1]  Ryan Shaun Joazeiro de Baker,et al.  Modeling and understanding students' off-task behavior in intelligent tutoring systems , 2007, CHI.

[2]  Gordon I. McCalla,et al.  Utilizing Artificial Learners to Help Overcome the Cold-Start Problem in a Pedagogically-Oriented Paper Recommendation System , 2004, AH.

[3]  Kenneth R. Koedinger,et al.  An Open Repository and analysis tools for fine-grained, longitudinal learner data , 2008, EDM.

[4]  Kenneth R. Koedinger,et al.  Using Item-type Performance Covariance to Improve the Skill Model of an Existing Tutor , 2008, EDM.

[5]  Osmar R. Zaïane,et al.  Building a Recommender Agent for e-Learning Systems , 2002, ICCE.

[6]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[7]  Richard C. Anderson,et al.  FEEDBACK PROCEDURES IN COMPUTER-ASSISTED ARITHMETIC INSTRUCTION , 1973 .

[8]  Tiffany Barnes,et al.  The Q-matrix Method: Mining Student Response Data for Knowledge , 2005 .

[9]  Nadine Meskens,et al.  Determination of factors influencing the achievement of the first-year university students using data mining methods , 2006 .

[10]  Osmar R. Zaïane,et al.  Web Usage Mining for a Better Web-Based Learning Environment , 2001 .

[11]  Ryan Shaun Joazeiro de Baker,et al.  Detecting Student Misuse of Intelligent Tutoring Systems , 2004, Intelligent Tutoring Systems.

[12]  Agathe Merceron,et al.  A Web-Based Tutoring Tool with Mining Facilities to Improve Learning and Teaching , 2003 .

[13]  César Hervás-Martínez,et al.  Data Mining Algorithms to Classify Students , 2008, EDM.

[14]  J. Schofield Computers and classroom culture , 1995 .

[15]  Michel C. Desmarais,et al.  A Bayesian Student Model without Hidden Nodes and its Comparison with Item Response Theory , 2005, Int. J. Artif. Intell. Educ..

[16]  Sebastián Ventura,et al.  Educational data mining: A survey from 1995 to 2005 , 2007, Expert Syst. Appl..

[17]  Joseph E. Beck,et al.  High-Level Student Modeling with Machine Learning , 2000, Intelligent Tutoring Systems.

[18]  Sebastián Ventura,et al.  Discovering Prediction Rules in AHA! Courses , 2003, User Modeling.

[19]  P. Tannenbaum,et al.  Theories of cognitive consistency: a sourcebook. , 1968 .

[20]  Jack Mostow,et al.  How Who Should Practice: Using Learning Decomposition to Evaluate the Efficacy of Different Types of Practice for Different Types of Students , 2008, Intelligent Tutoring Systems.

[21]  Gordon I. McCalla,et al.  Smart Recommendation for an Evolving E-Learning System: Architecture and Experiment , 2005 .

[22]  Arnon Hershkovitz,et al.  The Impact of Off-task and Gaming Behaviors on Learning: Immediate or Aggregate? , 2009, AIED.

[23]  Mykola Pechenizkiy,et al.  Predicting Students Drop Out: A Case Study , 2009, EDM.

[24]  S. Tanimoto Improving the Prospects for Educational Data Mining , 2007 .

[25]  Gautam Biswas,et al.  Mining Student Behavior Models in Learning-by-Teaching Environments , 2008, EDM.

[26]  Osmar R. Za ¨ õane Building a Recommender Agent for e-Learning Systems , 2002 .

[27]  James C. Lester,et al.  Modeling self-efficacy in intelligent tutoring systems: An inductive approach , 2008, User Modeling and User-Adapted Interaction.

[28]  J. Beck Difficulties in inferring student knowledge from observations ( and why you should care ) , 2007 .

[29]  Judy Kay,et al.  The Big Five and Visualisations of Team Work Activity , 2006, Intelligent Tutoring Systems.

[30]  Kalina Yacef,et al.  Educational Data Mining: a Case Study , 2005, AIED.

[31]  Neil T. Heffernan,et al.  Does Self-Discipline impact students' knowledge and learning? , 2009, EDM.

[32]  Steven L. Tanimoto,et al.  Student Consistency and Implications for Feedback in Online Assessment Systems , 2009, EDM.

[33]  Arthur C. Graesser,et al.  Automatic detection of learner’s affect from conversational cues , 2008, User Modeling and User-Adapted Interaction.

[34]  Manolis Mavrikis,et al.  Data-driven modelling of students' interactions in an ILE , 2008, EDM.

[35]  Albert T. Corbett,et al.  Cognitive Computer Tutors: Solving the Two-Sigma Problem , 2001, User Modeling.

[36]  Toon Calders,et al.  Mining the Student Assessment Data: Lessons Drawn from a Small Scale Case Study , 2008, EDM.

[37]  K. Koedinger,et al.  Investigations into Help Seeking and Learning with a Cognitive Tutor , 2001 .

[38]  Carolyn Penstein Rosé,et al.  Supporting CSCL with automatic corpus analysis technology , 2005, CSCL.

[39]  Kenneth R. Koedinger,et al.  Learning Factors Transfer Analysis: Using Learning Curve Analysis to Automatically Generate Domain Models , 2009, EDM.

[40]  Jun Hu,et al.  Scientometric analysis of the CHI proceedings , 2009, CHI.

[41]  Judy Kay,et al.  Clustering and Sequential Pattern Mining of Online Collaborative Learning Data , 2009, IEEE Transactions on Knowledge and Data Engineering.