Mining Web-based Educational Systems to Predict Student Learning Achievements

Educational Data Mining (EDM) is getting great importance as a new interdisciplinary research field related to some other areas. It is directly connected with Web-based Educational Systems (WBES) and Data Mining (DM, a fundamental part of Knowledge Discovery in Databases). The former defines the context: WBES store and manage huge amounts of data. Such data are increasingly growing and they contain hidden knowledge that could be very useful to the users (both teachers and students). It is desirable to identify such knowledge in the form of models, patterns or any other representation schema that allows a better exploitation of the system. The latter reveals itself as the tool to achieve such discovering. Data mining must afford very complex and different situations to reach quality solutions. Therefore, data mining is a research field where many advances are being done to accommodate and solve emerging problems. For this purpose, many techniques are usually considered. In this paper we study how data mining can be used to induce student models from the data acquired by a specific Web-based tool for adaptive testing, called SIETTE. Concretely we have used top down induction decision trees algorithms to extract the patterns because these models, decision trees, are easily understandable. In addition, the conducted validation processes have assured high quality models.

[1]  Sebastián Ventura,et al.  Applying Web usage mining for personalizing hyperlinks in Web-based adaptive educational systems , 2009, Comput. Educ..

[2]  Vipin Kumar,et al.  Chapman & Hall/CRC Data Mining and Knowledge Discovery Series , 2008 .

[3]  Ricardo Conejo,et al.  SIETTE: A Web-Based Tool for Adaptive Testing , 2004, Int. J. Artif. Intell. Educ..

[4]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[5]  Sebastián Ventura,et al.  Knowledge Discovery with Genetic Programming for Providing Feedback to Courseware Authors , 2004, User Modeling and User-Adapted Interaction.

[6]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[7]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[8]  A. Alonso AEPIA- Spanish Association for Artificial Intelligence , 2015, SOCO 2015.

[9]  Paul Libbrecht,et al.  ActiveMath: A Generic and Adaptive Web-Based Learning Environment , 2001 .

[10]  Paul H. Lee,et al.  Resampling Methods Improve the Predictive Power of Modeling in Class-Imbalanced Datasets , 2014, International journal of environmental research and public health.

[11]  Eduardo Guzmán,et al.  A Data-Driven Technique for Misconception Elicitation , 2010, UMAP.

[12]  Jesús S. Aguilar-Ruiz,et al.  Knowledge discovery from data streams , 2009, Intell. Data Anal..

[13]  José del Campo-Ávila,et al.  Improving the performance of an incremental algorithm driven by error margins , 2008, Intell. Data Anal..

[14]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[15]  Peter Brusilovsky,et al.  ELM-ART: An Intelligent Tutoring System on World Wide Web , 1996, Intelligent Tutoring Systems.

[16]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[17]  Robert Wagenaar,et al.  Annex 3. ECTS Grading Table , 2009 .

[18]  José del Campo-Ávila,et al.  Online and Non-Parametric Drift Detection Methods Based on Hoeffding’s Bounds , 2015, IEEE Transactions on Knowledge and Data Engineering.

[19]  Ngoc Thanh Nguyen,et al.  A method for learning scenario determination and modification in intelligent tutoring systems , 2011, Int. J. Appl. Math. Comput. Sci..

[20]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[21]  S. Graf,et al.  Adaptive and Intelligent Web-Based Educational Systems , 2009 .