Meta-Reasoning Algorithm for Improving Analysis of Student Interactions with Learning Objects using Supervised Learning

Supervised learning (SL) systems have been used to automatically learn models for analysis of learning object (LO) data. However, SL systems have trouble accommodating data from multiple distributions and “troublesome” data that contains irrelevant features or noise—all of which are relatively common in highly diverse LO data. The solution is to break up the available data into separate areas and then take steps to improve models on areas containing troublesome data. Unfortunately, finding these areas in the first place is a far from trivial task that balances finding a single distribution with having sufficient data to support meaningful analysis. Therefore, we propose a BoU metareasoning (MR) algorithm that first uses semi-supervised clustering to find compact clusters with multiple labels that each support meaningful analyses. After clustering, our BoU MR algorithm learns a separate model on each such cluster. Finally, our BoU MR algorithm uses feature selection (FS) and noise correction (NC) algorithms to improve models on clusters containing troublesome data. Our experiments, using three datasets containing over 5000 sessions of student interactions with LOs, show that multiple models from BoU MR achieve more accurate analyses than a single model. Further, FS and NC algorithms are more effective at improving multiple models than a single model.

[1]  Lee Dee Miller,et al.  Evaluating the use of learning objects in CS1 , 2011, SIGCSE.

[2]  Leen-Kiat Soh,et al.  A Placement Test for Computer Science: Design, Implementation, and Analysis , 2006, Comput. Sci. Educ..

[3]  Chien-Sing Lee,et al.  Learning Objects Reusability and Retrieval through Ontological Sharing: A Hybrid Unsupervised Data Mining Approach , 2007, Seventh IEEE International Conference on Advanced Learning Technologies (ICALT 2007).

[4]  Cristina Conati,et al.  Discovering and Recognizing Student Interaction Patterns in Exploratory Learning Environments , 2010, Intelligent Tutoring Systems.

[5]  Sebastián Ventura,et al.  Predicting Student Grades in Learning Management Systems with Multiple Instance Genetic Programming. , 2009, EDM 2009.

[6]  Ann O'Brien,et al.  Learning Objects Update: Review and Critical Approach to Content Aggregation , 2008, J. Educ. Technol. Soc..

[7]  Inderjit S. Dhillon,et al.  Semi-supervised graph clustering: a kernel approach , 2005, Machine Learning.

[8]  Sanjoy Dasgupta,et al.  Hierarchical sampling for active learning , 2008, ICML '08.

[9]  César Hervás-Martínez,et al.  Data Mining Algorithms to Classify Students , 2008, EDM.

[10]  Nicola Segata,et al.  FaLKM-lib v1.0: A Library for Fast Local Kernel Machines , 2009 .

[11]  Mykola Pechenizkiy,et al.  Class Noise and Supervised Learning in Medical Domains: The Effect of Feature Extraction , 2006, 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06).

[12]  David J. Paul,et al.  An Investigation into the Learning Styles and Self- Regulated Learning Strategies for Computer Science Students , 2011 .

[13]  Simon,et al.  My students don't learn the way I do , 2011, ACE 2011.

[14]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[15]  David R. Musicant,et al.  Understanding Support Vector Machine Classifications via a Recommender System-Like Approach , 2009, DMIN.

[16]  Rajibussalim Mining Students' Interaction Data from a System that Support Learning by Reflection , 2010, EDM.

[17]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[18]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[19]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[20]  Enrico Blanzieri,et al.  Fast and Scalable Local Kernel Machines , 2010, J. Mach. Learn. Res..