Spectral Clustering in Educational Data Mining

Spectral Clustering is a graph theoretic technique to represent data in such a way that clustering on this new representation is reduced to a trivial task. It is especially useful in complex datasets where traditional clustering methods would fail to find groupings. In previous work we have shown the utility of using K-means clustering for exploiting structure in the data to affect a significant improvement in prediction accuracy on educational datasets. In this work we show that by using Spectral Clustering we are able to further improve the student performance prediction. We evaluate an educational data mining prediction task: predicting student state test scores from student features derived from a tutor and also explore some other EDM tasks using spectral clustering.

[1]  Sanjoy Dasgupta,et al.  Learning Mixtures of Gaussians using the k-means Algorithm , 2009, ArXiv.

[2]  Ieee Xplore,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Information for Authors , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Michael William Newman,et al.  The Laplacian spectrum of graphs , 2001 .

[4]  Geoffrey E. Hinton,et al.  Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[5]  R. Charles Murray,et al.  Reducing the Knowledge Tracing Space , 2009, EDM.

[6]  B. Mohar THE LAPLACIAN SPECTRUM OF GRAPHS y , 1991 .

[7]  Santosh S. Vempala,et al.  A divide-and-merge methodology for clustering , 2005, PODS '05.

[8]  Rebecca Nugent,et al.  Skill Set Profile Clustering: The Empty K-Means Algorithm with Automatic Specification of Starting Cluster Centers , 2010, EDM.

[9]  James R. Curran,et al.  Data Mining for Generating Hints in a Python Tutor , 2010, EDM.

[10]  Neil T. Heffernan,et al.  Addressing the assessment challenge with an online system that tutors as it assesses , 2009, User Modeling and User-Adapted Interaction.

[11]  Tamara Sumner,et al.  Online Curriculum Planning Behavior of Teachers , 2010, EDM.

[12]  Neil T. Heffernan,et al.  Can We Get Better Assessment From A Tutoring System Compared to Traditional Paper Testing? Can We Have Our Cake (Better Assessment) and Eat It too (Student Learning During the Test)? , 2010, EDM.

[13]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[14]  Zachary A. Pardos,et al.  Clustering Students to Generate an Ensemble to Improve Standard Test Score Predictions , 2011, AIED.

[15]  Kenneth R. Koedinger,et al.  Unsupervised Discovery of Student Strategies , 2010, EDM.

[16]  Neil T. Heffernan,et al.  How to Construct More Accurate Student Models: Comparing and Optimizing Knowledge Tracing and Performance Factor Analysis , 2011, Int. J. Artif. Intell. Educ..

[17]  Edward Y. Chang,et al.  Parallel Spectral Clustering in Distributed Systems , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..