The process of deciding which skills are involved in a given task is tedious and challenging. Means to automate it are highly desirable, even if only partial automation that provides supportive tools can be achieved. A recent technique based on Non-negative Matrix Factorization (NMF) was shown to offer valuable results, especially due to the fact that the resulting factorization allows a straightforward interpretation in terms of a Q-matrix. We investigate the factors and assumptions under which NMF can effectively derive the underlying high level skills behind assessment results. We demonstrate the use of different techniques to analyse and interpret the output of NMF. We propose a simple model to generate simulated data and to provide lower and upper bounds for quantifying skill effect. Using the simulated data, we show that, under the assumption of independent skills, the NMF technique is highly effective in deriving the Q-matrix. However, the NMF performance degrades under different ratios of variance between subject performance, item difficulty, and skill mastery. The results corroborates conclusions from previous work in that high level skills, corresponding to general topics like World History and Biology, seem to have no substantial effect on test performance, whereas other topics like Mathematics and French do. The analysis and visualization techniques of the NMF output, along with the simulation approach presented in this paper, should be useful for future investigations using NMF for Q-matrix induction from data.
[1]
H. Sebastian Seung,et al.
Learning the parts of objects by non-negative matrix factorization
,
1999,
Nature.
[2]
K. Koedinger,et al.
Automating Cognitive Model Improvement by A * Search and Logistic Regression
,
2005
.
[3]
K. Tatsuoka.
RULE SPACE: AN APPROACH FOR DEALING WITH MISCONCEPTIONS BASED ON ITEM RESPONSE THEORY
,
1983
.
[4]
Elmar Wolfgang Lang,et al.
A Nonnegative Blind Source Separation Model for Binary Test Data
,
2010,
IEEE Transactions on Circuits and Systems I: Regular Papers.
[5]
Tiffany Barnes.
Evaluation of the q-matrix Method in Understanding Student Logic Proofs
,
2006,
FLAIRS Conference.
[6]
Michael W. Berry,et al.
Algorithms and applications for approximate nonnegative matrix factorization
,
2007,
Comput. Stat. Data Anal..
[7]
Michel C. Desmarais,et al.
On the Faithfulness of Simulated Student Performance Data
,
2010,
EDM.
[8]
Kenneth R. Koedinger,et al.
Learning Factors Analysis - A General Method for Cognitive Model Evaluation and Improvement
,
2006,
Intelligent Tutoring Systems.