Clustering student skill set profiles in a unit hypercube using mixtures of multivariate betas

This paper presents a finite mixture of multivariate betas as a new model-based clustering method tailored to applications where the feature space is constrained to the unit hypercube. The mixture component densities are taken to be conditionally independent, univariate unimodal beta densities (from the subclass of reparameterized beta densities given by Bagnato and Punzo in Comput Stat 28(4):10.1007/s00180-012-367-4, 2013). The EM algorithm used to fit this mixture is discussed in detail, and results from both this beta mixture model and the more standard Gaussian model-based clustering are presented for simulated skill mastery data from a common cognitive diagnosis model and for real data from the Assistment System online mathematics tutor (Feng et al. in J User Model User Adap Inter 19(3):243–266, 2009). The multivariate beta mixture appears to outperform the standard Gaussian model-based clustering approach, as would be expected on the constrained space. Fewer components are selected (by BIC-ICL) in the beta mixture than in the Gaussian mixture, and the resulting clusters seem more reasonable and interpretable.

[1]  Adrian E. Raftery,et al.  MCLUST Version 3 for R: Normal Mixture Modeling and Model-Based Clustering † , 2007 .

[2]  Rebecca Nugent,et al.  A Comparison of Student Skill Knowledge Estimates , 2009, EDM.

[3]  Adrian E. Raftery,et al.  How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis , 1998, Comput. J..

[4]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[5]  F. James Rohlf,et al.  Biometry: The Principles and Practice of Statistics in Biological Research , 1969 .

[6]  Gilles Celeux,et al.  Variable selection in model-based clustering: A general variable role modeling , 2009, Comput. Stat. Data Anal..

[7]  B. Junker,et al.  Cognitive Assessment Models with Few Assumptions, and Connections with Nonparametric Item Response Theory , 2001 .

[8]  Antonio Punzo,et al.  Finite mixtures of unimodal beta and gamma densities and the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k$$\end{d , 2012, Computational Statistics.

[9]  Neil Henry Latent structure analysis , 1969 .

[10]  Christian Hennig,et al.  Methods for merging Gaussian mixture components , 2010, Adv. Data Anal. Classif..

[11]  Rebecca Nugent,et al.  Subspace Clustering of Skill Mastery: Identifying Skills that Separate Students , 2009, EDM.

[12]  L. Hubert,et al.  Comparing partitions , 1985 .

[13]  Sokal Rr,et al.  Biometry: the principles and practice of statistics in biological research 2nd edition. , 1981 .

[14]  Tiffany Barnes,et al.  The Q-matrix Method: Mining Student Response Data for Knowledge , 2005 .

[15]  Adrian E. Raftery,et al.  mclust Version 4 for R : Normal Mixture Modeling for Model-Based Clustering , Classification , and Density Estimation , 2012 .

[16]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[17]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[18]  Jonathan Templin,et al.  Diagnostic Measurement: Theory, Methods, and Applications , 2010 .

[19]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[20]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[21]  Rebecca Nugent,et al.  Comparing different clustering models on the unit hypercube , 2011 .

[22]  David Peel,et al.  The EMMIX Algorithm for the Fitting of Normal and t-Components , 1999 .

[23]  Yuan Ji,et al.  Applications of beta-mixture models in bioinformatics , 2005, Bioinform..

[24]  Jonathan Templin,et al.  Using Efficient Model Based Sum‐Scores for Conducting Skills Diagnoses , 2007 .

[25]  B. Lindsay Mixture models : theory, geometry, and applications , 1995 .

[26]  Neil T. Heffernan,et al.  Addressing the assessment challenge with an online system that tutors as it assesses , 2009, User Modeling and User-Adapted Interaction.

[27]  J. D. L. Torre,et al.  DINA Model and Parameter Estimation: A Didactic , 2009 .

[28]  Rebecca Nugent,et al.  Skill set profile clustering based on student capability vectors computed from online tutoring data , 2008 .

[29]  Gilles Celeux,et al.  Combining Mixture Components for Clustering , 2010, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[30]  Christian Hennig,et al.  Ridgeline Plot and Clusterwise Stability as Tools for Merging Gaussian Mixture Components , 2010 .

[31]  Peter Adams,et al.  The EMMIX software for the fitting of mixtures of normal and t-components , 1999 .

[32]  A. Raftery,et al.  Variable Selection for Model-Based Clustering , 2006 .

[33]  Paul F. Lazarsfeld,et al.  Latent Structure Analysis. , 1969 .

[34]  R. Sokal,et al.  Biometry: The Principles and Practice of Statistics in Biological Research (2nd ed.). , 1982 .

[35]  Louis V. DiBello,et al.  31A Review of Cognitively Diagnostic Assessment and a Summary of Psychometric Models , 2006 .