A Sparse Latent Class Model for Cognitive Diagnosis

Cognitive diagnostic models (CDMs) are latent variable models developed to infer latent skills, knowledge, or personalities that underlie responses to educational, psychological, and social science tests and measures. Recent research focused on theory and methods for using sparse latent class models (SLCMs) in an exploratory fashion to infer the latent processes and structure underlying responses. We report new theoretical results about sufficient conditions for generic identifiability of SLCM parameters. An important contribution for practice is that our new generic identifiability conditions are more likely to be satisfied in empirical applications than existing conditions that ensure strict identifiability. Learning the underlying latent structure can be formulated as a variable selection problem. We develop a new Bayesian variable selection algorithm that explicitly enforces generic identifiability conditions and monotonicity of item response functions to ensure valid posterior inference. We present Monte Carlo simulation results to support accurate inferences and discuss the implications of our findings for future SLCM research and educational testing.

[1]  Jonathan Templin,et al.  Diagnostic Measurement: Theory, Methods, and Applications , 2010 .

[2]  J. D. L. Torre,et al.  The Generalized DINA Model Framework. , 2011 .

[3]  Jingchen Liu,et al.  Theory of the Self-learning Q-Matrix. , 2010, Bernoulli : official journal of the Bernoulli Society for Mathematical Statistics and Probability.

[4]  Jingchen Liu,et al.  On the Identifiability of Diagnostic Classification Models , 2017, Psychometrika.

[5]  Matthias von Davier,et al.  A General Diagnostic Model Applied to Language Testing Data. Research Report. ETS RR-05-16. , 2005 .

[6]  Chia-Yi Chiu,et al.  Cluster Analysis for Cognitive Diagnosis: Theory and Applications , 2009 .

[7]  M. Verlaan,et al.  Non-uniqueness in probabilistic numerical identification of bacteria , 1994, Journal of Applied Probability.

[8]  H. Teicher Identifiability of Mixtures , 1961 .

[9]  J. Kruskal More factors than subjects, tests and treatments: An indeterminacy theorem for canonical decomposition and individual differences scaling , 1976 .

[10]  Miguel Á. Carreira-Perpiñán,et al.  Practical Identifiability of Finite Mixtures of Multivariate Bernoulli Distributions , 2000, Neural Computation.

[11]  Yuguo Chen,et al.  Bayesian Estimation of the DINA Q matrix , 2018, Psychometrika.

[12]  Jeffrey A Douglas,et al.  Higher-order latent trait models for cognitive diagnosis , 2004 .

[13]  Curtis Tatsuoka,et al.  Data analytic methods for latent partially ordered classification models , 2002 .

[14]  Kikumi K. Tatsuoka,et al.  Analysis of Errors in Fraction Addition and Subtraction Problems. Final Report. , 1984 .

[15]  J. Kruskal Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics , 1977 .

[16]  J. Hagenaars Loglinear Models with Latent Variables , 1993 .

[17]  C. Matias,et al.  Identifiability of parameters in latent structure models with many observed variables , 2008, 0809.5032.

[18]  B. Junker,et al.  Cognitive Assessment Models with Few Assumptions, and Connections with Nonparametric Item Response Theory , 2001 .

[19]  S. Yakowitz,et al.  On the Identifiability of Finite Mixtures , 1968 .

[20]  Gongjun Xu,et al.  Identifiability of restricted latent class models with binary responses , 2016, 1603.04140.

[21]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[22]  David A. Cox,et al.  Ideals, Varieties, and Algorithms , 1997 .

[23]  Sarah M. Hartz,et al.  A Bayesian framework for the unified model for assessing cognitive abilities: Blending theory with practicality. , 2002 .

[24]  E. Maris Estimating multiple classification latent class models , 1999 .

[25]  Edward H. Haertel Using restricted latent class models to map the skill structure of achievement items , 1989 .

[26]  Steven Andrew Culpepper,et al.  Estimating the Cognitive Diagnosis $$\varvec{Q}$$Q Matrix with Expert Knowledge: Application to the Fraction-Subtraction Dataset , 2018, Psychometrika.

[27]  B. Mityagin The Zero Set of a Real Analytic Function , 2015, Mathematical Notes.

[28]  L. A. Goodman Exploratory latent structure analysis using both identifiable and unidentifiable models , 1974 .

[29]  Z. Ying,et al.  Statistical Analysis of Q-Matrix Based Diagnostic Classification Models , 2015, Journal of the American Statistical Association.

[30]  Gongjun Xu,et al.  Identifying Latent Structures in Restricted Latent Class Models , 2018, Journal of the American Statistical Association.

[31]  J. Templin,et al.  Measurement of psychological disorders using cognitive diagnosis models. , 2006, Psychological methods.