New Results on an Improved Parallel EM Algorithm for Estimating Generalized Latent Variable Models

This report presents the second generation of a parallel algorithm for estimating generalized latent variable models, including MIRT models and their extensions, based on the general diagnostic model (GDM). The new development adds further computational improvements to the parallel-E parallel-M algorithm presented in an earlier report and produces even larger gains in performance. The gain achieved by this second-generation parallel algorithm reaches a factor of 20 for several of the examples for which the first generation was reported to yield a sixfold gain. The estimation of a multidimensional IRT model for large-scale data may show a larger reduction in runtime than a multiple-group model, which has a structure that is more conducive to parallel processing of the E-step. Multiple-population models can be arranged so that the parallelism directly exploits the ability to estimate multiple latent variable distributions separately, in independent threads of the algorithm.
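
To illustrate the last point, the following is a minimal sketch of an E-step in which each population is processed in its own thread. It is not the GDM implementation described in the report. It assumes a unidimensional 2PL model evaluated on a fixed quadrature grid, and the names `e_step_group` and `parallel_e_step` are hypothetical and used only for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np


def e_step_group(responses, a, b, nodes, prior):
    """E-step for one group (assumed 2PL model, illustrative only).

    responses: (persons, items) array of 0/1 scores
    a, b:      (items,) slopes and difficulties shared across groups
    nodes:     (nodes,) quadrature points for the latent variable
    prior:     (nodes,) group-specific latent distribution weights
    """
    # Probability of a correct response at each node, shape (nodes, items)
    p = 1.0 / (1.0 + np.exp(-a * (nodes[:, None] - b)))
    # Log-likelihood of each response pattern at each node, shape (persons, nodes)
    loglik = responses @ np.log(p).T + (1 - responses) @ np.log1p(-p).T
    post = np.exp(loglik) * prior              # unnormalized posterior
    post /= post.sum(axis=1, keepdims=True)    # normalize per person
    n_k = post.sum(axis=0)                     # expected persons per node
    r_k = post.T @ responses                   # expected correct counts, (nodes, items)
    return n_k, r_k


def parallel_e_step(groups, a, b, nodes, priors, workers=4):
    """Run the E-step of every group in a separate thread.

    Each group depends only on its own data and prior, so the tasks
    are independent and need no synchronization until the M-step.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [
            pool.submit(e_step_group, x, a, b, nodes, pr)
            for x, pr in zip(groups, priors)
        ]
        return [f.result() for f in futures]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a = np.array([1.0, 1.2, 0.8])
    b = np.array([-0.5, 0.0, 0.5])
    nodes = np.linspace(-4.0, 4.0, 21)
    prior = np.exp(-0.5 * nodes**2)
    prior /= prior.sum()
    # Two simulated groups of different size (synthetic data)
    groups = [rng.integers(0, 2, size=(n, 3)) for n in (50, 80)]
    stats = parallel_e_step(groups, a, b, nodes, [prior, prior])
    for n_k, _ in stats:
        print(round(float(n_k.sum()), 6))  # equals the group's sample size
```

The per-group expected counts would then be passed to an M-step that updates the shared item parameters and each group's latent distribution. The thread pool here shows only the structure of the decomposition; actual speedups depend on the implementation language and on how much of the work runs outside the interpreter lock.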
