Using computerized adaptive testing to reduce the burden of mental health assessment.

OBJECTIVE This study investigated the combination of item response theory and computerized adaptive testing (CAT) for psychiatric measurement as a means of reducing the burden of research and clinical assessments. METHODS Data were from 800 participants in outpatient treatment for a mood or anxiety disorder; they completed 616 items of the 626-item Mood and Anxiety Spectrum Scales (MASS) at two times. The first administration was used to design and evaluate a CAT version of the MASS by using post hoc simulation. The second confirmed the functioning of CAT in live testing. RESULTS Tests of competing models based on item response theory supported the scale's bifactor structure, consisting of a primary dimension and four group factors (mood, panic-agoraphobia, obsessive-compulsive, and social phobia). Both simulated and live CAT showed a 95% average reduction (585 items) in items administered (24 and 30 items, respectively) compared with administration of the full MASS. The correlation between scores on the full MASS and the CAT version was .93. For the mood disorder subscale, differences in scores between two groups of depressed patients--one with bipolar disorder and one without--on the full scale and on the CAT showed effect sizes of .63 (p<.003) and 1.19 (p<.001) standard deviation units, respectively, indicating better discriminant validity for CAT. CONCLUSIONS Instead of using small fixed-length tests, clinicians can create item banks with a large item pool, and a small set of the items most relevant for a given individual can be administered with no loss of information, yielding a dramatic reduction in administration time and patient and clinician burden.

[1]  Steven P. Reise,et al.  The role of the bifactor model in resolving dimensionality issues in health outcomes measures , 2007, Quality of Life Research.

[2]  Otto B. Walter,et al.  Development and evaluation of a computer adaptive test for ‘Anxiety’ (Anxiety-CAT) , 2007, Quality of Life Research.

[3]  Donald Hedeker,et al.  Full-Information Item Bifactor Analysis of Graded Response Data , 2007 .

[4]  Ayelet Meron Ruscio,et al.  The epidemiology of panic attacks, panic disorder, and agoraphobia in the National Comorbidity Survey Replication. , 2006, Archives of general psychiatry.

[5]  Jakob B. Bjorner,et al.  Multidimensional Computerized Adaptive Testing of the EORTC QLQ-C30: Basic Developments and Evaluations , 2006, Quality of Life Research.

[6]  Otto B. Walter,et al.  Development of a Computer-adaptive Test for Depression (D-CAT) , 2005, Quality of Life Research.

[7]  D. Kupfer,et al.  Cross-cultural validity of the Structured Clinical Interview for Panic-Agoraphobic Spectrum , 2005, Social Psychiatry and Psychiatric Epidemiology.

[8]  A. Belanger,et al.  Reliability and Validity , 2004 .

[9]  D. Kupfer,et al.  The Mood Spectrum in Unipolar and Bipolar Disorder: Arguments for a Unitary Approach , 2004 .

[10]  D. Weiss Computerized Adaptive Testing for Effective and Efficient Measurement in Counseling and Education , 2004 .

[11]  Daniel J Buysse,et al.  Computerized adaptive measurement of depression: A simulation study , 2004, BMC psychiatry.

[12]  P. Fayers Item Response Theory for Psychologists , 2004, Quality of Life Research.

[13]  D. Kupfer,et al.  The mood spectrum in unipolar and bipolar disorder: arguments for a unitary approach. , 2004, The American journal of psychiatry.

[14]  Olga V. Demler,et al.  The epidemiology of major depressive disorder: results from the National Comorbidity Survey Replication (NCS-R). , 2003, JAMA.

[15]  M. Hall,et al.  Depression: Epidemiology and risk factors , 2003 .

[16]  D. Kupfer,et al.  Clinical significance of lifetime panic spectrum symptoms in the treatment of patients with bipolar I disorder. , 2002, Archives of general psychiatry.

[17]  W. Gardner,et al.  Multidimensional Adaptive Testing for Mental Health Problems in Primary Care , 2002, Medical care.

[18]  E. Frank,et al.  Measuring social anxiety and obsessive-compulsive spectra: comparison of interviews and self-report instruments. , 2002, Comprehensive Psychiatry.

[19]  D. Kupfer,et al.  Adult separation anxiety: psychometric properties of a new structured clinical interview. , 2002, Journal of psychiatric research.

[20]  E. Frank,et al.  Measuring mood spectrum: comparison of interview (SCI-MOODS) and self-report (MOODS-SR) instruments. , 2002, Comprehensive psychiatry.

[21]  D. Kupfer,et al.  Panic-agoraphobic spectrum: reliability and validity of assessment instruments. , 2001, Journal of psychiatric research.

[22]  D. Cooke,et al.  Psychopathy and ethnicity: structural, item, and test generalizability of the Psychopathy Checklist--Revised (PCL-R) in Caucasian and African American Participants. , 2001, Psychological assessment.

[23]  Howard Wainer,et al.  CATs: Whither and whence , 2000 .

[24]  J. Ware,et al.  Practical implications of item response theory and computerized adaptive testing: a brief summary of ongoing studies of widely used headache impact scales. , 2000, Medical care.

[25]  Daniel J Buysse,et al.  Interpersonal psychotherapy and antidepressant medication: evaluation of a sequential treatment strategy in women with recurrent major depression. , 2000, The Journal of clinical psychiatry.

[26]  E. Frank,et al.  Internal consistency and discriminant validity of the Structured Clinical Interview for Panic Agoraphobic Spectrum (SCI‐PAS) , 1999 .

[27]  E. Frank,et al.  The bipolar spectrum: a clinical reality in search of diagnostic criteria and an assessment methodology. , 1999, Journal of affective disorders.

[28]  Brent Bridgeman,et al.  COMPARABILITY OF PAPER-AND-PENCIL AND COMPUTER ADAPTIVE TEST SCORES ON THE GRE® GENERAL TEST , 1998 .

[29]  Ellen Frank,et al.  The Spectrum Model: A More Coherent Approach to the Complexity of Psychiatric Symptomatology , 1998, CNS Spectrums.

[30]  M K Shear,et al.  The panic-agoraphobic spectrum: a descriptive approach to the assessment and treatment of subtle symptoms. , 1997, The American journal of psychiatry.

[31]  Barbara G. Dodd,et al.  Computerized Adaptive Testing With Polytomous Items , 1995 .

[32]  James O. Ramsay,et al.  Nonparametric Item Analyses of the Beck Depression Inventory: Evaluating Gender Item Bias and Response Option Weights , 1994 .

[33]  R. Kessler,et al.  Lifetime and 12-month prevalence of DSM-III-R psychiatric disorders in the United States. Results from the National Comorbidity Survey. , 1994, Archives of general psychiatry.

[34]  D J Kupfer,et al.  Exactly what does the Hamilton Depression Rating Scale measure? , 1993, Journal of psychiatric research.

[35]  Donald Hedeker,et al.  Full-information item bi-factor analysis , 1992 .

[36]  S. Reise,et al.  Fitting the Two-Parameter Model to Personality Data , 1990 .

[37]  Eiji Muraki,et al.  Fitting a Polytomous Item Response Model to Likert-Type Data , 1990 .

[38]  A. Baron Experimental Designs , 1990, The Behavior analyst.

[39]  E. Muraki,et al.  Full-Information Item Factor Analysis , 1988 .

[40]  David Andrich,et al.  The Application of an Unfolding Model of the PIRT Type to the Measurement of Attitude , 1988 .

[41]  D. Weiss Adaptive testing by computer. , 1985, Journal of consulting and clinical psychology.

[42]  R. Gibbons,et al.  Application of modern psychometric theory in psychiatric research. , 1985, Journal of psychiatric research.

[43]  R. Hambleton,et al.  Item Response Theory: Principles and Applications , 1984 .

[44]  J. Mcbride,et al.  Reliability and Validity of Adaptive Ability Tests in a Military Setting , 1983 .

[45]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm , 1981 .

[46]  David J Weiss,et al.  An Alternate-Forms Reliability and Concurrent Validity Comparison of Bayesian Adaptive and Conventional Ability Tests. , 1980 .

[47]  D. Andrich A rating formulation for ordered response categories , 1978 .

[48]  D. Andrich Application of a Psychometric Rating Model to Ordered Categories Which Are Scored with Successive Integers , 1978 .

[49]  David J. Weiss,et al.  An Adaptive Testing Strategy for Achievement Test Batteries. , 1977 .

[50]  F. Samejima Estimation of latent ability using a response pattern of graded scores , 1968 .

[51]  William G. Cochran,et al.  Experimental Designs, 2nd Edition , 1950 .

[52]  K. Holzinger,et al.  The Bi-factor method , 1937 .