Assessing Features of Psychometric Assessment Instruments: A Comparison of the COSMIN Checklist with Other Critical Appraisal Tools

The past 20 years have seen the development of instruments that specify standards for, and evaluate the adequacy of, published studies with respect to the quality of their design, findings, and reporting. In the field of psychometrics, the first minimum set of standards for the review of psychometric instruments was published in 1996 by the Scientific Advisory Committee of the Medical Outcomes Trust. Since then, a number of tools have been developed with similar aims. The present paper reviews basic psychometric properties (reliability, validity, and responsiveness), compares six tools developed for the critical appraisal of psychometric studies, and provides a worked example that applies the COSMIN checklist, the Terwee-m statistical quality criteria, and the levels-of-evidence synthesis method of Schellingerhout and colleagues (2012). By presenting the available assessment tools, their characteristics, and their utility, this paper will aid users and reviewers of questionnaires in appraising quality and selecting appropriate instruments.
