Equivalence of electronic and paper-and-pencil administration of patient-reported outcome measures: a meta-analytic review.

OBJECTIVES Patient-reported outcomes (PROs; self-report assessments) are increasingly important in evaluating medical care and treatment efficacy. Electronic administration of PROs via computer is becoming widespread. This article reviews the literature addressing whether computer-administered tests are equivalent to their paper-and-pencil forms. METHODS Meta-analysis was used to synthesize 65 studies that directly assessed the equivalence of computer versus paper versions of PROs used in clinical trials. A total of 46 unique studies, evaluating 278 scales, provided sufficient detail to allow quantitative analysis. RESULTS Among 233 direct comparisons, the average mean difference between modes averaged 0.2% of the scale range (e.g., 0.02 points on a 10-point scale), and 93% were within +/-5% of the scale range. Among 207 correlation coefficients between paper and computer instruments (typically intraclass correlation coefficients), the average weighted correlation was 0.90; 94% of correlations were at least 0.75. Because the cross-mode correlation (paper vs. computer) is also a test-retest correlation, with potential variation because of retest, we compared it to the within-mode (paper vs. paper) test-retest correlation. In four comparisons that evaluated both, the average cross-mode paper-to-computer correlation was almost identical to the within-mode correlation for readministration of a paper measure (0.88 vs. 0.91). CONCLUSIONS Extensive evidence indicates that paper- and computer-administered PROs are equivalent.

[1]  Jacob Cohen,et al.  The Equivalence of Weighted Kappa and the Intraclass Correlation Coefficient as Measures of Reliability , 1973 .

[2]  J. Fleiss Statistical methods for rates and proportions , 1974 .

[3]  J. Fleiss,et al.  Intraclass correlations: uses in assessing rater reliability. , 1979, Psychological bulletin.

[4]  W. Grove Statistical Methods for Rates and Proportions, 2nd ed , 1981 .

[5]  D. Cicchetti,et al.  Developing criteria for establishing interrater reliability of specific items: applications to assessment of adaptive behavior. , 1981, American journal of mental deficiency.

[6]  R. Orwin A fail-safe N for effect size in meta-analysis. , 1983 .

[7]  L. Hedges,et al.  Statistical Methods for Meta-Analysis , 1987 .

[8]  B. Plake,et al.  Comparing computerized versus traditional psychological assessment , 1985 .

[9]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[10]  G. Guyatt,et al.  Measurement of health status. Ascertaining the minimal clinically important difference. , 1989, Controlled clinical trials.

[11]  R Rosenthal,et al.  Meta‐analysis: a review. , 1991, Psychosomatic medicine.

[12]  J. Cox,et al.  Validation of a computerized version of the 10-item (self-rating) Edinburgh postnatal depression scale , 1991 .

[13]  Clay E. George,et al.  The effects of computerized versus paper-and-pencil administration on measures of negative affect , 1992 .

[14]  C Safran,et al.  Computer-based interview for screening blood donors for risk of HIV transmission. , 1990, JAMA.

[15]  R A Steer,et al.  Structure of the computer-assisted Beck Anxiety Inventory with psychiatric inpatients. , 1993, Journal of personality assessment.

[16]  S. Shiffman,et al.  Ecological Momentary Assessment (Ema) in Behavioral Medicine , 1994 .

[17]  I. Wilson,et al.  Linking clinical variables with health-related quality of life. A conceptual model of patient outcomes. , 1995, JAMA.

[18]  Jack D. Burke,et al.  Test-retest reliability in psychiatric patients of the sf-36 health survey , 1996 .

[19]  N. Stuckless,et al.  The effects of computer versus paper-and-pencil administration on measures of anger and revenge with an inmate population , 1996 .

[20]  K. McGraw,et al.  Forming inferences about some intraclass correlation coefficients. , 1996 .

[21]  M. Atkinson,et al.  Computerized quality-of-life screening in an oncology clinic. , 1997, Cancer practice.

[22]  N. Bellamy,et al.  Validation study of a computerized version of the Western Ontario and McMaster Universities VA3.0 Osteoarthritis Index. , 1997, The Journal of rheumatology.

[23]  J H Greist,et al.  A computer-administered telephone interview to identify mental disorders. , 1997, JAMA.

[24]  P Wright,et al.  Computer anxiety: a comparison of pen-based personal digital assistants, conventional computer and paper assessment of mood and performance. , 1998, British journal of psychology.

[25]  M Elia,et al.  Comparison of the traditional paper visual analogue scale questionnaire with an Apple Newton electronic appetite rating system (EARS) in free living subjects feeding ad libitum , 1998, European Journal of Clinical Nutrition.

[26]  D. Forman,et al.  Automated collection of quality-of-life data: a comparison of paper and computer touch-screen questionnaires. , 1999, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[27]  P. van Schaik,et al.  Effect of an Educational Multimedia Prostate Program on the International Prostate Symptom Score , 1999, European Urology.

[28]  Norbert Schmitz,et al.  Computerized administration of the Symptom Checklist (SCL-90-R) and the Inventory of Interpersonal Problems (IIP-C) in psychosomatic outpatients , 1999, Psychiatry Research.

[29]  Stefan E. Schulenberg,et al.  The equivalence of computerized and paper-and-pencil psychological instruments: Implications for measures of negative affect , 1999, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[30]  N. Hartkamp,et al.  Comparison of the standard and the computerized versions of the Symptom Check List (SCL‐90‐R): a randomized trial , 2000, Acta psychiatrica Scandinavica.

[31]  M. Elia,et al.  Description and evaluation of a Newton-based electronic appetite rating system for temporal tracking of appetite in human subjects , 2001, Physiology & Behavior.

[32]  R. Jamison,et al.  Electronic diaries for monitoring chronic pain: 1-year validation study , 2001, Pain.

[33]  B. Tiplady,et al.  Validity and sensitivity of a pen computer battery of performance tests , 2001, Journal of psychopharmacology.

[34]  N. Leidy,et al.  A Comparative Trial of Paper-and-Pencil Versus Computer Administration of the Quality of Life in Reflux and Dyspepsia (QOLRAD) Questionnaire , 2001, Medical care.

[35]  George D. Kitas,et al.  Computerized information-gathering in specialist rheumatology clinics: an initial evaluation of an electronic version of the Short Form 36. , 2002, Rheumatology.

[36]  J. Sorkin,et al.  Factors Influencing the Use of Computer Technology in the Collection of Clinical Data in a Predominantly African‐American Population , 2002, Journal of the American Geriatrics Society.

[37]  Michael R. Hufford,et al.  Applications and What Works in the Field , 2002 .

[38]  N. Bellamy,et al.  Clinical evaluation of the WOMAC 3.0 OA Index in numeric rating scale format using a computerized touch screen version. , 2002, Osteoarthritis and cartilage.

[39]  Khaled J Saleh,et al.  Comparison of commonly used orthopaedic outcome measures using palm‐top computers and paper surveys , 2002, Journal of orthopaedic research : official publication of the Orthopaedic Research Society.

[40]  Nathaniel P. Katz,et al.  Comparative study of electronic vs. paper VAS ratings: a randomized, crossover trial using healthy volunteers , 2002, PAIN.

[41]  S. Shiffman,et al.  Patient non-compliance with paper diaries , 2002, BMJ : British Medical Journal.

[42]  Ted Okon,et al.  The Cancer Care Monitor: psychometric content evaluation and pilot testing of a computer administered system for symptom screening and quality of life in adult cancer patients. , 2003, Journal of pain and symptom management.

[43]  B. Parasuraman,et al.  Electronic Versus Paper Questionnaires: A Further Comparison in Persons with Asthma , 2003, The Journal of asthma : official journal of the Association for the Care of Asthma.

[44]  Stanley E. Kaufman,et al.  Electronic collection of health-related quality of life data: Validity, time benefits, and patient preference , 2004, Quality of Life Research.

[45]  Benjamin M. Ogles,et al.  Computerized Depression Screening and Awareness , 1998, Community Mental Health Journal.

[46]  H. Drummond,et al.  Electronic quality of life questionnaires: a comparison of pen-based electronic questionnaires with conventional paper in a gastrointestinal study , 1995, Quality of Life Research.

[47]  New technology in quality of life research: Are all computer-assisted approaches created equal? , 2003, Quality of Life Research.

[48]  R. Willke,et al.  Measuring treatment impact: a review of patient-reported outcomes and other efficacy endpoints in approved product labels. , 2004, Controlled clinical trials.

[49]  J. J. Caro Sr,et al.  Does electronic implementation of questionnaires used in asthma alter responses compared to paper implementation? , 2004, Quality of Life Research.

[50]  Angelia Mosley-Williams,et al.  Validation of a computer version of the American College of Rheumatology patient assessment questionnaire for the autonomous self-entry of self-report data in an urban rheumatology clinic. , 2004, Arthritis and rheumatism.

[51]  Robyn Attewell,et al.  A comparison of an electronic version of the SF-36 General Health Questionnaire to the standard paper version , 2002, Quality of Life Research.

[52]  J. Gaertner,et al.  Electronic pain diary: a randomized crossover study. , 2004, Journal of pain and symptom management.

[53]  Mikael Palmblad,et al.  Electronic diaries and questionnaires: Designing user interfaces that are easy for all patients to use , 2004, Quality of Life Research.

[54]  Michel Hersen,et al.  Comprehensive handbook of psychological assessment , 2004 .

[55]  Reyis Kurt,et al.  Computer-assisted assessment of depression and function in older primary care patients , 2004, Comput. Methods Programs Biomed..

[56]  Paul P Stork,et al.  A randomized trial of electronic versus paper pain diaries in children: impact on compliance, accuracy, and acceptability , 2004, Pain.

[57]  A. N. Brand,et al.  A comparison of the standard and the computerized versions of the Well-being Questionnaire (WBQ) and the Diabetes Treatment Satisfaction Questionnaire (DTSQ) , 1998, Quality of Life Research.

[58]  Ingela Wiklund,et al.  Assessment of patient‐reported outcomes in clinical trials: the example of health‐related quality of life , 2004, Fundamental & clinical pharmacology.

[59]  Leslie Lenert,et al.  A web-compatible instrument for measuring self-reported disease activity in arthritis. , 2004, The Journal of rheumatology.

[60]  A. Cook,et al.  Electronic pain questionnaires: A randomized, crossover comparison with paper questionnaires for chronic pain assessment , 2004, Pain.

[61]  N. Bellamy,et al.  Responsiveness of the electronic touch screen WOMAC 3.1 OA Index in a short term clinical trial with rofecoxib. , 2004, Osteoarthritis and cartilage.

[62]  Bill Byrom,et al.  The value of computer-administered self-report data in central nervous system clinical trials. , 2005, Current opinion in drug discovery & development.

[63]  N. Bellamy,et al.  Validation and patient acceptance of a computer touch screen version of the WOMAC 3.1 osteoarthritis index , 2004, Annals of the rheumatic diseases.

[64]  K. Grafton,et al.  Test-Retest Reliability of the Short-Form McGill Pain Questionnaire: Assessment of Intraclass Correlation Coefficients and Limits of Agreement in Patients With Osteoarthritis , 2005, The Clinical journal of pain.

[65]  Jacek A Kopec,et al.  Computer-administered bath ankylosing spondylitis and Quebec Scale outcome questionnaires for low back pain: agreement with traditional paper format. , 2005, The Journal of rheumatology.

[66]  E. McColl Cognitive Interviewing. A Tool for Improving Questionnaire Design , 2006, Quality of Life Research.

[67]  T. Kvien,et al.  Performance of health status measures with a pen based personal digital assistant , 2005, Annals of the rheumatic diseases.

[68]  H. Bischoff-Ferrari,et al.  A computer touch-screen version of the North American Spine Society outcome assessment instrument for the lumbar spine. , 2005, The Journal of bone and joint surgery. British volume.

[69]  D. Patrick,et al.  Validation of electronic data capture of the Irritable Bowel Syndrome--Quality of Life Measure, the Work Productivity and Activity Impairment Questionnaire for Irritable Bowel Syndrome and the EuroQol. , 2006, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[70]  U. S. Department of Health and Human Services FDA Cen Research,et al.  Guidance for industry: patient-reported outcome measures: use in medical product development to support labeling claims: draft guidance , 2006, Health and quality of life outcomes.

[71]  Toben F Nelson,et al.  Comparing web and mail responses in a mixed mode survey in college alcohol use research. , 2006, Addictive behaviors.

[72]  D V Doyle,et al.  Touch-screen computer systems in the rheumatology clinic offer a reliable and user-friendly means of collecting quality-of-life and outcome data from patients with rheumatoid arthritis. , 2006, Rheumatology.