Clinical trials, outcome measures, and response criteria.

Pillemer and Tilley: Editorial 407 After a patient has completed a clinical trial, the physician is anxious to know: “Did my patient get better?” Usually, the next question is: “Is the new treatment any good?” The emphasis of the first question tends to be whether the patient showed meaningful clinical improvement by the end of the study. For the second question, the emphasis is more toward whether the response in the group receiving the treatment of interest was superior to that in the control or placebo group. A positive answer for one question does not necessarily imply a “yes” answer for the other. For example, the patient’s physician may think: “I regret that the trial ended up with a negative result, but my patient got better.” The clinical trialist may think: “I regret that your patient didn’t get better, but the treatment that we tested was clearly superior to the placebo.” Here, we will look into how these questions have been approached, with an emphasis on clinical trials for rheumatoid arthritis (RA). We will also address how an alternative analytic approach may be used in RA as well as in conditions for which, in contrast to RA, widely agreed-upon response criteria may not yet be available. Over an extended period of time, investigators in many countries have sought the optimal outcome meas res for clinical trials in various rheumatic diseases. For RA, the efforts included the examination of a number of individual measures to find the best one1. These efforts have included exercises on how to count and score joints, and which joints to evaluate2,3. When considerable data became available after decades of work, an international effort was launched to determine which measures should be included in a core set to best describe the disease activity. After the Outcome Measures in Rheumatoid Arthritis Clinical Trials (OMERACT) meeting, the American College of Rheumatology (ACR) published a core set of criteria that were recommended for inclusion in clinical trials of RA4-6. This core set was subsequently ratified by the World Health Organization (WHO) and the International League of Associations for Rheumatology (ILAR) as the WHO/ILAR core set7. However, the question of how to optimally combine these outcome measures remained8,9. Paulus had developed rules-based criteria for response in RA trials based on data from the Cooperative Systemic Studies of Rheumatic Diseases (CSSRD) program (Table 1)10. Further activities by the ACR followed a heuristic or rules-based approach. The ACR analyzed data from CSSRD trials and applied rules that were essentially different ways of weighting and summing up the data across the core set of outcome measures11. An optimal rule would give the greatest difference between the active treatment and the placebo group, while giving a low placebo response rate. The goal was to achieve the greatest power to detect a treatment with the smallest number of patients. Different response criteria proposed by the European League Against Rheumatism (EULAR) and others were tested in analyses of the same CSSRD trials11. These analyses resulted in the selection of a rule that could be applied by clinicians to determine whether a patient was or was not a responder for a given therapy. The EULAR criteria use a more complex approach than the ACR, where each outcome measure is divided into categories based on percentage change from baseline12,13. As shown in Table 1, the EULAR criteria are based on both an improvement and the achievement of a low disease activity state, as measured by the Disease Activity Score (DAS). In the DAS, the weights for summing across different measures have been determined using a multivariate approach. This results in a more complicated equation that may have more precision for the set of clinical trial data on which it was originally based than, say, the ACR response criteria. There are problems with the heuristic approach. The first of these problems is inherent in extrapolating data from a single set of clinical trials to a later period. We may think of the conduct of clinical trials as the process of following cohorts of subjects who have been exposed or not exposed to the treatment of interest over a limited period of time. Editorial

[1]  R. Jonsson,et al.  Revisiting Sjögren's syndrome in the new millennium: perspectives on assessment and outcome measures. Report of a workshop held on 23 March 2000 at Oxford, UK. , 2001, Rheumatology.

[2]  S. Bowman,et al.  Outcome measures in Sjögren's syndrome. , 2001, Rheumatology.

[3]  F. Vivino The treatment of Sjögren's syndrome patients with Pilocarpine-tablets , 2001, Scandinavian journal of rheumatology. Supplement.

[4]  S. Pillemer,et al.  Global statistical tests for comparing multiple outcomes in rheumatoid arthritis trials. MIRA Trial Group. , 1999, Arthritis and rheumatism.

[5]  A. B. Richards,et al.  Treatment of primary Sjogren's syndrome with low-dose natural human interferon-α administered by the oral mucosal route: A phase II clinical trial , 1999 .

[6]  D. Felson,et al.  ACR and EULAR improvement criteria have comparable validity in rheumatoid arthritis trials. American College of Rheumatology European League of Associations for Rheumatology. , 1999, The Journal of rheumatology.

[7]  D. Furst,et al.  Etanercept therapy in rheumatoid arthritis. A randomized, controlled trial. , 1999, Annals of internal medicine.

[8]  T. C. Adamson,et al.  T cell receptor peptide vaccination in rheumatoid arthritis : A placebo-controlled trial using a combination of Vβ3, Vβ14, and Vβ17 peptides , 1998 .

[9]  G. Wells,et al.  Should improvement in rheumatoid arthritis clinical trials be defined as fifty percent or seventy percent improvement in core set measures, rather than twenty percent? , 1998, Arthritis and rheumatism.

[10]  N L Geller,et al.  Use of a global test for multiple outcomes in stroke trials with application to the National Institute of Neurological Disorders and Stroke t-PA Stroke Trial. , 1996, Stroke.

[11]  S. Pillemer,et al.  HLA-DRB1 genes and disease severity in rheumatoid arthritis. The MIRA Trial Group. Minocycline in Rheumatoid Arthritis. , 1996, Arthritis and rheumatism.

[12]  R. Fox,et al.  Treatment of primary Sjögren's syndrome with hydroxychloroquine: a retrospective, open-label study , 1996 .

[13]  M. Prevoo,et al.  Development and validation of the European League Against Rheumatism response criteria for rheumatoid arthritis. Comparison with the preliminary American College of Rheumatology and the World Health Organization/International League Against Rheumatism Criteria. , 1996, Arthritis and rheumatism.

[14]  J J Anderson,et al.  American College of Rheumatology. Preliminary definition of improvement in rheumatoid arthritis. , 1995, Arthritis and rheumatism.

[15]  T. Pincus,et al.  Reduced joint counts in controlled clinical trials in rheumatoid arthritis. , 1994, Arthritis and rheumatism.

[16]  P. Tugwell,et al.  World Health Organization and International League of Associations for Rheumatology core endpoints for symptom modifying antirheumatic drugs in rheumatoid arthritis clinical trials. , 1994, The Journal of rheumatology. Supplement.

[17]  M Lefkopoulou,et al.  Global tests for multiple binary outcomes. , 1993, Biometrics.

[18]  R. Roberts Pooled outcome measures in arthritis: the pros and cons. , 1993, The Journal of rheumatology.

[19]  P. Tugwell,et al.  OMERACT conference on outcome measures in rheumatoid arthritis clinical trials: introduction. , 1993, The Journal of rheumatology.

[20]  P. Tugwell,et al.  The validity of pooled outcome measures (indices) in rheumatoid arthritis clinical trials. , 1993, The Journal of rheumatology.

[21]  M. Chernoff,et al.  Sensitivity to change of rheumatoid arthritis clinical trial outcome measures. , 1993, The Journal of rheumatology.

[22]  OMERACT, Conference on Outcome Measures in Rheumatoid Arthritis Clinical Trials. Proceedings. Maastricht, The Netherlands, April 29-May 3, 1992. , 1993, The Journal of rheumatology.

[23]  T. Holford Understanding the effects of age, period, and cohort on incidence and mortality rates. , 1991, Annual review of public health.

[24]  H. Paulus,et al.  Analysis of improvement in individual rheumatoid arthritis patients treated with disease-modifying antirheumatic drugs, based on the findings in patients treated with placebo. The Cooperative Systematic Studies of Rheumatic Diseases Group. , 1990, Arthritis and rheumatism.

[25]  N L Geller,et al.  The analysis of multiple endpoints in clinical trials. , 1987, Biometrics.

[26]  J. Reading,et al.  Reduced joint count indices in the evaluation of rheumatoid arthritis. , 1985, Arthritis and rheumatism.

[27]  P. O'Brien Procedures for comparing samples with multiple endpoints. , 1984, Biometrics.

[28]  Dawkins Rl,et al.  Immunogenetics of rheumatoid arthritis. , 1983 .