Why predictive indexes perform less well in validation studies. Is it magic or methods?

When prognostic indexes have been tested in a second population, they have often performed less well. Since this is believed to be inevitable, methodologic differences that may explain the discrepancies have been overlooked. Data from a prospective study of 232 patients undergoing noncardiac surgery were used to examine the effect of methodologic differences in assembly of population, postoperative surveillance, and the criteria for cardiac complications on the performance of Goldman's cardiac risk index. Our prospective population was used to simulate the methods used in Goldman's study and in three other studies using the risk index to demonstrate the potential impact of differences in population, surveillance, and outcome criteria for cardiac complications. If Goldman's detection and outcome criteria were employed and only the eligibility criteria used for assembly of the populations differed, the overall complication rates would be between 5.2% and 6.9%; and the complication rates for the different Goldman classes were similar. When both different detection strategies and different outcome criteria were used, however, important discrepancies in cardiac complication rates emerged. For example, complication rates in class 2 varied from 2% to 23%. In conclusion, important discrepancies in performance of prognostic indexes may arise from differences in surveillance strategies and definitions of outcome. With sufficient attention to methodologic consistency, the performance of predictive indexes may not inevitably deteriorate in subsequent studies.

[1]  A. Detsky,et al.  Cardiac Assessment for Patients Undergoing Noncardiac Surgery , 1987 .

[2]  E. Cook,et al.  Evaluation of creatine kinase and creatine kinase-MB for diagnosing myocardial infarction. Clinical impact in the emergency room. , 1987, Archives of internal medicine.

[3]  A S Detsky,et al.  Cardiac assessment for patients undergoing noncardiac surgery. A multifactorial clinical risk index. , 1986, Archives of internal medicine.

[4]  R D Cebul,et al.  The importance of disease prevalence in transporting clinical prediction rules. The case of streptococcal pharyngitis. , 1986, Annals of internal medicine.

[5]  R. Adolph,et al.  Cardiac prognosis in noncardiac geriatric surgery. , 1985, Annals of internal medicine.

[6]  F. Harrell,et al.  Regression models for prognostic prediction: advantages, problems, and suggested solutions. , 1985, Cancer treatment reports.

[7]  H. Sox,et al.  Clinical prediction rules. Applications and methodological standards. , 1985, The New England journal of medicine.

[8]  M. Charlson,et al.  Applying results of randomised trials to clinical practice: impact of losses before randomisation. , 1984, British medical journal.

[9]  Zeldin Ra Assessing cardiac risk in patients who undergo noncardiac surgical procedures. , 1984 .

[10]  F. Harrell,et al.  Regression modelling strategies for improved prognostic prediction. , 1984, Statistics in medicine.

[11]  Rose Cc,et al.  Performance of an Index Predicting the Response of Patients with Acute Bronchial Asthma to Intensive Emergency Department Treatment , 1984 .

[12]  R M Centor,et al.  Inability to predict relapse in acute asthma. , 1984, The New England journal of medicine.

[13]  D. Freedman A Note on Screening Regression Equations , 1983 .

[14]  D. C. Brewster,et al.  A Prospective Evaluation of Cardiac Risk Index , 1983 .

[15]  B. Efron,et al.  A Leisurely Look at the Bootstrap, the Jackknife, and , 1983 .

[16]  E F Cook,et al.  Comparative Reproducibility and Validity of Systems for Assessing Cardiovascular Functional Class: Advantages of a New Specific Activity Scale , 1981, Circulation.

[17]  M A Fischl,et al.  An index predicting relapse and need for hospitalization in patients with acute bronchial asthma. , 1981, The New England journal of medicine.

[18]  Ronald D. Snee,et al.  Validation of Regression Models: Methods and Examples , 1977 .

[19]  L Goldman,et al.  Multifactorial index of cardiac risk in noncardiac surgical procedures. , 1977, The New England journal of medicine.

[20]  P A Lachenbruch,et al.  Some Misuses of Discriminant Analysis , 1977, Methods of Information in Medicine.

[21]  A. Chalmers,et al.  The effect of dilution on creatine kinase activity. , 1976, Clinica chimica acta; international journal of clinical chemistry.

[22]  R. Snee,et al.  Ridge Regression in Practice , 1975 .

[23]  B. Sobel,et al.  Quantification of serum creatine phosphokinase isoenzyme activity. , 1974, The American journal of cardiology.

[24]  D. Mercer Separation of tissue and serum creatine kinase isoenzymes by ion-exchange column chromatography. , 1974, Clinical chemistry.

[25]  S. Rosalki An improved procedure for serum creatine phosphokinase determination. , 1967, The Journal of laboratory and clinical medicine.

[26]  Q. Mcnemar Note on the sampling error of the difference between correlated proportions or percentages , 1947, Psychometrika.