A comparison of a Bayesian vs. a frequentist method for profiling hospital performance.

The objective of this study was to compare the classification of hospitals as outcomes outliers using a commonly implemented frequentist statistical approach vs. an implementation of Bayesian hierarchical statistical models, using 30-day hospital-level mortality rates for a cohort of acute myocardial infarction patients as a test case. For the frequentist approach, a logistic regression model was constructed to predict mortality. For each hospital, a risk-adj usted mortality rate was computed. Those hospitals whose 95% confidence interval, around the risk-adjusted mortality rate, excludes the mean mortality rate were classified as outliers. With the Bayesian hierarchical models, three factors could vary: the profile of the typical patient (low, medium or high risk), the extent to which the mortality rate for the typical patient departed from average, and the probability that the mortality rate was indeed different by the specified amount. The agreement between the two methods was compared for different patient profiles, threshold differences from the average and probabilities. Only marginal agreement was shown between the Bayesian and frequentist approaches. In only five of the 27 comparisons was the kappa statistic at least 0.40. The remaining 22 comparisons demonstrated only marginal agreement between the two methods. Within the Bayesian framework, hospital classification clearly depended on patient profile, threshold and probability of exceeding the threshold. These inconsistencies raise questions about the validity of current methods for classifying hospital performance, and suggest a need for urgent research into which methods are most meaningful to clinicians, managers and the general public.

[1]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[2]  David J. Spiegelhalter,et al.  Introducing Markov chain Monte Carlo , 1995 .

[3]  E. DeLong,et al.  Comparing risk-adjustment methods for provider profiling. , 1998, Statistics in medicine.

[4]  G. Y. Wong,et al.  The Hierarchical Logistic Regression Model for Multilevel Analysis , 1985 .

[5]  C. Morris,et al.  Development and validation of a claims based index for adjusting for risk of mortality: the case of acute myocardial infarction. , 1995, Journal of clinical epidemiology.

[6]  C L Christiansen,et al.  Improving the Statistical Approach to Health Care Provider Profiling , 1997, Annals of Internal Medicine.

[7]  E. Braunwald,et al.  Predictors of Early Morbidity and Mortality After Thrombolytic Therapy of Acute Myocardial Infarction: Analyses of Patient Subgroups in the Thrombolysis in Myocardial Infarction (TIMI) Trial, Phase II , 1992, Circulation.

[8]  E L Hannan,et al.  Benefits and hazards of reporting medical outcomes publicly. , 1996, The New England journal of medicine.

[9]  L I Iezzoni,et al.  Predicting In-hospital Mortality for Stroke Patients , 1996, Medical decision making : an international journal of the Society for Medical Decision Making.

[10]  S Lemeshow,et al.  Confidence interval estimates of an index of quality performance based on logistic regression models. , 1995, Statistics in medicine.

[11]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[12]  L. Iezzoni Risk Adjustment for Measuring Healthcare Outcomes , 1994 .

[13]  L. Iezzoni,et al.  Using risk-adjusted outcomes to assess clinical practice: an overview of issues pertaining to risk adjustment. , 1994, The Annals of thoracic surgery.

[14]  L I Iezzoni,et al.  Predicting Who Dies Depends on How Severity Is Measured: Implications for Evaluating Patient Outcomes , 1995, Annals of Internal Medicine.

[15]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[16]  J. Fleiss,et al.  Risk stratification and survival after myocardial infarction. , 1983, The New England journal of medicine.

[17]  A. Vera,et al.  Prediction on admission of in-hospital mortality in patients older than 70 years with acute myocardial infarction. , 1995, Chest.

[18]  J. Allison Risk adjustment for measuring health care outcomes , 1996 .

[19]  C A Gatsonis,et al.  Variations in the Utilization of Coronary Angiography for Elderly Patients with an Acute Myocardial Infarction: An Analysis Using Hierarchical Logistic Regression , 1995, Medical care.

[20]  A. Albert,et al.  Short‐term risk stratification at admission based on simple clinical data in acute myocardial infarction , 1989, The American journal of cardiology.

[21]  E. Negri,et al.  Determinants of 6‐Month Mortality in Survivors of Myocardial Infarction After Thrombolysis Results of the GISSI‐2 Data Base , 1993, Circulation.

[22]  Walter R. Gilks,et al.  A Language and Program for Complex Bayesian Modelling , 1994 .

[23]  L. Joseph,et al.  Bayesian Statistics: An Introduction , 1989 .

[24]  L. Iezzoni,et al.  Severity measurement methods and judging hospital death rates for pneumonia. , 1996, Medical care.

[25]  Alastair H Leyland,et al.  League tables and acute myocardial infarction , 1998, The Lancet.

[26]  M. Glickman,et al.  Statistical Methods for Profiling Providers of Medical Care: Issues and Applications , 1997 .

[27]  D. Spiegelhalter,et al.  Reliability of league tables of in vitro fertilisation clinics: retrospective analysis of live birth rates. , 1998, BMJ.

[28]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[29]  P. Austin,et al.  Temporal changes in the outcomes of acute myocardial infarction in Ontario, 1992-1996. , 1999, CMAJ : Canadian Medical Association journal = journal de l'Association medicale canadienne.

[30]  David J. Spiegelhalter,et al.  Hepatitis B: a case study in MCMC methods , 1996 .