The Effect of Clustering of Outcomes on the Association of Procedure Volume and Surgical Outcomes

Context While studies suggest that surgical patients fare best with providers (hospitals and surgeons) that perform a high volume of procedures, most have not accounted for the tendency of patients of 1 provider to have similar outcomes (clustering). When outcomes of individual patients are clustered, more patients are required to prove that providers' outcomes differ from one another. Contribution This reanalysis of data from 3 previously published volumeoutcome studies of surgery accounted for clustering within provider. Statistical significance of volumeoutcome relationships of morbidity end points was attenuated substantially after adjustment for the effects of clustering. Implications Planners considering regionalizing surgery should remember that volumeoutcome studies that have not accounted for clustering exaggerate differences in outcomes by provider. The Editors In an attempt to evaluate the degree to which the choice of provider affects outcomes of major medical procedures, numerous investigators have used procedure volume as a proxy for expertise and have conducted studies correlating volume with outcomes. However, volume is a crude, easily calculated measure, and its use may overlook large variations in quality among providers that are independent of the number of procedures performed. If there are such large variations in outcomes among providers, the outcomes of patients treated by the same provider are necessarily correlated, or clustered. Clustering of this nature invalidates conventional statistical analyses. Patients treated at the same hospital or by the same surgeon may be more likely to experience similar outcomes if surgical technique or supportive care practices vary among providers and if these factors affect outcomes. It is well established that statistical methods must correct for the effect of clustering of this nature if it exists (1-3). In general, correction for clustering attenuates the statistical significance of observed trends. In this article, we describe measures that characterize the degree of clustering of outcomes and methods that permit the calculation of tests of statistical significance that are adjusted for the impact of clustering. We reanalyzed 3 large studies of major cancer surgery (4-7) and found examples that illustrate varying degrees of clustering. This reanalysis focused on the relationship of surgeon volume to outcome. We explored the effect of correction for clustering on the statistical significance of tests for association between surgeon volume and outcome. Graphical methods that visually characterize the degree of between-surgeon variation in quality after adjustment for volume and case mix are described. We also explored analyses that attempt to distinguish between the effects of surgeon volume and those of hospital volume. Methods Data Sources All 3 data sets were obtained by using the linkage of the Surveillance, Epidemiology, and End Results registries with Medicare claims from the Centers for Medicare & Medicaid Services (formerly Health Care Financing Administration) (8). These linkages permit evaluation of the outcomes of surgery for all patients who received a diagnosis of cancer in the geographical regions defined by the registries. Patients were enrolled in Part A and Part B Medicare but were not enrolled in a health maintenance organization, and each had a diagnosis of colon, prostate, or rectal cancer made at age 65 years or older in a region covered by one of the Surveillance, Epidemiology, and End Results registries. All tumors were primary, invasive, and malignant and were diagnosed during 1992 to 1996. The sample sizes for the 3 cohorts ranged from very large (colon cancer, n = 24166) to large (prostate cancer, n = 10737) to moderate (rectal cancer, n = 2603). These analyses encompassed 2682, 999, and 1141 surgeons, respectively. The median numbers of patients per surgeon were 5 (range, 1 to 85), 7 (range, 1 to 121), and 1 (range, 1 to 26), respectively. The particular billing codes used to identify the surgeon, the specific surgical procedures, and the outcomes have been described in detail in the original publications (4-7), but our essential approach is summarized as follows. Procedure Volume Surgeon volume was defined as the total number of operations performed by the given surgeon between 1992 and 1996 on members of each cohort, as ascertained from the Medicare files on the basis of International Classification of Diseases, Ninth Revision, procedure codes (9). The Common Procedural Terminology codes submitted in the National Claims History files were used to identify the procedures (10). Surgeon identity was established on the basis of a unique identifier that has been mandatory on all claims since 1991. Outcomes Patient outcomes were binary indicators for 2-year mortality or a particular postoperative complication, depending on the cancer cohort. For the colon and rectal cancer data sets, mortality at 2 years and occurrence of a procedure impairing fecal continence (ileostomy or colostomy in colon cancer and an abdominoperineal resection in rectal cancer) were the outcomes evaluated. These measures of morbidity were selected because they can have an important impact on the patient's subsequent quality of life. For the prostate cancer cohort, postoperative and late urinary complications were examined. Postoperative complications were defined as potentially life-threatening cardiac, respiratory, or vascular events; the need for reoperation; and hemorrhage, renal failure, or shock, all occurring within 30 days of surgery. Late urinary complications were identified by procedures or symptoms recorded more than 30 days after but within 1 year of surgery and included diagnoses of bladder-neck obstruction, urethral or ureteral strictures, intestinal or vesical fistulas, pelvic abscess, and other urinary tract complications that required surgical repair. Case Mix In each primary analysis, clinical and demographic variables were used to adjust for differences in patient case mix among individual surgeons. These included age; sex; ethnicity; disease stage at diagnosis; disease grade; income level; and the presence of particular features such as emergent presentation, obstruction, or perforation. All results shown in the tables and figures are adjusted for case mix. Alternative Statistical Methods for Cluster-Correlated Data We used and compared 2 general statistical approaches that permit adjustments to account for the effect of clustering of outcomes on statistical significance levels: random-effects models, in which the impact of each surgeon is modeled explicitly (11), and generalized estimating equations (GEEs), in which the underlying patient-specific analysis is adjusted to accommodate the effect of clustering on the statistical significance of the analyses (12). The GEE method does not alter the estimate of the magnitude of the impact of surgeon volume on outcome; only the statistical significance (and the width of the confidence interval) is altered. In the random-effects model, the estimate may change after correction for clustering. Both of these methods are widely used in health services research and clinical research. We evaluated the effect of surgeon volume on outcomes in 3 settings: adjustment solely for case mix; adjustment for case mix and hospital volume; and adjustment for case mix, hospital volume, and clustering. Logistic regression was used throughout to estimate the impact of volume on outcome. The random-effects models were fitted by using the gllamm6 command in Stata, version 7.0 (Stata Corp., College Station, Texas), and the GEE models were fitted by using the GENMOD procedure in SAS, version 8.0 (SAS Institute, Inc., Cary, North Carolina). Volume was modeled as a continuous variable in all analyses. Odds ratios are reported per 100-unit decline in volume increments for the colon and prostate cancer analyses and per 10-unit decline for the rectal cancer analysis. A Graphical Assessment and Statistical Test for Clustering of Outcomes We used a graphical technique to characterize visually the extent of variation among providers. Only surgeons with a minimum number of patients treated were used to ensure that each surgeon had a sufficient number of patients in his or her profile to allow meaningful estimates of the mean departure from the overall event rate. As a result, we did not construct graphs for the rectal cancer data set, since the vast majority of the surgeons treated very few patients. The observed outcome relative frequencies, that is, the proportion of patients who experienced the outcome, were evaluated for each surgeon. A conventional multivariable logistic regression analysis that included surgeon volume, hospital volume, and case mix as predictor variables was used to estimate the patient-specific outcome probabilities. The sum of these probabilities for each patient in the surgeon's profile was calculated as the expected event rate for the surgeon. To obtain the graph, the observed frequencies were plotted in a histogram. An expected histogram was created to display the outcome histogram that would be expected if there was no additional surgeon-to-surgeon variation after adjustment for known predictors of outcome in the model. For each surgeon, the binomial distribution based on the preceding expected event rate and the number of patients in the surgeon's profile was used to determine probabilities for each possible observed outcome relative frequency (in a histogram grouped by cells of 5 percentage points). For each cell of the histogram, these probabilities were summed across all surgeons to obtain the expected histogram. This expected histogram reflects the degree of spread we would expect to observe in the absence of clustering. The spread of the observed distribution will exceed the spread of the expected distribution in the presence of clustering. To test whether differences between the 2 histogr

[1]  Ethan A Halm,et al.  Is Volume Related to Outcome in Health Care? A Systematic Review and Methodologic Critique of the Literature , 2002, Annals of Internal Medicine.

[2]  C. Muir,et al.  International Classification of Diseases for Oncology , 1990 .

[3]  J M Neuhaus,et al.  An annotated bibliography of methods for analysing correlated categorical data. , 1992, Statistics in medicine.

[4]  A. Localio,et al.  Adjustments for Center in Multicenter Studies: An Overview , 2001, Annals of Internal Medicine.

[5]  N. Powe,et al.  The association between hospital volume and survival after acute myocardial infarction in elderly patients. , 1999, The New England journal of medicine.

[6]  C. Begg,et al.  Influence of hospital procedure volume on outcomes following surgery for colon cancer. , 2000, JAMA.

[7]  B. Gordon Current procedural terminology , 1966 .

[8]  F. Kee,et al.  Influence of hospital and clinician workload on survival from colorectal cancer: cohort study. , 1999, BMJ.

[9]  E. Hannan,et al.  The decline in coronary artery bypass graft surgery mortality in New York State. The role of surgeon volume. , 1995, JAMA.

[10]  Noreen Goldman,et al.  An assessment of estimation procedures for multilevel models with binary responses , 1995 .

[11]  J. Roberts,et al.  The effect of the volume of procedures at transplantation centers on mortality after liver transplantation. , 1999, The New England journal of medicine.

[12]  R. Littell SAS System for Mixed Models , 1996 .

[13]  E L Hannan,et al.  Coronary angioplasty volume-outcome relationships for hospitals and cardiologists. , 1997, JAMA.

[14]  E L Hannan,et al.  Improving the outcomes of coronary artery bypass surgery in New York State. , 1994, JAMA.

[15]  A. Donner,et al.  Analysis of data arising from a stratified design with the cluster as unit of randomization. , 1987, Statistics in medicine.

[16]  E. Hannan,et al.  Relationship between provider volume and mortality for carotid endarterectomies in New York state. , 1998, Stroke.

[17]  P. Walsh Variations in morbidity after radical prostatectomy. , 2002, The Journal of urology.

[18]  C. Begg,et al.  The influence of hospital volume on survival after resection for lung cancer. , 2001, The New England journal of medicine.

[19]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[20]  C. Begg,et al.  Surgeon volume compared to hospital volume as a predictor of outcome following primary colon cancer resection , 2003, Journal of surgical oncology.

[21]  L. Kessler,et al.  Potential for Cancer Related Health Services Research Using a Linked Medicare‐Tumor Registry Database , 1993, Medical care.

[22]  E L Hannan,et al.  New York State's Cardiac Surgery Reporting System: four years later. , 1994, The Annals of thoracic surgery.

[23]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[24]  J. Birkmeyer,et al.  Hospital volume and surgical mortality in the United States. , 2002, The New England journal of medicine.

[25]  P. Pronovost,et al.  Hospital volume is related to clinical and economic outcomes of esophageal resection in Maryland. , 2001, The Annals of thoracic surgery.

[26]  T. Ryan,et al.  Relation between operator and hospital volume and outcomes following percutaneous coronary interventions in the era of the coronary stent. , 2000, JAMA.

[27]  M. Gail,et al.  Interplay between design and analysis for behavioral intervention trials with community as the unit of randomization. , 1995, American journal of epidemiology.

[28]  J. Birkmeyer,et al.  Relationship between hospital volume and late survival after pancreaticoduodenectomy. , 1999, Surgery.

[29]  E. Hannan,et al.  Investigation of the relationship between volume and mortality for surgical procedures performed in New York State hospitals. , 1990, JAMA.

[30]  Sheldon Greenfield,et al.  Profiling Care Provided by Different Groups of Physicians: Effects of Patient Case-Mix (Bias) and Physician-Level Clustering on Quality Assessment Results , 2002, Annals of Internal Medicine.

[31]  S. Yao,et al.  Population-based study of relationships between hospital volume of prostatectomies, patient outcomes, and length of hospital stay. , 1999, Journal of the National Cancer Institute.

[32]  A. Polednak Hospital volume and survival of breast cancer patients in Connecticut. , 1999, American journal of public health.

[33]  N. Bickell,et al.  Hospital volume differences and five-year survival from breast cancer. , 1998, American journal of public health.

[34]  C. Begg,et al.  Taking stock of volume-outcome studies. , 2003, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[35]  J. Birkmeyer,et al.  Volume standards for high-risk surgical procedures: potential benefits of the Leapfrog initiative. , 2001, Surgery.

[36]  S. Mehta,et al.  Role of surgeon volume in radical prostatectomy outcomes. , 2003, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[37]  R. Glasgow,et al.  Hospital volume influences outcome in patients undergoing pancreatic resection for cancer. , 1996, The Western journal of medicine.

[38]  C. Begg,et al.  Hospital and Surgeon Procedure Volume as Predictors of Outcome Following Rectal Cancer Resection , 2002, Annals of surgery.

[39]  P. Katz,et al.  The relationship between hospital volume and outcomes of hepatic resection for hepatocellular carcinoma. , 1999, Archives of surgery.

[40]  C. Begg,et al.  Impact of hospital volume on operative mortality for major cancer surgery. , 1998, JAMA.