Short-term prediction of mortality in patients with systemic lupus erythematosus: classification of outcomes using random forests.

OBJECTIVE: To identify demographic and clinical characteristics that classify patients with systemic lupus erythematosus (SLE) at risk for in-hospital mortality.

METHODS: Patients hospitalized in California from 1996 to 2000 with a principal diagnosis of SLE (N = 3,839) were identified from a state hospitalization database. As candidate predictors of mortality, we used patient demographic characteristics; the presence or absence of 40 different clinical conditions listed among the discharge diagnoses; and 2 summary indexes derived from the discharge diagnoses, the Charlson Index and the SLE Comorbidity Index. Predictors of patients at increased risk of mortality were identified and validated using random forests, a statistical procedure that is a generalization of single classification trees. Random forests use bootstrapped samples of patients and randomly selected subsets of predictors to create individual classification trees, and this process is repeated to generate multiple trees (a forest). Classification is then done by majority vote across all trees.

RESULTS: Of the 3,839 patients, 109 died during hospitalization. Selecting from all available predictors, the random forests had excellent predictive accuracy for classification of death. The mean classification error rate, averaged over 10 forests of 500 trees each, was 11.9%. The most important predictors were the Charlson Index, respiratory failure, SLE Comorbidity Index, age, sepsis, nephritis, and thrombocytopenia.

CONCLUSION: Information on clinical diagnoses can be used to accurately predict mortality among hospitalized patients with SLE. Random forests represent a useful technique to identify the most important predictors from a larger (often much larger) number and to validate the classification.
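The random forest procedure described in the methods (bootstrapped samples of patients, a randomly selected subset of predictors at each split, and a majority vote across trees) can be illustrated with a minimal sketch. This is not the authors' analysis code: the function names, the binary toy predictors, the Gini split criterion, the depth limit, and the synthetic data are all assumptions made for illustration, and the toy outcome is constructed to depend on a single predictor.

```python
import random
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def majority(labels):
    """Most common label (ties go to the label seen first)."""
    return Counter(labels).most_common(1)[0][0]


def build_tree(rows, labels, n_try, rng, depth=0, max_depth=3):
    """Grow one classification tree on binary (0/1) predictors.

    At each node only a random subset of n_try predictors is considered.
    A leaf is a bare label; an internal node is (feature, left, right).
    """
    if depth >= max_depth or len(set(labels)) == 1:
        return majority(labels)
    best = None
    for f in rng.sample(range(len(rows[0])), n_try):
        left = [i for i, r in enumerate(rows) if r[f] == 0]
        right = [i for i, r in enumerate(rows) if r[f] == 1]
        if not left or not right:
            continue  # this predictor does not split the node
        score = (len(left) * gini([labels[i] for i in left])
                 + len(right) * gini([labels[i] for i in right])) / len(rows)
        if best is None or score < best[0]:
            best = (score, f, left, right)
    if best is None:
        return majority(labels)
    _, f, left, right = best
    return (
        f,
        build_tree([rows[i] for i in left], [labels[i] for i in left],
                   n_try, rng, depth + 1, max_depth),
        build_tree([rows[i] for i in right], [labels[i] for i in right],
                   n_try, rng, depth + 1, max_depth),
    )


def predict_tree(tree, row):
    """Walk from the root to a leaf and return the leaf label."""
    while isinstance(tree, tuple):
        f, low, high = tree
        tree = high if row[f] == 1 else low
    return tree


def build_forest(rows, labels, n_trees=51, n_try=2, seed=0):
    """Fit n_trees trees, each on a bootstrap sample of the patients."""
    rng = random.Random(seed)
    n = len(rows)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        forest.append(build_tree([rows[i] for i in idx],
                                 [labels[i] for i in idx], n_try, rng))
    return forest


def predict_forest(forest, row):
    """Classify by majority vote across all trees."""
    return majority([predict_tree(t, row) for t in forest])


# Synthetic toy data (hypothetical): 4 binary predictors, every combination
# repeated 4 times; the outcome is set equal to predictor 0.
combos = [[(k >> b) & 1 for b in range(4)] for k in range(16)]
rows = [list(c) for c in combos for _ in range(4)]
labels = [r[0] for r in rows]
forest = build_forest(rows, labels)
```

Calling `predict_forest(forest, row)` on a new row of four 0/1 values returns the class chosen by most trees. Practical analyses would normally rely on an established implementation rather than a hand-written one; the sketch is only meant to make the three steps concrete.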
