Mining high-dimensional administrative claims data to predict early hospital readmissions

BACKGROUND Current readmission models use administrative data supplemented with clinical information. However, the majority of these result in poor predictive performance (area under the curve (AUC)<0.70). OBJECTIVE To develop an administrative claim-based algorithm to predict 30-day readmission using standardized billing codes and basic admission characteristics available before discharge. MATERIALS AND METHODS The algorithm works by exploiting high-dimensional information in administrative claims data and automatically selecting empirical risk factors. We applied the algorithm to index admissions in two types of hospitalized patient: (1) medical patients and (2) patients with chronic pancreatitis (CP). We trained the models on 26,091 medical admissions and 3218 CP admissions from The Johns Hopkins Hospital (a tertiary research medical center) and tested them on 16,194 medical admissions and 706 CP admissions from Johns Hopkins Bayview Medical Center (a hospital that serves a more general patient population), and vice versa. Performance metrics included AUC, sensitivity, specificity, positive predictive values, negative predictive values, and F-measure. RESULTS From a pool of up to 5665 International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) diagnoses, 599 ICD-9-CM procedures, and 1815 Current Procedural Terminology codes observed, the algorithm learned a model consisting of 18 attributes from the medical patient cohort and five attributes from the CP cohort. Within-site and across-site validations had an AUC≥0.75 for the medical patient cohort and an AUC≥0.65 for the CP cohort. CONCLUSIONS We have created an algorithm that is widely applicable to various patient cohorts and portable across institutions. The algorithm performed similarly to state-of-the-art readmission models that require clinical data.

[1]  E. Fisher,et al.  The accuracy of Medicare's hospital claims data: progress has been made, but problems remain. , 1992, American journal of public health.

[2]  B. Yawn,et al.  Identifying Persons with Diabetes Using Medicare Claims Data , 1999, American journal of medical quality : the official journal of the American College of Medical Quality.

[3]  J. Goodwin,et al.  An approach to identifying incident breast cancer cases using Medicare claims data. , 2000, Journal of clinical epidemiology.

[4]  J. Goodwin,et al.  Patterns of use of chemotherapy for breast cancer in older women: findings from Medicare claims data. , 2001, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[5]  C. Klabunde,et al.  Assessing Comorbidity Using Claims Data: An Overview , 2002, Medical care.

[6]  Marlene R. Miller,et al.  Administrative data based patient safety research: a critical review , 2003, Quality & safety in health care.

[7]  E. Schimmel The hazards of hospitalization* , 2003 .

[8]  J. Katz,et al.  Accuracy of Medicare claims data for rheumatologic diagnoses in total hip replacement recipients. , 2003, Journal of clinical epidemiology.

[9]  Sung-joon Min,et al.  Posthospital care transitions: patterns, complications, and risk identification. , 2004, Health services research.

[10]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[11]  J. Avorn,et al.  Identification of individuals with CKD from Medicare claims data: a validation study. , 2005, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[12]  C. Christiansen,et al.  Evaluating the Patient Safety Indicators: How Well Do They Perform on Veterans Health Administration Data? , 2005, Medical care.

[13]  W. Hauck,et al.  Availability of Medical Care Services in Drug Treatment Clinics Associated With Lower Repeated Emergency Department Use , 2005, Medical care.

[14]  S. Normand,et al.  An Administrative Claims Model Suitable for Profiling Hospital Performance Based on 30-Day Mortality Rates Among Patients With an Acute Myocardial Infarction , 2006, Circulation.

[15]  P. Halfon,et al.  Validation of the Potentially Avoidable Hospital Readmission Rate as a Routine Indicator of the Quality of Hospital Care , 2006, Medical care.

[16]  A. Majeed,et al.  Identifying Patients at High Risk of Emergency Hospital Admissions: A Logistic Regression Analysis , 2006 .

[17]  D. Wennberg,et al.  Case finding for patients at risk of readmission to hospital: development of algorithm to identify high risk patients , 2006, BMJ : British Medical Journal.

[18]  D. Hoaglin,et al.  Enhancement of claims data to improve risk adjustment of hospital mortality. , 2007, JAMA.

[19]  J. Walkup,et al.  Studying Prescription Drug Use and Outcomes With Medicaid Claims Data: Strengths, Limitations, and Strategies , 2007, Medical care.

[20]  J. Hughes,et al.  Identifying Potentially Preventable Readmissions , 2008, Health care financing review.

[21]  M. Coory,et al.  Using routine inpatient data to identify patients at risk of hospital readmission , 2009, BMC health services research.

[22]  J. Avorn,et al.  High-dimensional Propensity Score Adjustment in Studies of Treatment Effects Using Health Care Claims Data , 2009, Epidemiology.

[23]  E. Rackow Rehospitalizations among patients in the Medicare fee-for-service program. , 2009, The New England journal of medicine.

[24]  Kari Ferver,et al.  The use of claims data in healthcare research. , 2009 .

[25]  Tosha B. Wetterneck,et al.  Hospital Readmission in General Medicine Patients: A Prediction Model , 2009, Journal of General Internal Medicine.

[26]  J. Rassen,et al.  Confounding Control in Healthcare Database Research: Challenges and Potential Approaches , 2010, Medical care.

[27]  G. Jacobson,et al.  Medicare Spending and Use of Medical Services for Beneficiaries in Nursing Homes and Other Long-Term Care Facilities: , 2010 .

[28]  Y. Tabak,et al.  An Automated Model to Identify Heart Failure Patients at Risk for 30-Day Readmission or Death Using Electronic Medical Record Data , 2010, Medical care.

[29]  P. Austin,et al.  Derivation and validation of an index to predict early death or unplanned readmission after discharge from hospital to the community , 2010, Canadian Medical Association Journal.

[30]  P. Austin,et al.  Proportion of hospital readmissions deemed avoidable: a systematic review , 2011, Canadian Medical Association Journal.

[31]  Michael R. O'Connell,et al.  Hospitalizations for Chronic Pancreatitis in Allegheny County, Pennsylvania, USA , 2011, Pancreatology.

[32]  Amanda H. Salanitro,et al.  Risk prediction models for hospital readmission: a systematic review. , 2011, JAMA.

[33]  Christoph U. Lehmann,et al.  Association of self-reported hospital discharge handoffs with 30-day readmissions. , 2013, JAMA internal medicine.

[34]  E. Marks Complexity science and the readmission dilemma. , 2013, JAMA internal medicine.

[35]  J. Schnipper,et al.  Potentially avoidable 30-day hospital readmissions in medical patients: derivation and validation of a prediction model. , 2013, JAMA internal medicine.

[36]  Reena Mahajan,et al.  Use of the International Classification of Diseases, 9th revision, coding in identifying chronic hepatitis B virus infection in health system data: implications for national surveillance , 2013, J. Am. Medical Informatics Assoc..