Needle in a Haystack: Natural Language Processing to Identify Serious Illness.

BACKGROUND Alone, administrative data poorly identifies patients with palliative care needs. OBJECTIVE To identify patients with uncommon, yet devastating, illnesses using a combination of administrative data and natural language processing (NLP). DESIGN/SETTING Retrospective cohort study using the electronic medical records of a healthcare network totaling over 2500 hospital beds. We sought to identify patient populations with two unique disease processes associated with a poor prognosis: pneumoperitoneum and leptomeningeal metastases from breast cancer. MEASUREMENTS Patients with pneumoperitoneum or leptomeningeal metastasis from breast cancer were identified through administrative codes and NLP. RESULTS Administrative codes alone resulted in identification of 6438 patients with possible pneumoperitoneum and 557 patients with possible leptomeningeal metastasis. Adding NLP to this analysis reduced the number of patients to 869 with pneumoperitoneum and 187 with leptomeningeal metastasis secondary to breast cancer. Administrative codes alone yielded a 13% positive predictive value (PPV) for pneumoperitoneum and 25% PPV for leptomeningeal metastasis. The combination of administrative codes and NLP achieved a PPV of 100%. The entire process was completed within hours. CONCLUSIONS Adding NLP to the use of administrative codes allows for rapid identification of seriously ill patients with otherwise difficult to detect disease processes and eliminates costly, tedious, and time-intensive manual chart review. This method enables studies to evaluate the effectiveness of treatment, including palliative interventions, for unique populations of seriously ill patients who cannot be identified by administrative codes alone.

[1]  L. Gilson,et al.  Building the Field of Health Policy and Systems Research: Social Science Matters , 2011, PLoS medicine.

[2]  A. Walker,et al.  ‘Caveat emptor’: the cautionary tale of endocarditis and the potential pitfalls of clinical coding data—an electronic health records study , 2019, BMC Medicine.

[3]  Mark T Hegel,et al.  Effects of a palliative care intervention on clinical outcomes in patients with advanced cancer: the Project ENABLE II randomized controlled trial. , 2009, JAMA.

[4]  K. Pogoda,et al.  Determinants of prolonged survival for breast cancer patient groups with leptomeningeal metastasis (LM) , 2018, Journal of Neuro-Oncology.

[5]  L. Gilson,et al.  Building the Field of Health Policy and Systems Research: Framing the Questions , 2011, PLoS medicine.

[6]  T. Murdoch,et al.  The inevitable application of big data to health care. , 2013, JAMA.

[7]  Vincent Mor,et al.  Change in end-of-life care for Medicare beneficiaries: site of death, place of care, and health care transitions in 2000, 2005, and 2009. , 2013, JAMA.

[8]  L. Gilson,et al.  Building the Field of Health Policy and Systems Research: An Agenda for Action , 2011, PLoS medicine.

[9]  R. Morrison,et al.  Determinants of Medical Expenditures in the Last 6 Months of Life , 2011, Annals of Internal Medicine.

[10]  S KelleyAmy,et al.  Identifying the Population with Serious Illness: The “Denominator” Challenge , 2018 .

[11]  H. Krumholz Big data and new knowledge in medicine: the thinking, training, and tools needed for a learning health system. , 2014, Health affairs.

[12]  A. Walling,et al.  Defining Serious Illness Among Adult Surgical Patients. , 2019, Journal of pain and symptom management.

[13]  B. Ferrell,et al.  Frequent and Early Death Limits Quality of Life Assessment in Patients with Advanced Malignancies Evaluated for Palliative Surgical Intervention , 2012, Annals of Surgical Oncology.

[14]  S. Enguídanos,et al.  Increased Satisfaction with Care and Lower Costs: Results of a Randomized Trial of In‐Home Palliative Care , 2007, Journal of the American Geriatrics Society.

[15]  S. Johnston,et al.  Treatment and prognosis of leptomeningeal disease secondary to metastatic breast cancer: A single-centre experience. , 2017, Breast.

[16]  P. May,et al.  Improving palliative care with machine learning and routine data: a rapid review , 2019, HRB open research.

[17]  D. Bates,et al.  Big data in health care: using analytics to identify and manage high-risk and high-cost patients. , 2014, Health affairs.

[18]  M. Ross,et al.  Pneumoperitoneum in the Cancer Patient , 2007, Annals of Surgical Oncology.