Identification of Patients in Need of Advanced Care for Depression Using Data Extracted From a Statewide Health Information Exchange: A Machine Learning Approach

Background As the most commonly occurring form of mental illness worldwide, depression poses significant health and economic burdens to both the individual and community. Different types of depression pose different levels of risk. Individuals who suffer from mild forms of depression may recover without any assistance or be effectively managed by primary care or family practitioners. However, other forms of depression are far more severe and require advanced care by certified mental health providers. However, identifying cases of depression that require advanced care may be challenging to primary care providers and health care team members whose skill sets run broad rather than deep. Objective This study aimed to leverage a comprehensive range of patient-level diagnostic, behavioral, and demographic data, as well as past visit history data from a statewide health information exchange to build decision models capable of predicting the need of advanced care for depression across patients presenting at Eskenazi Health, the public safety net health system for Marion County, Indianapolis, Indiana. Methods Patient-level diagnostic, behavioral, demographic, and past visit history data extracted from structured datasets were merged with outcome variables extracted from unstructured free-text datasets and were used to train random forest decision models that predicted the need of advanced care for depression across (1) the overall patient population and (2) various subsets of patients at higher risk for depression-related adverse events; patients with a past diagnosis of depression; patients with a Charlson comorbidity index of ≥1; patients with a Charlson comorbidity index of ≥2; and all unique patients identified across the 3 above-mentioned high-risk groups. Results The overall patient population consisted of 84,317 adult (aged ≥18 years) patients. A total of 6992 (8.29%) of these patients were in need of advanced care for depression. Decision models for high-risk patient groups yielded area under the curve (AUC) scores between 86.31% and 94.43%. The decision model for the overall patient population yielded a comparatively lower AUC score of 78.87%. The variance of optimal sensitivity and specificity for all decision models, as identified using Youden J Index, is as follows: sensitivity=68.79% to 83.91% and specificity=76.03% to 92.18%. Conclusions This study demonstrates the ability to automate screening for patients in need of advanced care for depression across (1) an overall patient population or (2) various high-risk patient groups using structured datasets covering acute and chronic conditions, patient demographics, behaviors, and past visit history. Furthermore, these results show considerable potential to enable preventative care and can be easily integrated into existing clinical workflows to improve access to wraparound health care services.

[1]  Rebecca Gray,et al.  The Patient-Centered Medical Home , 2013, Annals of Internal Medicine.

[2]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[3]  L. Sharp,et al.  Screening for depression across the lifespan: a review of measures for use in primary care settings. , 2002, American family physician.

[4]  Mark Olfson,et al.  Proportion of antidepressants prescribed without a psychiatric diagnosis is growing. , 2011, Health affairs.

[5]  Hadi Kharrazi,et al.  IT-enabled Community Health Interventions: Challenges, Opportunities, and Future Directions , 2014, EGEMS.

[6]  Blair T. Johnson,et al.  Rethinking recommendations for screening for depression in primary care , 2012, Canadian Medical Association Journal.

[7]  Ricardo Araya,et al.  Computerised cognitive behaviour therapy (cCBT) as treatment for depression in primary care (REEACT trial): large scale pragmatic randomised controlled trial , 2016, BMJ : British Medical Journal.

[8]  Gerhard Andersson,et al.  Psychotherapy for Depression in Adults: A Meta-Analysis of Comparative Outcome Studies , 2010 .

[9]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[10]  B. Druss,et al.  Mental disorders and medical comorbidity. , 2011, The Synthesis project. Research synthesis report.

[11]  C. Hewitt,et al.  Screening for Depression in Medical Settings with the Patient Health Questionnaire (PHQ): A Diagnostic Meta-Analysis , 2007, Journal of General Internal Medicine.

[12]  J. Markowitz,et al.  Psychotherapy effectiveness for major depression: a randomized trial in a Finnish community , 2016, BMC Psychiatry.

[13]  M. Giacomini,et al.  Patient experiences of depression and anxiety with chronic disease: a systematic review and qualitative meta-synthesis. , 2013, Ontario health technology assessment series.

[14]  Lonnie Blevins,et al.  The Indiana network for patient care: a working local health information infrastructure. An example of a working infrastructure collaboration that links data from five health systems and hundreds of millions of entries. , 2005, Health affairs.

[15]  J. Marc Overhage,et al.  Case Study 1 – The Indiana Health Information Exchange , 2016 .

[16]  W. Katon,et al.  Treatment of dysthymia and minor depression in primary care: A randomized controlled trial in older adults. , 2000, JAMA.

[17]  C. Dowrick,et al.  Medicalising unhappiness: new classification of depression risks more patients being put on drug treatment from which they will not benefit , 2013, BMJ.

[18]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[19]  Hadi Kharrazi,et al.  The Value of Unstructured Electronic Health Record Data in Geriatric Syndrome Case Identification , 2018, Journal of the American Geriatrics Society.

[20]  Hadi Kharrazi,et al.  Comparing Population-based Risk-stratification Model Performance Using Demographic, Diagnosis and Medication Data Extracted From Outpatient Electronic Health Records Versus Administrative Claims , 2017, Medical care.

[21]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Yanjun Qi Random Forest for Bioinformatics , 2012 .

[23]  D. Melzer,et al.  Clinical Practice and Epidemiology in Mental Health the Distribution of the Common Mental Disorders: Social Inequalities in Europe , 2005 .

[24]  Shaun J. Grannis,et al.  Toward better public health reporting using existing off the shelf approaches: A comparison of alternative cancer detection approaches using plaintext medical data and non-dictionary based feature selection , 2016, J. Biomed. Informatics.

[25]  C. Whittington,et al.  Brief psychological therapies for anxiety and depression in primary care: meta-analysis and meta-regression , 2010, BMC medicine.

[26]  L. Kerr,et al.  Screening tools for depression in primary care: the effects of culture, gender, and somatic symptoms on the detection of depression. , 2001, The Western journal of medicine.

[27]  H. Wagner,et al.  Minor depression in family practice: functional morbidity, co-morbidity, service utilization and outcomes , 2000, Psychological Medicine.

[28]  J. Zeber,et al.  The cost-utility of screening for depression in primary care. , 2001, Annals of internal medicine.

[29]  A. Beck,et al.  Beck Depression Inventory–II , 2011 .

[30]  Yiqiang Chen,et al.  Weighted extreme learning machine for imbalance learning , 2013, Neurocomputing.

[31]  M. Salek Cornell Scale for Depression in Dementia , 1997, International Psychogeriatrics.

[32]  Fernando Nogueira,et al.  Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning , 2016, J. Mach. Learn. Res..

[33]  Ronald C Kessler,et al.  The economic burden of adults with major depressive disorder in the United States (2005 and 2010). , 2015, The Journal of clinical psychiatry.

[34]  R. Spitzer,et al.  The PHQ-15: Validity of a New Measure for Evaluating the Severity of Somatic Symptoms , 2002, Psychosomatic medicine.

[35]  A. Yohannes,et al.  Depression and anxiety in chronic heart failure and chronic obstructive pulmonary disease: prevalence, relevance, clinical implications and management principles , 2010, International journal of geriatric psychiatry.

[36]  Lorie A. Kloda,et al.  There are no randomized controlled trials that support the United States Preventive Services Task Force guideline on screening for depression in primary care: a systematic review , 2014, BMC Medicine.

[37]  P. Corrigan How stigma interferes with mental health care. , 2004, The American psychologist.

[38]  N. Unnikrishnan Nair,et al.  Kullback–Leibler divergence: A quantile approach , 2016 .

[39]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[40]  Elizabeth A. Bayliss,et al.  Primary Care Physician Insights Into a Typology of the Complex Patient in Primary Care , 2015, The Annals of Family Medicine.

[41]  Mohammad Khalilia,et al.  Predicting disease risks from highly imbalanced data using random forest , 2011, BMC Medical Informatics Decis. Mak..

[42]  J. Lépine,et al.  The increasing burden of depression , 2011, Neuropsychiatric disease and treatment.

[43]  John W. Loonsk,et al.  A proposed national research and development agenda for population health informatics: summary recommendations from a national expert workshop , 2017, J. Am. Medical Informatics Assoc..

[44]  J. Ockene,et al.  Screening for depression in adults: U.S. preventive services task force recommendation statement. , 2009, Annals of internal medicine.

[45]  Spyridon S Marinopoulos,et al.  The Charlson comorbidity index is adapted to predict costs of chronic disease in primary care patients. , 2008, Journal of clinical epidemiology.

[46]  C. Sherbourne,et al.  Impact of disseminating quality improvement programs for depression in managed primary care: a randomized controlled trial. , 2000, JAMA.

[47]  S. Kasthurirathne The use of clinical, behavioral, and social determinants of health to improve identification of patients in need of advanced care for depression , 2018 .

[48]  Shaun J. Grannis,et al.  Assessing the capacity of social determinants of health data to augment predictive models identifying patients in need of wraparound social services , 2018, J. Am. Medical Informatics Assoc..

[49]  Keith Hawton,et al.  Risk factors for suicide in individuals with depression: a systematic review. , 2013, Journal of affective disorders.

[50]  W. Katon Epidemiology and treatment of depression in patients with chronic medical illness. , 2011, Dialogues in clinical neuroscience.

[51]  Simon Gilbody,et al.  Should we screen for depression? , 2006, BMJ : British Medical Journal.

[52]  F. Pouwer,et al.  Type 2 diabetes mellitus as a risk factor for the onset of depression: a systematic review and meta-analysis , 2010, Diabetologia.

[53]  R. Mojtabai Clinician-Identified Depression in Community Settings: Concordance with Structured-Interview Diagnoses , 2013, Psychotherapy and Psychosomatics.

[54]  Janet B W Williams Standardizing the Hamilton Depression Rating Scale: past, present, and future , 2009, European Archives of Psychiatry and Clinical Neuroscience.

[55]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..