Toward personalizing treatment for depression: predicting diagnosis and severity

Objective Depression is a prevalent disorder difficult to diagnose and treat. In particular, depressed patients exhibit largely unpredictable responses to treatment. Toward the goal of personalizing treatment for depression, we develop and evaluate computational models that use electronic health record (EHR) data for predicting the diagnosis and severity of depression, and response to treatment. Materials and methods We develop regression-based models for predicting depression, its severity, and response to treatment from EHR data, using structured diagnosis and medication codes as well as free-text clinical reports. We used two datasets: 35 000 patients (5000 depressed) from the Palo Alto Medical Foundation and 5651 patients treated for depression from the Group Health Research Institute. Results Our models are able to predict a future diagnosis of depression up to 12 months in advance (area under the receiver operating characteristic curve (AUC) 0.70–0.80). We can differentiate patients with severe baseline depression from those with minimal or mild baseline depression (AUC 0.72). Baseline depression severity was the strongest predictor of treatment response for medication and psychotherapy. Conclusions It is possible to use EHR data to predict a diagnosis of depression up to 12 months in advance and to differentiate between extreme baseline levels of depression. The models use commonly available data on diagnosis, medication, and clinical progress notes, making them easily portable. The ability to automatically determine severity can facilitate assembly of large patient cohorts with similar severity from multiple sites, which may enable elucidation of the moderators of treatment response in the future.

[1]  Jennifer Tjia,et al.  MINI-SENTINEL SYSTEMATIC EVALUATION OF HEALTH OUTCOME OF INTEREST DEFINITIONS FOR STUDIES USING ADMINISTRATIVE DATA CONGESTIVE HEART FAILURE , 2011 .

[2]  Lin Chen,et al.  Importance of multi-modal approaches to effectively identify cataract cases from electronic health records , 2012, J. Am. Medical Informatics Assoc..

[3]  C Kooperberg,et al.  The use of phenome‐wide association studies (PheWAS) for exploration of novel genotype‐phenotype relationships and pleiotropy discovery , 2011, Genetic epidemiology.

[4]  Irwin Nazareth,et al.  Does recognition of depression in primary care affect outcome? The PREDICT-NL study. , 2012, Family practice.

[5]  Rong Xu,et al.  A Comprehensive Analysis of Five Million UMLS Metathesaurus Terms Using Eighteen Million MEDLINE Citations. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[6]  Chun-Nan Hsu,et al.  Implications of the Dirichlet Assumption for Discretization of Continuous Variables in Naive Bayesian Classifiers , 2004, Machine Learning.

[7]  Olivier Bodenreider,et al.  Exploring semantic groups through visual approaches , 2003, J. Biomed. Informatics.

[8]  Robert A Cain,et al.  Navigating the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) study: practical outcomes and implications for depression treatment in primary care. , 2007, Primary care.

[9]  R. Valuck,et al.  Antidepressant discontinuation and risk of suicide attempt: a retrospective, nested case-control study. , 2009, The Journal of clinical psychiatry.

[10]  S. Hebbring The challenges, advantages and future of phenome-wide association studies , 2014, Immunology.

[11]  L. Sharp,et al.  Screening for depression across the lifespan: a review of measures for use in primary care settings. , 2002, American family physician.

[12]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[13]  J. Scott,et al.  Can We Predict the Persistence of Depression? , 1992, British Journal of Psychiatry.

[14]  D. Kupfer,et al.  Acute and longer-term outcomes in depressed outpatients requiring one or several treatment steps: a STAR*D report. , 2006, The American journal of psychiatry.

[15]  I. Kohane,et al.  Electronic medical records for discovery research in rheumatoid arthritis , 2010, Arthritis care & research.

[16]  Gregory E Simon,et al.  Personalized medicine for depression: can we match patients with treatments? , 2010, The American journal of psychiatry.

[17]  Robyn M Leventhal,et al.  Severity of Depression and Response to Antidepressants and Placebo: An Analysis of the Food and Drug Administration Database , 2002, Journal of clinical psychopharmacology.

[18]  B. Lebowitz,et al.  Evaluation of outcomes with citalopram for depression using measurement-based care in STAR*D: implications for clinical practice. , 2006, The American journal of psychiatry.

[19]  Pedro J. Caraballo,et al.  Impact of data fragmentation across healthcare centers on the accuracy of a high-throughput clinical phenotyping algorithm for specifying subjects with type 2 diabetes mellitus , 2012, J. Am. Medical Informatics Assoc..

[20]  Ron Kohavi,et al.  Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[21]  J. Denny,et al.  Naïve Electronic Health Record phenotype identification for Rheumatoid arthritis. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[22]  A. Mitchell,et al.  Clinical diagnosis of depression in primary care: a meta-analysis , 2009, The Lancet.

[23]  Wendy W. Chapman,et al.  ConText: An algorithm for determining negation, experiencer, and temporal status from clinical reports , 2009, J. Biomed. Informatics.

[24]  Wilson D. Pace,et al.  Enhancing Electronic Health Record Measurement of Depression Severity and Suicide Ideation: A Distributed Ambulatory Research in Therapeutics Network (DARTNet) Study , 2012, The Journal of the American Board of Family Medicine.

[25]  R. DeRubeis,et al.  Antidepressant drug effects and depression severity: a patient-level meta-analysis. , 2010, JAMA.

[26]  Cui Tao,et al.  Unified Medical Language System term occurrences in clinical notes: a large-scale corpus analysis , 2012, J. Am. Medical Informatics Assoc..

[27]  Christopher G Chute,et al.  Analyzing the heterogeneity and complexity of Electronic Health Record oriented phenotyping algorithms. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[28]  N. Shah,et al.  Pharmacovigilance Using Clinical Notes , 2013, Clinical pharmacology and therapeutics.

[29]  Suzette J. Bielinski,et al.  Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study , 2012, J. Am. Medical Informatics Assoc..

[30]  B. Grant,et al.  Epidemiology of major depressive disorder: results from the National Epidemiologic Survey on Alcoholism and Related Conditions. , 2005, Archives of general psychiatry.

[31]  J. Chao,et al.  Increased Risks of Developing Anxiety and Depression in Young Patients With Crohn's Disease , 2011, The American Journal of Gastroenterology.

[32]  Mark Olfson,et al.  A systematic review of validated methods for identifying depression using administrative data , 2012, Pharmacoepidemiology and drug safety.

[33]  P. Nutting,et al.  Barriers to initiating depression treatment in primary care practice , 2002, Journal of General Internal Medicine.

[34]  K. Rost,et al.  The deliberate misdiagnosis of major depression in primary care. , 1994, Archives of family medicine.