A support vector machine model provides an accurate transcript-level-based diagnostic for major depressive disorder

Major depressive disorder (MDD) is a critical cause of morbidity and disability with an economic cost of hundreds of billions of dollars each year, necessitating more effective treatment strategies and novel approaches to translational research. A notable barrier in addressing this public health threat involves reliable identification of the disorder, as many affected individuals remain undiagnosed or misdiagnosed. An objective blood-based diagnostic test using transcript levels of a panel of markers would provide an invaluable tool for MDD as the infrastructure—including equipment, trained personnel, billing, and governmental approval—for similar tests is well established in clinics worldwide. Here we present a supervised classification model utilizing support vector machines (SVMs) for the analysis of transcriptomic data readily obtained from a peripheral blood specimen. The model was trained on data from subjects with MDD (n=32) and age- and gender-matched controls (n=32). This SVM model provides a cross-validated sensitivity and specificity of 90.6% for the diagnosis of MDD using a panel of 10 transcripts. We applied a logistic equation on the SVM model and quantified a likelihood of depression score. This score gives the probability of a MDD diagnosis and allows the tuning of specificity and sensitivity for individual patients to bring personalized medicine closer in psychiatry.

[1]  Douglas M. Hawkins,et al.  Assessing Model Fit by Cross-Validation , 2003, J. Chem. Inf. Comput. Sci..

[2]  M. Roca,et al.  Diagnosis delay in first episodes of major depression: a study of primary care patients in Spain. , 2013, Journal of affective disorders.

[3]  D. Pregibon Logistic Regression Diagnostics , 1981 .

[4]  D. Mohr,et al.  Major depressive disorder , 2016, Nature Reviews Disease Primers.

[5]  L. Horrocks,et al.  Glycerophospholipids in brain: their metabolism, incorporation into membranes, functions, and involvement in neurological disorders. , 2000, Chemistry and physics of lipids.

[6]  M S Faith,et al.  Relationships between obesity and DSM-IV major depressive disorder, suicide ideation, and suicide attempts: results from a general population study. , 2000, American journal of public health.

[7]  Olga V. Demler,et al.  Lifetime prevalence and age-of-onset distributions of DSM-IV disorders in the National Comorbidity Survey Replication. , 2005, Archives of general psychiatry.

[8]  D. Sheehan,et al.  The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. , 1998, The Journal of clinical psychiatry.

[9]  J. O'brien,et al.  Tay-Sachs Disease: Generalized Absence of a Beta-D-N-Acetylhexosaminidase Component , 1969, Science.

[10]  Olga V. Demler,et al.  The epidemiology of major depressive disorder: results from the National Comorbidity Survey Replication (NCS-R). , 2003, JAMA.

[11]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[12]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[13]  D. Pine,et al.  The risk for early-adulthood anxiety and depressive disorders in adolescents with anxiety and depressive disorders. , 1998, Archives of general psychiatry.

[14]  M. Kunik,et al.  Racial Disparities in Diagnosis and Treatment of Depression: A Literature Review , 2007, Psychiatric Quarterly.

[15]  David C. Mohr,et al.  Blood transcriptomic biomarkers in adult primary care patients with major depressive disorder undergoing cognitive behavioral therapy , 2014, Translational Psychiatry.

[16]  W Gardner,et al.  Discovery of blood transcriptomic markers for depression in animal models and pilot validation in subjects with early-onset major depression , 2012, Translational Psychiatry.

[17]  J. Kornhuber,et al.  Brain membrane lipids in major depression and anxiety disorders. , 2015, Biochimica et biophysica acta.

[18]  G. Freedman,et al.  Burden of Depressive Disorders by Country, Sex, Age, and Year: Findings from the Global Burden of Disease Study 2010 , 2013, PLoS medicine.

[19]  E. Redei,et al.  Blood transcriptomic markers for major depression: from animal models to clinical settings , 2015, Annals of the New York Academy of Sciences.

[20]  B. Gaynes,et al.  The Depression Treatment Cascade in Primary Care: A Public Health Perspective , 2012, Current Psychiatry Reports.

[21]  Marie-Hélène Savard,et al.  Bridging the communication gap between physicians and their patients with physical symptoms of depression. , 2004, Primary care companion to the Journal of clinical psychiatry.

[22]  S. Cichon,et al.  A genome-wide association study implicates diacylglycerol kinase eta (DGKH) and several other genes in the etiology of bipolar disorder , 2008, Molecular Psychiatry.

[23]  M. Hamilton A RATING SCALE FOR DEPRESSION , 1960, Journal of neurology, neurosurgery, and psychiatry.

[24]  Mehmet Fatih Akay,et al.  Support vector machines combined with feature selection for breast cancer diagnosis , 2009, Expert Syst. Appl..

[25]  H. Wittchen,et al.  Diagnosing major depression in the elderly: evidence for response bias in standardized diagnostic interviews? , 1994, Journal of psychiatric research.

[26]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[27]  Alexander J. Smola,et al.  Advances in Large Margin Classifiers , 2000 .

[28]  N. Baumann,et al.  Psychiatric manifestations revealing inborn errors of metabolism in adolescents and adults , 2007, Journal of Inherited Metabolic Disease.

[29]  Xiao-Li Meng,et al.  Disparity in depression treatment among racial and ethnic minority populations in the United States. , 2008, Psychiatric services.

[30]  E. Fried,et al.  Depression is not a consistent syndrome: An investigation of unique symptom patterns in the STAR*D study. , 2015, Journal of affective disorders.

[31]  S Greenfield,et al.  Detection of depressive disorder for patients receiving prepaid or fee-for-service care. Results from the Medical Outcomes Study. , 1989, JAMA.

[32]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[33]  M. Steger,et al.  Posttraumatic stress disorder, guilt, depression, and meaning in life among military veterans. , 2009, Journal of traumatic stress.

[34]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[35]  J A Bilello,et al.  Assessment of a multi-assay, serum-based biological diagnostic test for major depressive disorder: a Pilot and Replication Study , 2013, Molecular Psychiatry.

[36]  Avi Ma'ayan,et al.  Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool , 2013, BMC Bioinformatics.

[37]  Michael King,et al.  Stigma: the feelings and experiences of 46 people with mental illness , 2004, British Journal of Psychiatry.

[38]  N. Ryan,et al.  Schedule for Affective Disorders and Schizophrenia for School-Age Children-Present and Lifetime Version (K-SADS-PL): initial reliability and validity data. , 1997, Journal of the American Academy of Child and Adolescent Psychiatry.

[39]  J. Xuan,et al.  Classification algorithms for phenotype prediction in genomics and proteomics. , 2008, Frontiers in bioscience : a journal and virtual library.

[40]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.