A Novel Decision Tree for Depression Recognition in Speech

Depression is a common mental disorder worldwide which causes a range of serious outcomes. The diagnosis of depression relies on patient-reported scales and psychiatrist interview which may lead to subjective bias. In recent years, more and more researchers are devoted to depression recognition in speech , which may be an effective and objective indicator. This study proposes a new speech segment fusion method based on decision tree to improve the depression recognition accuracy and conducts a validation on a sample of 52 subjects (23 depressed patients and 29 healthy controls). The recognition accuracy are 75.8% and 68.5% for male and female respectively on gender-dependent models. It can be concluded from the data that the proposed decision tree model can improve the depression classification performance.

[1]  Nicholas B. Allen,et al.  Detection of Clinical Depression in Adolescents’ Speech During Family Interactions , 2011, IEEE Transactions on Biomedical Engineering.

[2]  D. Mitchell Wilkes,et al.  Acoustical properties of speech as indicators of depression and suicidal risk , 2000, IEEE Transactions on Biomedical Engineering.

[3]  G. Dunbar,et al.  The Mini International Neuropsychiatric Interview (MINI). A short diagnostic structured interview: reliability and validity according to the CIDI , 1997, European Psychiatry.

[4]  J. Olesen,et al.  The economic cost of brain disorders in Europe , 2012, European journal of neurology.

[5]  Thomas E Joiner,et al.  Warning signs for suicide on the Internet: a descriptive study. , 2006, Suicide & life-threatening behavior.

[6]  Gábor Kiss,et al.  Investigation of cross-lingual depression prediction possibilities based on speech processing , 2017, 2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom).

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Mohammad H. Mahoor,et al.  Social risk and depression: Evidence from manual and automatic facial expression analysis , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[9]  Gábor Kiss,et al.  Comparison of read and spontaneous speech in case of automatic detection of depression , 2017, 2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom).

[10]  Lang He,et al.  Automated depression analysis using convolutional neural networks from speech , 2018, J. Biomed. Informatics.

[11]  Dongmei Jiang,et al.  Multimodal Measurement of Depression Using Deep Learning Models , 2017, AVEC@ACM Multimedia.

[12]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[13]  J. A. Marks The North Wind and the Sun , 2016 .

[14]  Yonghong Xie,et al.  An Improved Multi-label Relief Feature Selection Algorithm for Unbalanced Datasets , 2017 .

[15]  Anna Esposito,et al.  Automatic Detection of Depressive States from Speech , 2018, Multidisciplinary Approaches to Neural Computing.

[16]  Meysam Asgari,et al.  Improvements to harmonic model for extracting better speech features in clinical applications , 2018, Comput. Speech Lang..

[17]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[18]  Dongmei Jiang,et al.  Decision Tree Based Depression Classification from Audio Video and Language Information , 2016, AVEC@ACM Multimedia.

[19]  Anna Esposito,et al.  Depression Speaks: Automatic Discrimination between Depressed and Non-Depressed Speakers Based on Nonverbal Speech Features , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  M. Hamilton A RATING SCALE FOR DEPRESSION , 1960, Journal of neurology, neurosurgery, and psychiatry.

[21]  Luo Yuejia,et al.  Revision of the Chinese Facial Affective Picture System , 2011 .

[22]  Zhenyu Liu,et al.  Speech pause time: A potential biomarker for depression detection , 2017, 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[23]  R. Spitzer,et al.  The PHQ-9: validity of a brief depression severity measure. , 2001, Journal of general internal medicine.

[24]  Chao Zhang,et al.  Research on depression detection algorithm combine acoustic rhythm with sparse face recognition , 2017, Cluster Computing.

[25]  M. Landau Acoustical Properties of Speech as Indicators of Depression and Suicidal Risk , 2008 .

[26]  Elizabeth Shriberg,et al.  Effects of feature type, learning algorithm and speaking style for depression detection from speech , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[27]  Björn Schuller,et al.  Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.

[28]  Wonyong Sung,et al.  A statistical model-based voice activity detection , 1999, IEEE Signal Processing Letters.

[29]  J. Rabe-Jabłońska,et al.  [Affective disorders in the fourth edition of the classification of mental disorders prepared by the American Psychiatric Association -- diagnostic and statistical manual of mental disorders]. , 1993, Psychiatria polska.

[30]  D. Kupfer,et al.  Interval between onset of sleep and rapid-eye-movement sleep as an indicator of depression. , 1972, Lancet.

[31]  Nicholas B. Allen,et al.  Multichannel Weighted Speech Classification System for Prediction of Major Depression in Adolescents , 2013, IEEE Transactions on Biomedical Engineering.