Ensemble-based depression detection in speech

Depression detection using speech signal is becoming an attractive topic because it is fast, convenient and non-invasive. Many researches aimed at improving depression classification performance. This study investigated application of ensemble learners in depression detection and compared three speaking styles (interview, reading and picture description) in ensembles. A speech dataset collecting from 184 subjects (92 depressed patients and 92 healthy controls) was used for these goals. The results showed that ensemble learners perform better than individual learners apparently. Interview is a more effective speaking style than reading and picture description for speech acquisition. These findings suggest us ensemble model based multi-utterance in interview is the best way to detect depression.

[1]  Nicholas B. Allen,et al.  Early prediction of major depression in adolescents using glottal wave characteristics and Teager Energy parameters , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Thomas A. Widiger,et al.  Diagnostic and Statistical Manual of Mental Disorders (DSM) , 2011 .

[3]  Nivja H. Jong,et al.  Praat script to detect syllable nuclei and measure speech rate automatically , 2009, Behavior research methods.

[4]  Fernando De la Torre,et al.  Detecting depression from facial actions and vocal prosody , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[5]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[6]  Kagan Tumer,et al.  Theoretical Foundations Of Linear And Order Statistics Combiners For Neural Pattern Classifiers , 1995 .

[7]  Zhang Jing Study on Chinese Word Sentiment Polarity Automatic Estimation , 2010 .

[8]  Alberto Suárez,et al.  Aggregation Ordering in Bagging , 2004 .

[9]  Michael Wagner,et al.  Detecting depression: A comparison between spontaneous and read speech , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  G. Dunbar,et al.  The Mini International Neuropsychiatric Interview (MINI). A short diagnostic structured interview: reliability and validity according to the CIDI , 1997, European Psychiatry.

[11]  M. Hamilton A RATING SCALE FOR DEPRESSION , 1960, Journal of neurology, neurosurgery, and psychiatry.

[12]  Luo Yuejia,et al.  Revision of the Chinese Facial Affective Picture System , 2011 .

[13]  Björn Schuller,et al.  Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.

[14]  Michael Wagner,et al.  Cross-Cultural Depression Recognition from Vocal Biomarkers , 2016, INTERSPEECH.

[15]  D. Battle Diagnostic and Statistical Manual of Mental Disorders (DSM). , 2013 .

[16]  Zhenyu Liu,et al.  Detection of depression in speech , 2015, 2015 International Conference on Affective Computing and Intelligent Interaction (ACII).

[17]  Nicholas B. Allen,et al.  Multichannel Weighted Speech Classification System for Prediction of Major Depression in Adolescents , 2013, IEEE Transactions on Biomedical Engineering.

[18]  David B. Skalak,et al.  The Sources of Increased Accuracy for Two Proposed Boosting Algorithms , 1996, AAAI 1996.

[19]  Thomas Armstrong,et al.  Eye tracking of attention in the affective disorders: a meta-analytic review and synthesis. , 2012, Clinical psychology review.

[20]  G. Yule,et al.  On the association of attributes in statistics, with examples from the material of the childhood society, &c , 1900, Proceedings of the Royal Society of London.

[21]  D. Mitchell Wilkes,et al.  Direct acoustic feature using iterative EM algorithm and spectral energy for classifying suicidal speech , 2007, INTERSPEECH.

[22]  Mykola Pechenizkiy,et al.  What's Your Current Stress Level? Detection of Stress Patterns from GSR Sensor Data , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[23]  Tamás D. Gedeon,et al.  A comparative study of different classifiers for detecting depression from spontaneous speech , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  Klaus Mathiak,et al.  Impaired pitch identification as a potential marker for depression , 2012, BMC Psychiatry.

[25]  G. Yule On the Association of Attributes in Statistics: With Illustrations from the Material of the Childhood Society, &c , 1900 .

[26]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[27]  J. Peifer,et al.  Investigating the role of glottal features in classifying clinical depression , 2003, Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No.03CH37439).

[28]  Nicholas B. Allen,et al.  Prediction of major depression in adolescents using an optimized multi-channel weighted speech classification system , 2014, Biomed. Signal Process. Control..

[29]  Olga V. Demler,et al.  Prevalence, severity, and comorbidity of 12-month DSM-IV disorders in the National Comorbidity Survey Replication. , 2005, Archives of general psychiatry.

[30]  Xindong Wu,et al.  Ensemble pruning via individual contribution ordering , 2010, KDD.

[31]  Michael Wagner,et al.  Head Pose and Movement Analysis as an Indicator of Depression , 2013, 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction.

[32]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[33]  Nicholas B. Allen,et al.  Detection of Clinical Depression in Adolescents’ Speech During Family Interactions , 2011, IEEE Transactions on Biomedical Engineering.

[34]  D. Mitchell Wilkes,et al.  Investigation of vocal jitter and glottal flow spectrum as possible cues for depression and near-term suicidal risk , 2004, IEEE Transactions on Biomedical Engineering.