A Novel Bimodal Fusion-based Model for Depression Recognition

Depression is a common mental disorder which is harmful to our family, economics and society. Many people cannot receive timely mental health services, and the diagnosis process is subjective. A primary way for reducing harm is finding an objective and effective depression detection approach. Speech and video are two promising behavior indicators for depression. In this paper, we proposed a speech and video bimodal fusion model based on time-frequency analysis and convolutional neural network for this goal. For the testing of the proposed method, a speech and video dataset of 292 participants were employed for cross-validation. Compared with the single modal classification results, the classification accuracy and generalization ability of this gender-independent model are further improved, which is helpful for the identification of depression.

[1]  Bin Hu,et al.  EEG-based mild depressive detection using feature selection methods and classifiers , 2016, Comput. Methods Programs Biomed..

[2]  Mohammad H. Mahoor,et al.  Social risk and depression: Evidence from manual and automatic facial expression analysis , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[3]  HuBin,et al.  EEG-based mild depressive detection using feature selection methods and classifiers , 2016 .

[4]  C. Bai,et al.  Cytokines and serotonin transporter in patients with major depression , 2006, Progress in Neuro-Psychopharmacology and Biological Psychiatry.

[5]  Qin Dai,et al.  More excited for negative facial expressions in depression: Evidence from an event-related potential study , 2012, Clinical Neurophysiology.

[6]  M. Landau Acoustical Properties of Speech as Indicators of Depression and Suicidal Risk , 2008 .

[7]  G. Dunbar,et al.  The Mini International Neuropsychiatric Interview (MINI). A short diagnostic structured interview: reliability and validity according to the CIDI , 1997, European Psychiatry.

[8]  Abdenour Hadid,et al.  A Deep Multiscale Spatiotemporal Network for Assessing Depression From Facial Dynamics , 2022, IEEE Transactions on Affective Computing.

[9]  Thomas E Joiner,et al.  Warning signs for suicide on the Internet: a descriptive study. , 2006, Suicide & life-threatening behavior.

[10]  M. Barclay,et al.  Manic-Depressive Insanity and Paranoia , 1921, The Indian Medical Gazette.