Facial geometry and speech analysis for depression detection

Depression is one of the most prevalent mental disorders, burdening many people world-wide. A system with the potential of serving as a decision support system is proposed, based on novel features extracted from facial expression geometry and speech, by interpreting non-verbal manifestations of depression. The proposed system has been tested both in gender independent and gender based modes, and with different fusion methods. The algorithms were evaluated for several combinations of parameters and classification schemes, on the dataset provided by the Audio/Visual Emotion Challenge of 2013 and 2014. The proposed framework achieved a precision of 94.8% for detecting persons achieving high scores on a self-report scale of depressive symptomatology. Optimal system performance was obtained using a nearest neighbour classifier on the decision fusion of geometrical features in the gender independent mode, and audio based features in the gender based mode; single visual and audio decisions were combined with the OR binary operation.

[1]  P. Ekman,et al.  Facial action coding system , 2019 .

[2]  Fan Yang,et al.  Depression Assessment by Fusing High and Low Level Features from Audio, Video, and Text , 2016, AVEC@ACM Multimedia.

[3]  Peter Robinson,et al.  OpenFace: An open source facial behavior analysis toolkit , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[4]  Fan Yang,et al.  Video-based depression detection using local Curvelet binary patterns in pairwise orthogonal planes , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[5]  Jeffrey M Girard,et al.  Automated Audiovisual Depression Analysis. , 2015, Current opinion in psychology.

[6]  Michael Wagner,et al.  Eye movement analysis for depression detection , 2013, 2013 IEEE International Conference on Image Processing.

[7]  Stefan Winkler,et al.  Group happiness assessment using geometric features and dataset balancing , 2016, ICMI.

[8]  H. Ellgring Nonverbal communication in depression , 1989 .

[9]  A. Beck,et al.  Comparison of Beck Depression Inventories -IA and -II in psychiatric outpatients. , 1996, Journal of personality assessment.

[10]  John Kane,et al.  COVAREP — A collaborative voice analysis repository for speech technologies , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  J. K. Mandal,et al.  Activity recognition system using inbuilt sensors of smart mobile phone and minimizing feature vectors , 2015, Microsystem Technologies.

[12]  Björn W. Schuller,et al.  AVEC 2014: 3D Dimensional Affect and Depression Recognition Challenge , 2014, AVEC '14.

[13]  David DeVault,et al.  The Distress Analysis Interview Corpus of human and computer interviews , 2014, LREC.

[14]  Tiago H. Falk,et al.  Model Fusion for Multimodal Depression Classification and Level Detection , 2014, AVEC '14.

[15]  Fabien Ringeval,et al.  AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge , 2016, AVEC@ACM Multimedia.