Deep Multi-task Learning for Depression Detection and Prediction in Longitudinal Data

Depression is among the most prevalent mental disorders, affecting millions of people of all ages globally. Machine learning techniques have shown effective in enabling automated detection and prediction of depression for early intervention and treatment. However, they are challenged by the relative scarcity of instances of depression in the data. In this work we introduce a novel deep multi-task recurrent neural network to tackle this challenge, in which depression classification is jointly optimized with two auxiliary tasks, namely one-class metric learning and anomaly ranking. The auxiliary tasks introduce an inductive bias that improves the classification model's generalizability on small depression samples. Further, unlike existing studies that focus on learning depression signs from static data without considering temporal dynamics, we focus on longitudinal data because i) temporal changes in personal development and family environment can provide critical cues for psychiatric disorders and ii) it may enable us to predict depression before the illness actually occurs. Extensive experimental results on child depression data show that our model is able to i) achieve nearly perfect performance in depression detection and ii) accurately predict depression 2-4 years before the clinical diagnosis, substantially outperforming seven competing methods.

[1]  Azher Uddin,et al.  Depression Level Prediction Using Deep Spatiotemporal Features and Multilayer Bi-LTSM , 2022, IEEE Transactions on Affective Computing.

[2]  Guodong Guo,et al.  Visually Interpretable Representation Learning for Depression Recognition from Facial Images , 2020, IEEE Transactions on Affective Computing.

[3]  Jie Zhou,et al.  SenseMood: Depression Detection on Social Media , 2020, ICMR.

[4]  Stevie Chancellor,et al.  Methods in predictive techniques for mental health status on social media: a critical review , 2020, npj Digital Medicine.

[5]  Elton H. Matsushima,et al.  See and Read: Detecting Depression Symptoms in Higher Education Students Using Multimodal Social Media Data , 2019, ICWSM.

[6]  Koichi Shinoda,et al.  Multimodal Fusion of BERT-CNN and Gated CNN Representations for Depression Detection , 2019, AVEC@MM.

[7]  Anton van den Hengel,et al.  Deep Anomaly Detection with Deviation Networks , 2019, KDD.

[8]  U. Rajendra Acharya,et al.  Automated Depression Detection Using Deep Representation and Sequence Learning with EEG Signals , 2019, Journal of Medical Systems.

[9]  B. Penninx,et al.  Longitudinal Association Between Depression and Inflammatory Markers: Results From the Netherlands Study of Depression and Anxiety , 2019, Biological Psychiatry.

[10]  P. Cuijpers,et al.  Factors associated with depression over time in head and neck cancer patients: A systematic review , 2019, Psycho-oncology.

[11]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[12]  Seetha Hari,et al.  Learning From Imbalanced Data , 2019, Advances in Computer and Electrical Engineering.

[13]  Guodong Guo,et al.  Automated Depression Diagnosis Based on Deep Networks to Encode Facial Appearance and Dynamics , 2018, IEEE Transactions on Affective Computing.

[14]  Huanbo Luan,et al.  Cross-Domain Depression Detection via Harvesting Social Media , 2018, IJCAI.

[15]  U. Rajendra Acharya,et al.  Automated EEG-based screening of depression using deep convolutional neural network , 2018, Comput. Methods Programs Biomed..

[16]  Diana Inkpen,et al.  Deep Learning for Depression Detection of Twitter Users , 2018, CLPsych@NAACL-HTL.

[17]  Z. Rosberger,et al.  A screening algorithm for early detection of major depressive disorder in head and neck cancer patients post‐treatment: Longitudinal study , 2018, Psycho-oncology.

[18]  Alexander Russell,et al.  Joint Modeling of Heterogeneous Sensing Data for Depression Assessment via Multi-task Learning , 2018, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[19]  Bin Hu,et al.  A Pervasive Approach to EEG-Based Depression Detection , 2018, Complex..

[20]  Qiang Yang,et al.  An Overview of Multi-task Learning , 2018 .

[21]  Christian Poellabauer,et al.  Topic Modeling Based Multi-modal Depression Detection , 2017, AVEC@ACM Multimedia.

[22]  Dongmei Jiang,et al.  Multimodal Measurement of Depression Using Deep Learning Models , 2017, AVEC@ACM Multimedia.

[23]  Tat-Seng Chua,et al.  Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution , 2017, IJCAI.

[24]  Yu Zhang,et al.  A Survey on Multi-Task Learning , 2017, IEEE Transactions on Knowledge and Data Engineering.

[25]  Sebastian Ruder,et al.  An Overview of Multi-Task Learning in Deep Neural Networks , 2017, ArXiv.

[26]  Panayiotis G. Georgiou,et al.  Multimodal and Multiresolution Depression Detection from Speech and Facial Landmark Features , 2016, AVEC@ACM Multimedia.

[27]  Yunhong Wang,et al.  DepAudioNet: An Efficient Deep Model for Audio based Depression Classification , 2016, AVEC@ACM Multimedia.

[28]  Ya Li,et al.  Multi task sequence learning for depression scale prediction from video , 2015, 2015 International Conference on Affective Computing and Intelligent Interaction (ACII).

[29]  Michael Wagner,et al.  Eye movement analysis for depression detection , 2013, 2013 IEEE International Conference on Image Processing.

[30]  Li Sun,et al.  A Depression Detection Model Based on Sentiment Analysis in Micro-blog Social Network , 2013, PAKDD Workshops.

[31]  Jeffrey F. Cohn,et al.  Detecting Depression Severity from Vocal Prosody , 2013, IEEE Transactions on Affective Computing.

[32]  R. Kessler,et al.  Prevalence, persistence, and sociodemographic correlates of DSM-IV disorders in the National Comorbidity Survey Replication Adolescent Supplement. , 2012, Archives of general psychiatry.

[33]  Chandan Srivastava,et al.  Support Vector Data Description , 2011 .

[34]  Nicholas B. Allen,et al.  Detection of Clinical Depression in Adolescents’ Speech During Family Interactions , 2011, IEEE Transactions on Biomedical Engineering.

[35]  Nicholas B. Allen,et al.  Influence of acoustic low-level descriptors in the detection of clinical depression in adolescents , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[36]  Fernando De la Torre,et al.  Detecting depression from facial actions and vocal prosody , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[37]  Xin Feng,et al.  Longitudinal associations between emotion regulation and depression in preadolescent girls: moderation by the caregiving environment. , 2009, Developmental psychology.

[38]  P. Jensen,et al.  Improving recognition of adolescent depression in primary care. , 2006, Archives of pediatrics & adolescent medicine.

[39]  Yann LeCun,et al.  Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[40]  D. Berthelsen,et al.  Growing up in Australia: the longitudinal study of Australian children (LSAC) , 2013 .

[41]  J. Rushton,et al.  Epidemiology of depressive symptoms in the National Longitudinal Study of Adolescent Health. , 2002, Journal of the American Academy of Child and Adolescent Psychiatry.

[42]  T. Suslow,et al.  Detection of Facial Expressions of Emotions in Depression , 2001, Perceptual and motor skills.

[43]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.