Cross-Domain Depression Detection via Harvesting Social Media

Depression detection is a significant issue for human well-being. In previous studies, online detection has proven effective in Twitter, enabling proactive care for depressed users. Owing to cultural differences, replicating the method to other social media platforms, such as Chinese Weibo, however, might lead to poor performance because of insufficient available labeled (self-reported depression) data for model training. In this paper, we study an interesting but challenging problem of enhancing detection in a certain target domain (e.g. Weibo) with ample Twitter data as the source domain. We first systematically analyze the depression-related feature patterns across domains and summarize two major detection challenges, namely isomerism and divergency. We further propose a cross-domain Deep Neural Network model with Feature Adaptive Transformation & Combination strategy (DNN-FATC) that transfers the relevant information across heterogeneous domains. Experiments demonstrate improved performance compared to existing heterogeneous transfer methods or training directly in the target domain (over 3.4% improvement in F1), indicating the potential of our model to enable depression detection via social media for more countries with different cultural settings.

[1]  A. Kleinman,et al.  Culture and depression. , 1986, The New England journal of medicine.

[2]  Leonardo Max Batista Claudino,et al.  Beyond LDA: Exploring Supervised Topic Modeling for Depression-Related Language in Twitter , 2015, CLPsych@HLT-NAACL.

[3]  Trevor Darrell,et al.  Efficient Learning of Domain-invariant Image Representations , 2013, ICLR.

[4]  Minsu Park,et al.  Depressive Moods of Users Portrayed in Twitter , 2012 .

[5]  Trevor Darrell,et al.  What you saw is not what you get: Domain adaptation using asymmetric kernel transforms , 2011, CVPR 2011.

[6]  Qingpeng Zhang,et al.  Understanding Online Health Groups for Depression: Social Network and Linguistic Perspectives , 2016, Journal of medical Internet research.

[7]  Ivor W. Tsang,et al.  Learning With Augmented Features for Supervised and Semi-Supervised Heterogeneous Domain Adaptation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  He Li,et al.  Developing Simplified Chinese Psychological Linguistic Analysis Dictionary for Microblog , 2013, Brain and Health Informatics.

[9]  ScienceOpen Admin,et al.  Chinese General Practice , 2017 .

[10]  Lianhong Cai,et al.  Interpretable aesthetic features for affective image classification , 2013, 2013 IEEE International Conference on Image Processing.

[11]  Shigenobu Kobayashi,et al.  The aim and method of the color image scale , 2009 .

[12]  Tat-Seng Chua,et al.  Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution , 2017, IJCAI.

[13]  David W. McDonald,et al.  Perception Differences between the Depressed and Non-Depressed Users in Twitter , 2013, ICWSM.

[14]  Michael A. Jensen,et al.  IEEE Transactions on Antennas and Propagation Announces Special Issue on Wireless Communications With the phenomenal growth in wireless communications technology, researchers are facing significant challenges in realizing the much anticipated , 2006 .

[15]  Cynthia A. Brewer Color Research Applications in Mapping and Visualization , 2004, Color Imaging Conference.

[16]  No Value,et al.  IEEE International Conference on Image Processing , 2003 .

[17]  Tingshao Zhu,et al.  Predicting psychological features based on web behavioral data: Mental health status and subjective well-being , 2015 .

[18]  Herbert Süße,et al.  The Method of Normalization to Determine Invariants , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Greg Wilkinson The British Journal of Psychiatry: Achieving Excellence , 1994 .

[20]  Li Sun,et al.  An Improved Model for Depression Detection in Micro-Blog Social Network , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[21]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[22]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[23]  Antal van den Bosch,et al.  Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics , 2007 .

[24]  Rupak Chakravarty,et al.  New England Journal of Medicine - A Bibliometric Study , 2014, BIOINFORMATICS 2014.

[25]  T. Kailath The Divergence and Bhattacharyya Distance Measures in Signal Selection , 1967 .