Detection and Analysis of Self-Disclosure in Online News Commentaries

Online users engage in self-disclosure - revealing personal information to others - in pursuit of social rewards. However, there are associated costs of disclosure to users' privacy. User profiling techniques support the use of contributed content for a number of purposes, e.g., micro-targeting advertisements. In this paper, we study self-disclosure as it occurs in newspaper comment forums. We explore a longitudinal dataset of about 60,000 comments on 2202 news articles from four major English news websites. We start with detection of language indicative of various types of self-disclosure, leveraging both syntactic and semantic information present in texts. Specifically, we use dependency parsing for subject, verb, and object extraction from sentences, in conjunction with named entity recognition to extract linguistic indicators of self-disclosure. We then use these indicators to examine the effects of anonymity and topic of discussion on self-disclosure. We find that anonymous users are more likely to self-disclose than identifiable users, and that self-disclosure varies across topics of discussion. Finally, we discuss the implications of our findings for user privacy.

[1]  Peter Buxmann,et al.  Understanding Self-Disclosure on Social Networking Sites - A Literature Review , 2017, AMCIS.

[2]  Dhaval Patel,et al.  TiDE: Template-Independent Discourse Data Extraction , 2015, DaWaK.

[3]  Michail Tsikerdekis,et al.  The effects of perceived anonymity and anonymity states on conformity and groupthink in online communities: A Wikipedia study , 2013, J. Assoc. Inf. Sci. Technol..

[4]  W. Kruskal,et al.  Use of Ranks in One-Criterion Variance Analysis , 1952 .

[5]  A. Joinson Self‐disclosure in computer‐mediated communication: The role of self‐awareness and visual anonymity , 2001 .

[6]  Z. Rubin Disclosing oneself to a stranger: Reciprocity and its limits , 1975 .

[7]  Erin E. Hollenbaugh,et al.  The Effects of Anonymity on Self-Disclosure in Blogs: An Application of the Online Disinhibition Effect , 2013, J. Comput. Mediat. Commun..

[8]  Eduard Hovy,et al.  Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text , 2006 .

[9]  Yoon Hyung Choi,et al.  Self‐Disclosure Characteristics and Motivations in Social Media: Extending the Functional Model to Multiple Social Network Sites , 2015 .

[10]  Xiao Ma,et al.  Self-Disclosure and Perceived Trustworthiness of Airbnb Host Profiles , 2017, CSCW.

[11]  Natalya N. Bazarova,et al.  Self‐Disclosure in Social Media: Extending the Functional Approach to Disclosure Motivations and Characteristics on Social Network Sites , 2014 .

[12]  Alice H. Oh,et al.  Self-Disclosure and Relationship Strength in Twitter Conversations , 2012, ACL.

[13]  Rachel Greenstadt,et al.  Privacy Detective: Detecting Private Information and Collective Privacy Behavior in a Large Social Network , 2014, WPES.

[14]  Arthur D. Santana Virtuous or Vitriolic , 2014 .

[15]  J. Suler The Online Disinhibition Effect , 2004, Cyberpsychology, Behavior, and Social Networking.

[16]  Alice H. Oh,et al.  Self-disclosure topic model for classifying and analyzing Twitter conversations , 2014, EMNLP.

[17]  Munmun De Choudhury,et al.  Mental Health Discourse on reddit: Self-Disclosure, Social Support, and Anonymity , 2014, ICWSM.

[18]  Philipp K. Masur,et al.  Disclosure Management on Social Network Sites: Individual Privacy Perceptions and User-Directed Privacy Strategies , 2016 .

[19]  Hua Qian,et al.  Anonymity and Self-Disclosure on Weblogs , 2007, J. Comput. Mediat. Commun..

[20]  Robert E. Kraut,et al.  Modeling Self-Disclosure in Social Networking Sites , 2016, CSCW.

[21]  Alex Leavitt,et al.  "This is a Throwaway Account": Temporary Technical Identities and Perceptions of Anonymity in a Massive Online Community , 2015, CSCW.

[22]  Anna Cinzia Squicciarini,et al.  A hybrid epidemic model for deindividuation and antinormative behavior in online social networks , 2016, Social Network Analysis and Mining.

[23]  P. Cozby Self-disclosure: a literature review. , 1973, Psychological bulletin.

[24]  Michael Röder,et al.  Exploring the Space of Topic Coherence Measures , 2015, WSDM.

[25]  Xi Chen,et al.  How Anonymity Influence Self-disclosure Tendency on Sina Weibo: An Empirical Study , 2016 .

[26]  Melanie Nguyen,et al.  Comparing Online and Offline Self-Disclosure: A Systematic Review , 2012, Cyberpsychology Behav. Soc. Netw..

[27]  Xiao Ma,et al.  Anonymity, Intimacy and Self-Disclosure in Social Media , 2016, CHI.

[28]  K. Gwet Kappa Statistic is not Satisfactory for Assessing the Extent of Agreement Between Raters , 2002 .

[29]  Janyce Wiebe Identifying Subjective Characters in Narrative , 1990, COLING.

[30]  Erkki Sutinen,et al.  Are They Different? Affect, Feeling, Emotion, Sentiment, and Opinion Detection in Text , 2014, IEEE Transactions on Affective Computing.

[31]  Daniel M. Oppenheimer,et al.  Instructional Manipulation Checks: Detecting Satisficing to Increase Statistical Power , 2009 .

[32]  Azy Barak,et al.  Degree and Reciprocity of Self-Disclosure in Online Forums , 2007, Cyberpsychology Behav. Soc. Netw..

[33]  Adam N. Joinson,et al.  Linguistic Markers of Secrets and Sensitive Self-Disclosure in Twitter , 2012, 2012 45th Hawaii International Conference on System Sciences.