Machine prediction of personality from Facebook profiles

An increasing number of Americans use social networking sites such as Facebook, but few fully appreciate the amount of information they share with the world as a result. Although studies exist on the sharing of specific types of information (photos, posts, etc.), one area that has been less explored is how Facebook profiles can share personality information in a broad, machine-readable fashion. In this study, we apply data-mining and machine learning techniques to predict users' personality traits (specifically, the traits of the Big Five personality model) using only demographic and text-based attributes extracted from their profiles. We then use these predictions to rank individuals in terms of the five traits, predicting which users will appear in the top or bottom 5% or 10% of these traits. Our results show that when using certain models, we can find the top 10% most Open individuals with nearly 75% accuracy, and across all traits and directions, we can predict the top 10% with at least 34.5% accuracy (exceeding 21.8%, which is the best accuracy when using just the best-performing profile attribute). These results have privacy implications in terms of allowing advertisers and other groups to focus on a specific subset of individuals based on their personality traits.

[1]  Alessandro Acquisti,et al.  Imagined Communities: Awareness, Information Sharing, and Privacy on the Facebook , 2006, Privacy Enhancing Technologies.

[2]  Craig Ross,et al.  Personality and motivations associated with Facebook use , 2009, Comput. Hum. Behav..

[3]  Sandra L. Calvert,et al.  College students' social networking experiences on Facebook , 2009 .

[4]  Taghi M. Khoshgoftaar,et al.  Ordering Fault-Prone Software Modules , 2003, Software Quality Journal.

[5]  Samuel D. Gosling,et al.  Personality Impressions Based on Facebook Profiles , 2007, ICWSM.

[6]  Cliff Lampe,et al.  The Benefits of Facebook "Friends: " Social Capital and College Students' Use of Online Social Network Sites , 2007, J. Comput. Mediat. Commun..

[7]  Adam N. Joinson,et al.  Looking at, looking up or keeping up with people?: motives and use of facebook , 2008, CHI.

[8]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[9]  Kerk F. Kee,et al.  Is There Social Capital in a Social Network Site?: Facebook Use and College Students’ Life Satisfaction, Trust, and Participation 1 , 2009 .

[10]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[11]  Ron Kohavi,et al.  The Power of Decision Tables , 1995, ECML.

[12]  Heather Richter Lipford,et al.  Understanding Privacy Settings in Facebook with an Audience View , 2008, UPSEC.

[13]  John Raacke,et al.  MySpace and Facebook: Applying the Uses and Gratifications Theory to Exploring Friend-Networking Sites , 2008, Cyberpsychology Behav. Soc. Netw..

[14]  Alessandro Acquisti,et al.  Information revelation and privacy in online social networks , 2005, WPES '05.

[15]  Bernhard Debatin,et al.  Facebook and Online Privacy: Attitudes, Behaviors, and Unintended Consequences , 2009, J. Comput. Mediat. Commun..

[16]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[17]  Cliff Lampe,et al.  Changes in use and perception of facebook , 2008, CSCW.

[18]  Cliff Lampe,et al.  A face(book) in the crowd: social Searching vs. social browsing , 2006, CSCW '06.

[19]  A. Tellegen,et al.  PERSONALITY PROCESSES AND INDIVIDUAL DIFFERENCES An Alternative "Description of Personality": The Big-Five Factor Structure , 2022 .

[20]  Juliane M. Stopfer,et al.  Facebook Profiles Reflect Actual Personality, Not Self-Idealization , 2010, Psychological science.