Predicting Active Users' Personality Based on Micro-Blogging Behaviors

Because of its richness and availability, micro-blogging has become an ideal platform for psychological research. In this paper, we propose predicting active users' personality traits from their micro-blogging behaviors. A total of 547 active Chinese micro-blogging users participated in the study. Their personality traits were measured with the Big Five Inventory, and digital records of their micro-blogging behaviors were collected with web crawlers. After extracting 845 micro-blogging behavioral features, we first trained Support Vector Machine (SVM) classification models to differentiate participants with high and low scores on each dimension of the Big Five Inventory. Classification accuracy ranged from 84% to 92%. We also built regression models using the PaceRegression method to predict participants' scores on each dimension of the Big Five Inventory. The Pearson correlation coefficients between predicted and actual scores ranged from 0.48 to 0.54. These results indicate that active users' personality traits can be predicted from their micro-blogging behaviors.
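The two evaluation measures reported above, classification accuracy and the Pearson correlation between predicted and actual scores, can be illustrated with a minimal sketch. It is not the study's code: the function names `accuracy` and `pearson_r`, the "high"/"low" label strings, and the sample values are all hypothetical and serve only to show how each figure is computed.

```python
from math import sqrt


def accuracy(predicted_labels, actual_labels):
    """Fraction of participants whose predicted group matches the actual group."""
    if len(predicted_labels) != len(actual_labels) or not actual_labels:
        raise ValueError("need two non-empty sequences of equal length")
    hits = sum(p == a for p, a in zip(predicted_labels, actual_labels))
    return hits / len(actual_labels)


def pearson_r(predicted, actual):
    """Pearson correlation coefficient between predicted and actual scores."""
    if len(predicted) != len(actual) or len(predicted) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_a = sum(actual) / n
    # Sum of cross-products and sums of squares around the means.
    cov = sum((p - mean_p) * (a - mean_a) for p, a in zip(predicted, actual))
    ss_p = sum((p - mean_p) ** 2 for p in predicted)
    ss_a = sum((a - mean_a) ** 2 for a in actual)
    if ss_p == 0 or ss_a == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(ss_p * ss_a)


if __name__ == "__main__":
    # Hypothetical values for one personality dimension (not data from the study).
    actual_groups = ["high", "low", "high", "low", "high"]
    predicted_groups = ["high", "low", "low", "low", "high"]
    actual_scores = [3.2, 4.1, 2.8, 3.9, 3.5]
    predicted_scores = [3.0, 3.8, 3.1, 3.6, 3.7]

    print("accuracy:", accuracy(predicted_groups, actual_groups))
    print("pearson r:", round(pearson_r(predicted_scores, actual_scores), 3))
```

In this sketch each Big Five dimension would be evaluated separately, giving one accuracy value per classifier and one correlation coefficient per regression model.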
