Predicting self-monitoring skills using textual posts on Facebook

The popularity of the social networking site Facebook (FB) has grown at an unprecedented rate over the past five years. This study investigates whether posts on FB can be used to predict users' psychological traits, such as self-monitoring (SM) skill, which is thought to be linked to users' expressive behavior in the online environment. We present a model to evaluate the relationship between the posts and SM skills. The aim of this study is twofold: first, to evaluate the quality of responses to Snyder's Self-Monitoring Questionnaire (1974) collected via the Internet; and second, to explore the textual features of the posts in different SM-level groups. Predictions based on the posts reached approximately 60% accuracy when compared with the classification made by Snyder's SM scale. The variable "family" was found to be the most significant predictor in the structured textual analysis via Linguistic Inquiry and Word Count (LIWC). Emoticons and Internet slang were extracted as the most robust classifiers in the unstructured textual analysis. We conclude that textual posts on the FB Wall can partially predict users' SM skills. We also recommend that researchers always check the validity of Internet data using the methodology presented here, to ensure the data are valid before being processed.

[1]  Qiwei He,et al.  Screening for posttraumatic stress disorder using verbal features in self narratives: A text mining approach , 2012, Psychiatry Research.

[2]  Marilyn A. Walker,et al.  Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text , 2007, J. Artif. Intell. Res..

[3]  Tom Buchanan,et al.  Online assessment: Desirable or dangerous? , 2002 .

[4]  T. Graepel,et al.  Private traits and attributes are predictable from digital records of human behavior , 2013, Proceedings of the National Academy of Sciences.

[5]  M. Snyder Self-monitoring of expressive behavior. , 1974 .

[6]  Michal Kosinski,et al.  Mining Facebook Data for Predictive Personality Modeling , 2013, Proceedings of the International AAAI Conference on Web and Social Media.

[7]  J. M. Digman PERSONALITY STRUCTURE: EMERGENCE OF THE FIVE-FACTOR MODEL , 1990 .

[8]  Tapio Elomaa The Biases of Decision Tree Pruning Strategies , 1999, IDA.

[9]  Jenny Rosenberg,et al.  Online Impression Management: Personality Traits and Concerns for Secondary Goals as Predictors of Self-Presentation Tactics on Facebook , 2011, J. Comput. Mediat. Commun..

[10]  O. John The "Big Five" factor taxonomy: Dimensions of personality in the natural language and in questionnaires. , 1990 .

[11]  Bernard P. Veldkamp,et al.  Classifying unstructured textual data using the Product Score Model: an alternative text mining algorithm , 2012 .

[12]  Helene Fowkes,et al.  A method based on the chi-square test for document classification , 2001, SIGIR '01.

[13]  J. Pennebaker,et al.  LEXICAL PREDICTORS OF PERSONALITY TYPE , 2005 .

[14]  Fabio Pianesi,et al.  Workshop on Computational Personality Recognition: Shared Task , 2013, Proceedings of the International AAAI Conference on Web and Social Media.

[15]  Cees A. W. Glas Modification Indices for the 2PL and the Nominal Response Model. Research Report 98-04. , 1998 .

[16]  Tomas Chamorro-Premuzic,et al.  Handbook of individual differences. , 2011 .

[17]  J. Singer,et al.  Cognitive, social, and physiological determinants of emotional state. , 1962, Psychological review.

[18]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[19]  Benjamin Littenberg,et al.  Relationship Between Self‐report and an Objective Measure of Television‐viewing Time in Adults , 2010, Obesity.

[20]  Fritz Drasgow,et al.  Psychological testing on the Internet: new problems, old issues. , 2004, The American psychologist.

[21]  E. Taal,et al.  Application of the health assessment questionnaire disability index to various rheumatic diseases , 2010, Quality of Life Research.

[22]  Jeffrey A. Hall,et al.  Strategic misrepresentation in online dating: The effects of gender, self-monitoring, and personality traits , 2010 .

[23]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[24]  J. Pennebaker,et al.  Linguistic styles: language use as an individual difference. , 1999, Journal of personality and social psychology.

[25]  J. Rost,et al.  Applications of Latent Trait and Latent Class Models in the Social Sciences , 1998 .

[26]  Cornelis A.W. Glas,et al.  A Comparison of Item-Fit Statistics for the Three-Parameter Logistic Model , 2003 .

[27]  Georg Rasch,et al.  Probabilistic Models for Some Intelligence and Attainment Tests , 1981, The SAGE Encyclopedia of Research Design.

[28]  Nicole B. Ellison,et al.  Managing Impressions Online: Self-Presentation Processes in the Online Dating Environment , 2006, J. Comput. Mediat. Commun..

[29]  Carlos Salas,et al.  Objective vs. Self-Reported Physical Activity and Sedentary Time: Effects of Measurement Method on Relationships with Risk Biomarkers , 2012, PloS one.

[30]  Mark R. Leary,et al.  Handbook of self and identity , 2003 .

[31]  Ivan Bruha,et al.  From machine learning to knowledge discovery: Survey of preprocessing and postprocessing , 2000, Intell. Data Anal..

[32]  Jeffrey A. Hall,et al.  Self-monitoring, honesty, and cue use on Facebook: The relationship with user extraversion and conscientiousness , 2013, Comput. Hum. Behav..

[33]  Jeffrey T. Hancock,et al.  Separating Fact From Fiction: An Examination of Deceptive Self-Presentation in Online Dating Profiles , 2008, Personality & social psychology bulletin.

[34]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[35]  J. Pennebaker,et al.  The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods , 2010 .

[36]  Cornelis A.W. Glas,et al.  A Person Fit Test For Irt Models For Polytomous Items , 2007 .

[37]  Загоровская Ольга Владимировна,et al.  A study of the influence of the author's gender and psychological characteristics on the quantitative parameters of the author's text using the Linguistic Inquiry and Word Count program , 2015 .

[38]  Jessica Greenebaum Managing Impressions , 2012 .

[39]  Qiwei He,et al.  Assessing impact of differential symptom functioning on post‐traumatic stress disorder (PTSD) diagnosis , 2014, International journal of methods in psychiatric research.

[40]  F. Lord Applications of Item Response Theory To Practical Testing Problems , 1980 .

[41]  Reynol Junco,et al.  Comparing actual and self-reported measures of Facebook use , 2013, Comput. Hum. Behav..

[42]  Cees A. W. Glas,et al.  DETECTION OF DIFFERENTIAL ITEM FUNCTIONING USING LAGRANGE MULTIPLIER TESTS , 1996 .

[43]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[44]  Samuel D. Gosling,et al.  Personality Impressions Based on Facebook Profiles , 2007, ICWSM.

[45]  Francisco Iacobelli,et al.  Large Scale Personality Classification of Bloggers , 2011, ACII.

[46]  D. Smith (On) Self-Presentation , 1989 .

[47]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm , 1981 .

[48]  Tracii Ryan,et al.  Who uses Facebook? An investigation into the relationship between the Big Five, shyness, narcissism, loneliness, and Facebook usage , 2011, Comput. Hum. Behav..