Personality Prediction for Microblog Users with Active Learning Method

Personality research on social media is a hot topic recently due to the rapid development of social medias well as the central importance of personality in psychology, but it is hard to acquire adequate appropriate labeled samples. Our research aims to choose the right users to be labeled to improve the accuracy of predicting. Given a set of Microblog users’ public information (e.g., number of followers) and a few labeled users, the task is to predict personality of other unlabeled users. The active learning regression algorithm has been employed to establish predicting model in this paper, and the experimental results demonstrate our method can fairly well predict the personality of Microblog users.

[1]  William A. Gale,et al.  A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[2]  H. Sebastian Seung,et al.  Query by committee , 1992, COLT '92.

[3]  Hamido Fujita,et al.  Personality Estimation Application for Social Media , 2012, SoMeT.

[4]  Lin Qiu,et al.  You are what you tweet: Personality expression and perception on Twitter , 2012 .

[5]  John L. Smith,et al.  Using the Internet for psychological research: personality testing on the World Wide Web. , 1999, British journal of psychology.

[6]  H. Bourlard,et al.  Auto-association by multilayer perceptrons and singular value decomposition , 1988, Biological Cybernetics.

[7]  Jennifer Golbeck,et al.  Predicting personality with social media , 2011, CHI Extended Abstracts.

[8]  Lyle H. Ungar,et al.  Machine Learning manuscript No. (will be inserted by the editor) Active Learning for Logistic Regression: , 2007 .

[9]  John Langford,et al.  Agnostic active learning , 2006, J. Comput. Syst. Sci..

[10]  Michael J. Prince,et al.  Does Active Learning Work? A Review of the Research , 2004 .

[11]  Shinichi Nakajima,et al.  Pool-based active learning in approximate linear regression , 2009, Machine Learning.

[12]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[13]  Samuel D. Gosling,et al.  Manifestations of Personality in Online Social Networks: Self-Reported Facebook-Related Behaviors and Observable Profile Information , 2011, Cyberpsychology Behav. Soc. Netw..

[14]  Luis Mateus Rocha,et al.  Singular value decomposition and principal component analysis , 2003 .

[15]  E. Thompson Development and validation of an international English big-five Mini-Markers , 2008 .

[16]  Gene H. Golub,et al.  Singular value decomposition and least squares solutions , 1970, Milestones in Matrix Computation.

[17]  Tingshao Zhu,et al.  Big-Five Personality Prediction Based on User Behaviors at Social Network Sites , 2012, ArXiv.

[18]  Masashi Sugiyama,et al.  Active Learning in Approximately Linear Regression Based on Conditional Expectation of Generalization Error , 2006, J. Mach. Learn. Res..

[19]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[20]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[21]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .