Inferring Gender of Micro-Blog Users based on Multi-Classifiers Fusion

Knowing user demographic traits offers a great potential for public information. Most researches have used local features to predict user demographic traits. Since this method did not make the most of user global features, the prediction performance was low. In this paper, our goal tries to use an ensemble learning method to improve the prediction performance through multi-classifiers fusion. Our work makes three important contributions. Firstly, we show how to predict Sina Micro-blog users’ genders based on his/her text published on the social network. Secondly, we show that user’s personality traits can also be used to infer gender. And last and thirdly, we propose multi-classifiers fusion to predict users’ genders, and give the experimental results that validate our method by comparing it with a different local features dataset. Our experiment demonstrates that our method can improve the accuracy rate, the recall rate of prediction, and the F value.

[1]  Ana-Maria Popescu,et al.  Democrats, republicans and starbucks afficionados: user classification in twitter , 2011, KDD.

[2]  Jahna Otterbacher,et al.  Inferring gender of movie reviewers: exploiting writing style, content and metadata , 2010, CIKM.

[3]  Wei Huazhe Face Verification by Confusing Local Bayesian Classifier , 2016 .

[4]  Vincent S. Tseng,et al.  Demographic Prediction Based on User's Mobile Behaviors , 2012 .

[5]  Venkata Rama Kiran Garimella,et al.  Political search trends , 2012, SIGIR '12.

[6]  Krishna P. Gummadi,et al.  You are who you know: inferring user profiles in online social networks , 2010, WSDM '10.

[7]  Zhenyu Liu,et al.  Inferring Privacy Information from Social Networks , 2006, ISI.

[8]  Bhavani M. Thuraisingham,et al.  Inferring private information using social network data , 2009, WWW '09.

[9]  Svitlana Volkova,et al.  Inferring Latent User Properties from Texts Published in Social Media , 2015, AAAI.

[10]  Maen Takruri,et al.  Multi-classifier decision fusion for enhancing melanoma recognition accuracy , 2016, 2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA).

[11]  Lise Getoor,et al.  To join or not to join: the illusion of privacy in social networks with mixed public and private user profiles , 2009, WWW '09.

[12]  T. Graepel,et al.  Private traits and attributes are predictable from digital records of human behavior , 2013, Proceedings of the National Academy of Sciences.

[13]  Shuguang Huang,et al.  Predicting Big-Five Personality for Micro-blog Based on Robust Multi-task Learning , 2017, ICPCSEE.

[14]  Atallah Ibrahim Hashad,et al.  Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classifiers Fusion , 2015 .

[15]  Lin Hong-fei Research on Gender Recognition for Character in Text , 2010 .

[16]  Lei Li,et al.  Inferring privacy information via social relations , 2008, 2008 IEEE 24th International Conference on Data Engineering Workshop.

[17]  Youtian Du,et al.  User Authentication Through Mouse Dynamics , 2013, IEEE Transactions on Information Forensics and Security.

[18]  Milad Shokouhi,et al.  Inferring the demographics of search users: social data meets search queries , 2013, WWW.