Inferring social media users' demographics from profile pictures: A Face++ analysis on twitter users

In this research, we evaluate the applicability of using facial recognition of social media account profile pictures to infer the demographic attributes of gender, race, and age of the account owners leveraging a commercial and well-known image service, specifically Face++. Our goal is to determine the feasibility of this approach for actual system implementation. Using a dataset of approximately 10,000 Twitter profile pictures, we use Face++ to classify this set of images for gender, race, and age. We determine that about 30% of these profile pictures contain identifiable images of people using the current state-of-the-art automated means. We then employ human evaluations to manually tag both the set of images that were determined to contain faces and the set that was determined not to contain faces, comparing the results to Face++. Of the thirty percent that Face++ identified as containing a face, about 80% are more likely than not the account holder based on our manual classification, with a variety of issues in the remaining 20%. Of the images that Face++ was unable to detect a face, we isolate a variety of likely issues preventing this detection, when a face actually appeared in the image. Overall, we find the applicability of automatic facial recognition to infer demographics for system development to be problematic, despite the reported high accuracy achieved for image test collections.

[1]  Wajdi Dhifli,et al.  Face Recognition in the Wild , 2016, KES.

[2]  George Azzopardi,et al.  Gender Recognition from Face Images Using a Fusion of SVM Classifiers , 2016, ICIAR.

[3]  Paul A. Longley,et al.  The Geotemporal Demographics of Twitter Usage , 2015 .

[4]  Ali Shojaie,et al.  Using Twitter for Demographic and Social Science Research: Tools for Data Collection and Processing , 2014, Sociological methods & research.

[5]  Fabrício Benevenuto,et al.  Linguistic Diversities of Demographic Groups in Twitter , 2017, HT.

[6]  Venkata Rama Kiran Garimella,et al.  Inferring international and internal migration patterns from Twitter data , 2014, WWW.

[7]  Philip S. Yu,et al.  Empirical Evaluation of Profile Characteristics for Gender Classification on Twitter , 2013, 2013 12th International Conference on Machine Learning and Applications.

[8]  Bernard J. Jansen,et al.  Generating Cultural Personas from Social Data: A Perspective of Middle Eastern Users , 2017, 2017 5th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW).

[9]  Timothy Cribbin,et al.  An Interactive Method for Inferring Demographic Attributes in Twitter , 2015, HT.

[10]  Bernard J. Jansen,et al.  Persona Generation from Aggregated Social Media Data , 2017, CHI Extended Abstracts.

[11]  Fabrício Benevenuto,et al.  White, man, and highly followed: gender and race inequalities in Twitter , 2017, WI.

[12]  Milad Shokouhi,et al.  Inferring the demographics of search users: social data meets search queries , 2013, WWW.

[13]  Wenyi Huang,et al.  Inferring nationalities of Twitter users and studying inter-national linking , 2014, HT.

[14]  Jisun An,et al.  #greysanatomy vs. #yankees: Demographics and Hashtag Use on Twitter , 2016, ICWSM.

[15]  Bernard J. Jansen,et al.  Personas for Content Creators via Decomposed Aggregate Audience Statistics , 2017, 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[16]  Bernard J. Jansen,et al.  Towards Automatic Persona Generation Using Social Media , 2016, 2016 IEEE 4th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW).

[17]  Nicholas Jing Yuan,et al.  You Are Where You Go: Inferring Demographic Attributes from Location Check-ins , 2015, WSDM.

[18]  Philip S. Yu,et al.  Language independent gender classification on Twitter , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[19]  Yujing Jiang,et al.  Learning Compact Face Representation: Packing a Face into an int32 , 2014, ACM Multimedia.