What demographic attributes do our digital footprints reveal? A systematic review

To what extent does our online activity reveal who we are? Recent research has demonstrated that the digital traces left by individuals as they browse and interact with others online may reveal who they are and what their interests may be. In the present paper we report a systematic review that synthesises current evidence on predicting demographic attributes from online digital traces. Studies were included if they met the following criteria: (i) they reported findings where at least one demographic attribute was predicted/inferred from at least one form of digital footprint, (ii) the method of prediction was automated, and (iii) the traces were either visible (e.g. tweets) or non-visible (e.g. clickstreams). We identified 327 studies published up until October 2018. Across these articles, 14 demographic attributes were successfully inferred from digital traces; the most studied included gender, age, location, and political orientation. For each of the demographic attributes identified, we provide a database containing the platforms and digital traces examined, sample sizes, accuracy measures and the classification methods applied. Finally, we discuss the main research trends/findings, methodological approaches and recommend directions for future research.

[1]  Mahmoud Al-Ayyoub,et al.  Emotion analysis of Arabic articles and its impact on identifying the author's gender , 2015, 2015 IEEE/ACS 12th International Conference of Computer Systems and Applications (AICCSA).

[2]  David Pinto,et al.  Unsupervised method for the authorship identification task Notebook for PAN at CLEF 2014 , 2014 .

[3]  Antonio Torralba,et al.  Face-to-BMI: Using Computer Vision to Infer Body Mass Index on Social Media , 2017, ICWSM.

[4]  Rao Muhammad Adeel Nawab,et al.  Cross-Genre Author Profile Prediction Using Stylometry-Based Approach , 2016, CLEF.

[5]  Erhan Sezerer,et al.  Gender Prediction From Tweets With Convolutional Neural Networks: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[6]  Walter Daelemans,et al.  Text-Based Age and Gender Prediction for Online Safety Monitoring , 2015 .

[7]  Berkant Barla Cambazoglu,et al.  Chat Mining for Gender Prediction , 2006, ADVIS.

[8]  Brendan T. O'Connor,et al.  A Latent Variable Model for Geographic Lexical Variation , 2010, EMNLP.

[9]  Benno Stein,et al.  Overview of the 2 nd Author Profiling Task at PAN 2014 , 2014 .

[10]  Antoine Boutet,et al.  What’s in Twitter, I know what parties are popular and who you are supporting now! , 2013, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[11]  Lyle H. Ungar,et al.  Discovering User Attribute Stylistic Differences via Paraphrasing , 2016, AAAI.

[12]  Paolo Rosso,et al.  On the impact of emotions on author profiling , 2016, Inf. Process. Manag..

[13]  S. Sikström,et al.  “She” and “He” in News Media Messages: Pronoun Use Reflects Gender Biases in Semantic Contexts , 2015 .

[14]  Takahide Hoshide,et al.  What is he/she like?: Estimating Twitter user attributes from contents and social neighbors , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[15]  Mirco Kocher,et al.  UniNE at CLEF 2015 Author Profiling: Notebook for PAN at CLEF 2015 , 2015, CLEF.

[16]  M. Williams,et al.  Who Tweets? Deriving the Demographic Characteristics of Age, Occupation and Social Class from Twitter User Meta-Data , 2015, PloS one.

[17]  Rostislav Khlebnikov,et al.  Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , 2016 .

[18]  Michele C. Weigle,et al.  Demographic Prediction of Mobile User from Phone Usage , 2012 .

[19]  Philip S. Yu,et al.  Say It with Colors: Language-Independent Gender Classification on Twitter , 2014, Online Social Media Analysis and Visualization.

[20]  Benno Stein,et al.  Overview of the 5th Author Profiling Task at PAN 2017: Gender and Language Variety Identification in Twitter , 2017, CLEF.

[21]  Davood Rafiei,et al.  Predicting political preference of Twitter users , 2013, ASONAM.

[22]  Shlomo Argamon,et al.  Automatically profiling the author of an anonymous text , 2009, CACM.

[23]  Carl Vogel,et al.  Style-based Distance Features for Author Profiling Notebook for PAN at CLEF 2013 , 2013, CLEF.

[24]  Svitlana Volkova,et al.  Mining User Interests to Predict Perceived Psycho-Demographic Traits on Twitter , 2016, 2016 IEEE Second International Conference on Big Data Computing Service and Applications (BigDataService).

[25]  Mark Cieliebak,et al.  Word Unigram Weighing for Author Profiling at PAN 2018: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[26]  Shlomo Argamon,et al.  Exploiting subjectivity analysis in blogs to improve political leaning categorization , 2008, SIGIR '08.

[27]  Tomoki Taniguchi,et al.  Author Profiling with Word+Character Neural Attention Network , 2017, CLEF.

[28]  David Yarowsky,et al.  Classifying latent user attributes in twitter , 2010, SMUC '10.

[29]  Ivandré Paraboni,et al.  Author Profiling using Word Embeddings with Subword Information: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[30]  Nemanja Djuric,et al.  Leveraging Blogging Activity on Tumblr to Infer Demographics and Interests of Users for Advertising Purposes , 2016, #Microposts.

[31]  S. Vicari Twitter and Non-Elites: Interpreting Power Dynamics in the Life Story of the (#)BRCA Twitter Stream , 2017, Social media + society.

[32]  Aron Culotta,et al.  Learning from noisy label proportions for classifying online social data , 2017, Social Network Analysis and Mining.

[33]  Michal Meina,et al.  Ensemble-based Classification for Author Profiling Using Various Features Notebook for PAN at CLEF 2013 , 2013, CLEF.

[34]  D. McAdams,et al.  The Psychology of Life Stories , 2001 .

[35]  R. Lakoff,et al.  Language and woman's place : text and commentaries , 2004 .

[36]  Lars Backstrom,et al.  ePluribus: Ethnicity on Social Networks , 2010, ICWSM.

[37]  Adrian Popescu,et al.  Mining User Home Location and Gender from Flickr Tags , 2010, ICWSM.

[38]  Stefan Conrad,et al.  Exploring the Effects of Cross-Genre Machine Learning for Author Profiling in PAN 2016 , 2016, CLEF.

[39]  Laura L. Carstensen,et al.  Evidence for a Life-Span Theory of Socioemotional Selectivity , 1995 .

[40]  Sang-Wook Kim,et al.  Photos Don't Have Me, But How Do You Know Me?: Analyzing and Predicting Users on Instagram , 2018, UMAP.

[41]  Ee-Peng Lim,et al.  On predicting religion labels in microblogging networks , 2014, SIGIR.

[42]  Markus Koch,et al.  Linking visual concept detection with viewer demographics , 2012, ICMR '12.

[43]  Rik van Noord,et al.  Using Translated Data to Improve Deep Learning Author Profiling Models: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[44]  Daniela Moctezuma,et al.  Gender and language-variety Identification with MicroTC , 2017, CLEF.

[45]  M. Gordon Language, Society and the Elderly: Discourse, Identity and Ageing , 1994 .

[46]  Fusheng Wang,et al.  A Comparative Study of Demographic Attribute Inference in Twitter , 2015, ICWSM.

[47]  Carlos Sarraute,et al.  A study of age and gender seen through mobile phone usage patterns in Mexico , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[48]  Fermín L. Cruz,et al.  ITALICA at PAN 2013: An Ensemble Learning Approach to Author Profiling Notebook for PAN at CLEF 2013 , 2013, CLEF.

[49]  Hugo Jair Escalante,et al.  Using Intra-Profile Information for Author Profiling Notebook for PAN at CLEF 2014 , 2014 .

[50]  Pablo Barberá,et al.  Less is more? How demographic sample weights can improve public opinion estimates based on Twitter data. , 2016 .

[51]  Mishuana R. Goeman Mark My Words , 2013 .

[52]  Di Ma,et al.  Demographic Information Inference through Meta-Data Analysis of Wi-Fi Traffic , 2018, IEEE Transactions on Mobile Computing.

[53]  John Cardiff,et al.  Twitter Author Profiling Using Word Embeddings and Logistic Regression , 2017, CLEF.

[54]  Timothy Cribbin,et al.  An Interactive Method for Inferring Demographic Attributes in Twitter , 2015, HT.

[55]  Teresa Gonçalves,et al.  Multilingual author profiling using word embedding averages and SVMs , 2016, 2016 10th International Conference on Software, Knowledge, Information Management & Applications (SKIMA).

[56]  James Caverlee,et al.  Location prediction in social media based on tie strength , 2013, CIKM.

[57]  Yiannis Kompatsiaris,et al.  Assessing the Reliability of Facebook User Profiling , 2015, WWW.

[58]  Derek Ruths,et al.  Geolocation Prediction in Twitter Using Social Networks: A Critical Analysis and Review of Current Practice , 2015, ICWSM.

[59]  Erhan Sezerer,et al.  Gender Prediction from Turkish Tweets with Neural Networks , 2019, 2019 27th Signal Processing and Communications Applications Conference (SIU).

[60]  Rui Li,et al.  Multiple Location Profiling for Users and Relationships from Social Network and Content , 2012, Proc. VLDB Endow..

[61]  Daniel Dichiu,et al.  Using Machine Learning Algorithms for Author Profiling In Social Media , 2016, CLEF.

[62]  Roman Kern,et al.  Profiling Microblog Authors using Concreteness and Sentiment - Know-Center at PAN 2016 Author Profiling , 2016, CLEF.

[63]  Derek Ruths,et al.  Classifying Political Orientation on Twitter: It's Not Easy! , 2013, ICWSM.

[64]  J. Pennebaker,et al.  PERSONALITY PROCESSES AND INDIVIDUAL DIFFERENCES Words of Wisdom: Language Use Over the Life Span , 2003 .

[65]  Hugo Jair Escalante,et al.  INAOE's Participation at PAN'15: Author Profiling task , 2015, CLEF.

[66]  Margaret L. Kern,et al.  Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach , 2013, PloS one.

[67]  Aron Culotta,et al.  Mining the Demographics of Political Sentiment from Twitter Using Learning from Label Proportions , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[68]  Gary M. Weiss,et al.  Identifying user traits by mining smart phone accelerometer data , 2011, SensorKDD '11.

[69]  Eric B. Weiser,et al.  Gender Differences in Internet Use Patterns and Internet Application Preferences: A Two-Sample Comparison , 2000, Cyberpsychology Behav. Soc. Netw..

[70]  Malvina Nissim,et al.  GronUP: Groningen User Profiling: Notebook for PAN at CLEF 2016 , 2016 .

[71]  Dongwon Lee,et al.  @Phillies Tweeting from Philly? Predicting Twitter User Locations with Spatial Word Usage , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[72]  L. Lunsky Childhood and Society. , 1965 .

[73]  Jussi Karlgren,et al.  Authorship Profiling Without Using Topical Information: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[74]  Nils Schaetti UniNE at CLEF 2017: TF-IDF and Deep-Learning for Author Profiling , 2017, CLEF.

[75]  Palakorn Achananuparp,et al.  Insights from Machine-Learned Diet Success Prediction , 2015, PSB.

[76]  Yulia Tsvetkov,et al.  Writer Profiling Without the Writer's Text , 2017, SocInfo.

[77]  Fei Wang,et al.  Age Detection for Chinese Users in Weibo , 2015, WAIM.

[78]  Berkant Barla Cambazoglu,et al.  Chat mining: Predicting user and message attributes in computer-mediated communication , 2008, Inf. Process. Manag..

[79]  David Bamman,et al.  Gender in Twitter: Styles, stances, and social networks , 2012, ArXiv.

[80]  Asaf Shabtai,et al.  Noise Reduction of Mobile Sensors Data in the Prediction of Demographic Attributes , 2015, 2015 2nd ACM International Conference on Mobile Software Engineering and Systems.

[81]  A. Stefanidis,et al.  Harvesting ambient geospatial information from social media feeds , 2011, GeoJournal.

[82]  Jeongkyu Lee,et al.  User Profiling of Flickr: Integrating Multiple Types of Features for Gender Classification , 2015 .

[83]  Gerd Stumme,et al.  Gender Inference using Statistical Name Characteristics in Twitter , 2016, MISNC.

[84]  Julio Gonzalo,et al.  Overview of RepLab 2014: Author Profiling and Reputation Dimensions for Online Reputation Management , 2014, CLEF.

[85]  Matthias Hollick,et al.  Show me your phone, I will tell you who your friends are: analyzing smartphone data to identify social relationships , 2015, MUM.

[86]  José Palazzo Moreira de Oliveira,et al.  Exploring Information Retrieval features for Author Profiling Notebook for PAN at CLEF 2014 , 2014 .

[87]  Malvina Nissim,et al.  GronUP: Groningen User Profiling , 2016, CLEF.

[88]  Jamal Ahmad Khan Author Profile Prediction Using Trend and Word Frequency Based Analysis in Text , 2017, CLEF.

[89]  S. Gosling,et al.  e-Perceptions : Personality Impressions Based on Personal Websites , 2004 .

[90]  Koen W. De Bock,et al.  Predicting Website Audience Demographics forWeb Advertising Targeting Using Multi-Website Clickstream Data , 2010, Fundam. Informaticae.

[91]  Rodrigo Ribeiro Oliveira,et al.  Using Character n-grams and Style Features for Gender and Language Variety Classification , 2017, CLEF.

[92]  Jacob Ratkiewicz,et al.  Predicting the Political Alignment of Twitter Users , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[93]  T. Graepel,et al.  Private traits and attributes are predictable from digital records of human behavior , 2013, Proceedings of the National Academy of Sciences.

[94]  Mirco Musolesi,et al.  It's the way you check-in: identifying users in location-based social networks , 2014, COSN '14.

[95]  Clayton Fink,et al.  Inferring Gender from the Content of Tweets: A Region Specific Example , 2012, ICWSM.

[96]  Pablo Barberá Birds of the Same Feather Tweet Together: Bayesian Ideal Point Estimation Using Twitter Data , 2015, Political Analysis.

[97]  Dominique Estival,et al.  Author attribution with email messages , 2008 .

[98]  Aron Culotta,et al.  Co-Training for Demographic Classification Using Deep Learning from Label Proportions , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[99]  Jacques Savoy,et al.  UniNE at CLEF 2017: Author Profiling Reasoning , 2017, CLEF.

[100]  Son Bao Pham,et al.  Author Profiling for Vietnamese Blogs , 2009, 2009 International Conference on Asian Language Processing.

[101]  Daniel Dichiu,et al.  Automatic Profiling of Twitter Users Based on Their Tweets: Notebook for PAN at CLEF 2015 , 2015, CLEF.

[102]  Ingmar Weber,et al.  You Are What Apps You Use: Demographic Prediction Based on User's Apps , 2016, ICWSM.

[103]  Lyle H. Ungar,et al.  Analyzing Biases in Human Perception of User Age and Gender from Text , 2016, ACL.

[104]  P. McNamara,et al.  Parkinson’s Disease and Politeness , 2010, Journal of language and social psychology.

[105]  Tammara L. Jenkins,et al.  Language Analysis as a Window to Bereaved Parents’ Emotions During a Parent–Physician Bereavement Meeting , 2015, Journal of language and social psychology.

[106]  José Palazzo Moreira de Oliveira,et al.  Examining Multiple Features for Author Profiling , 2014, J. Inf. Data Manag..

[107]  Dong Nguyen,et al.  "How Old Do You Think I Am?" A Study of Language and Age in Twitter , 2013, ICWSM.

[108]  Ravi Kumar,et al.  "I know what you did last summer": query logs and user privacy , 2007, CIKM '07.

[109]  Xiaojun Ma,et al.  Twitter User Gender Inference Using Combined Analysis of Text and Image Processing , 2014, VL@COLING.

[110]  Dongwon Lee,et al.  Teens are from mars, adults are from venus: analyzing and predicting age groups with behavioral characteristics in instagram , 2016, WebSci.

[111]  Milad Shokouhi,et al.  Inferring the Demographics of Search Users , 2013 .

[112]  Thamar Solorio,et al.  A Simple Approach to Author Profiling in MapReduce , 2014, CLEF.

[113]  A. Culotta,et al.  Using County Demographics to Infer Attributes of Twitter Users , 2014 .

[114]  Michael D. Smith,et al.  Predicting the Political Sentiment of Web Log Posts Using Supervised Machine Learning Techniques Coupled with Feature Selection , 2006, WEBKDD.

[115]  Paolo Rosso,et al.  On the Multilingual and Genre Robustness of EmoGraphs for Author Profiling in Social Media , 2015, CLEF.

[116]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[117]  Arno Scharl,et al.  Analyzing the public discourse on works of fiction – Detection and visualization of emotion in online coverage about HBO’s Game of Thrones , 2016, Inf. Process. Manag..

[118]  David Yarowsky,et al.  Hierarchical Bayesian Models for Latent Attribute Detection in Social Media , 2011, ICWSM.

[119]  Ingmar Weber,et al.  Quantified Self Meets Social Media: Sharing of Weight Updates on Twitter , 2016, Digital Health.

[120]  Sofiane Abbar,et al.  You Tweet What You Eat: Studying Food Consumption Through Twitter , 2014, CHI.

[121]  Marcos André Gonçalves,et al.  He Votes or She Votes? Female and Male Discursive Strategies in Twitter Political Hashtags , 2014, PloS one.

[122]  Mung Chiang,et al.  Quantifying Political Leaning from Tweets, Retweets, and Retweeters , 2016, IEEE Transactions on Knowledge and Data Engineering.

[123]  Mahmoud Al-Ayyoub,et al.  Author gender identification from Arabic text , 2017, J. Inf. Secur. Appl..

[124]  David Garcia,et al.  Leaking privacy and shadow profiles in online social networks , 2017, Science Advances.

[125]  Carlos Sarraute,et al.  Harnessing Mobile Phone Social Network Topology to Infer Users Demographic Attributes , 2014, SNAKDD'14.

[126]  Zdenek Salzmann Women, men and language: A sociolinguistic account of gender differences in language By Jennifer Coates (review) , 1994 .

[127]  Walter Daelemans,et al.  Predicting age and gender in online social networks , 2011, SMUC '11.

[128]  Walter Daelemans,et al.  TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling , 2016, LREC.

[129]  Lyle H. Ungar,et al.  Beyond Binary Labels: Political Ideology Prediction of Twitter Users , 2017, ACL.

[130]  J. Pennebaker,et al.  The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods , 2010 .

[131]  Lars Backstrom,et al.  Find me if you can: improving geographical prediction with social and spatial proximity , 2010, WWW '10.

[132]  Rick Kosse,et al.  Mixing Traditional Methods with Neural Networks for Gender Prediction: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[133]  John D. Burger,et al.  Discriminating Gender on Twitter , 2011, EMNLP.

[134]  Svitlana Volkova,et al.  Inferring User Political Preferences from Streaming Communications , 2014, ACL.

[135]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[136]  Lawrence B. Holder,et al.  Using Graphical Features To Improve Demographic Prediction From Smart Phone Data , 2017, NDA@SIGMOD.

[137]  Nitesh V. Chawla,et al.  User Modeling on Demographic Attributes in Big Mobile Social Networks , 2017, ACM Trans. Inf. Syst..

[138]  Caroline Brun,et al.  XRCE Personal Language Analytics Engine for Multilingual Author Profiling: Notebook for PAN at CLEF 2015 , 2015, CLEF.

[139]  Prasant Mohapatra,et al.  Predicting user traits from a snapshot of apps installed on a smartphone , 2014, MOCO.

[140]  Rajarathnam Chandramouli,et al.  Author gender identification from text , 2011, Digit. Investig..

[141]  Vanessa Frías-Martínez,et al.  A Gender-Centric Analysis of Calling Behavior in a Developing Economy Using Call Detail Records , 2010, AAAI Spring Symposium: Artificial Intelligence for Development.

[142]  K. Schaie The course of adult intellectual development. , 1994, The American psychologist.

[143]  Danny Azucar,et al.  Predicting the Big 5 personality traits from digital footprints on social media: A meta-analysis , 2018 .

[144]  Jiebo Luo,et al.  Inferring Home Location from User's Photo Collections based on Visual Content and Mobility Patterns , 2014, GeoMM '14.

[145]  Jason Radford,et al.  Piloting a theory-based approach to inferring gender in big data , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[146]  Ho-Jin Lee,et al.  User Age Profile Assessment Using SMS Network Neighbors' Age Profiles , 2009, 2009 International Conference on Advanced Information Networking and Applications Workshops.

[147]  D. Ruths,et al.  What's in a Name? Using First Names as Features for Gender Inference in Twitter , 2013, AAAI Spring Symposium: Analyzing Microtext.

[148]  David Yarowsky,et al.  Broadly Improving User Classification via Communication-Based Name and Location Clustering on Twitter , 2013, NAACL.

[149]  Mark Stevenson,et al.  Using TF-IDF n-gram and Word Embedding Cluster Ensembles for Author Profiling , 2017, CLEF.

[150]  Sara Rosenthal,et al.  Age Prediction in Blogs: A Study of Style, Content, and Online Behavior in Pre- and Post-Social Media Generations , 2011, ACL.

[151]  T. Yarkoni,et al.  Choosing Prediction Over Explanation in Psychology: Lessons From Machine Learning , 2017, Perspectives on psychological science : a journal of the Association for Psychological Science.

[152]  M. Kosinski,et al.  Deep Neural Networks Are More Accurate Than Humans at Detecting Sexual Orientation From Facial Images , 2018, Journal of personality and social psychology.

[153]  Blaz Skrlj,et al.  Multilingual Gender Classification with Multi-view Deep Learning: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[154]  Bo Luo,et al.  Building Topic Models to Predict Author Attributes from Twitter Messages , 2015, CLEF.

[155]  Wendy Liu,et al.  Homophily and Latent Attribute Inference: Inferring Latent Attributes of Twitter Users from Neighbors , 2012, ICWSM.

[156]  Paul B. Baltes,et al.  Theoretical propositions of life-span developmental psychology : On the dynamics between growth and decline , 1987 .

[157]  Noriji Kato,et al.  Content-Aware Multi-task Neural Networks for User Gender Inference Based on Social Media Images , 2016, 2016 IEEE International Symposium on Multimedia (ISM).

[158]  Edzer J. Pebesma,et al.  A Machine Learning Approach to Demographic Prediction using Geohashes , 2017, SocialSens@CPSWeek.

[159]  Hong Yang,et al.  Profiling Web users using big data , 2018, Social Network Analysis and Mining.

[160]  Òscar Garibo i Orts A Big Data approach to gender classification in Twitter: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[161]  Nitesh V. Chawla,et al.  Inferring user demographics and social strategies in mobile social networks , 2014, KDD.

[162]  Victor C. M. Leung,et al.  Demographic information prediction based on smartphone application usage , 2014, 2014 International Conference on Smart Computing.

[163]  Francisco Rangel Author Profile in Social Media: Identifying Information about Gender, Age, Emotions and beyond , 2013 .

[164]  Marie-Francine Moens,et al.  Age and Gender Identification in Social Media , 2014, CLEF.

[165]  J. Pennebaker,et al.  Are Women Really More Talkative Than Men? , 2007, Science.

[166]  Katja Filippova,et al.  User Demographics and Language in an Implicit Social Network , 2012, EMNLP.

[167]  Davide Buscaldi,et al.  A Random Forest Approach for Authorship Profiling , 2015, CLEF.

[168]  Nicholas Jing Yuan,et al.  You Are Where You Go: Inferring Demographic Attributes from Location Check-ins , 2015, WSDM.

[169]  Jahna Otterbacher,et al.  Inferring gender of movie reviewers: exploiting writing style, content and metadata , 2010, CIKM.

[170]  Peter Knees,et al.  Prediction of User Demographics from Music Listening Habits , 2017, CBMI.

[171]  M Meisel Jurgen,et al.  Language change across the lifespan , 2013 .

[172]  Marcelo Luis Errecalde,et al.  Profile-based Approach for Age and Gender Identification , 2016, CLEF.

[173]  J. Lo,et al.  Fast Estimation of Ideal Points with Massive Data , 2016, American Political Science Review.

[174]  Mung Chiang,et al.  Quantifying Political Leaning from Tweets and Retweets , 2013, ICWSM.

[175]  Mirco Kocher UniNE at CLEF 2016: Author Clustering , 2016, CLEF.

[176]  D. Culibrk,et al.  Demographic Attributes Prediction on the Real-World Mobile Data , 2012 .

[177]  Jeffrey Nichols,et al.  Where Is This Tweet From? Inferring Home Locations of Twitter Users , 2012, ICWSM.

[178]  Grigori Sidorov,et al.  Adapting Cross-Genre Author Profiling to Language and Corpus , 2016, CLEF.

[179]  Thamar Solorio,et al.  Using Wide Range of Features for Author profiling , 2015, CLEF.

[180]  Jacques Savoy,et al.  UniNE at CLEF 2015 Author Identification: Notebook for PAN at CLEF 2015 , 2015, CLEF.

[181]  Graciela María de Jesús Ramírez Alonso,et al.  Custom Document Embeddings Via the Centroids Method: Gender Classification in an Author Profiling Task: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[182]  Cathy Zhang,et al.  Predicting gender from blog posts , 2010 .

[183]  Shlomo Argamon,et al.  Effects of Age and Gender on Blogging , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[184]  Teresa Gonçalves,et al.  Age and Gender Classification of Tweets Using Convolutional Neural Networks , 2017, MOD.

[185]  P. Howard,et al.  The Upheavals in Egypt and Tunisia: The Role of Digital Media , 2011 .

[186]  Vrizlynn L. L. Thing,et al.  Content-centric Age and Gender Profiling Notebook for PAN at CLEF 2013 , 2013, CLEF.

[187]  Philip S. Yu,et al.  Empirical Evaluation of Profile Characteristics for Gender Classification on Twitter , 2013, 2013 12th International Conference on Machine Learning and Applications.

[188]  Youngme Moon,et al.  Personalization and Personality: Some Effects of Customizing Message Style Based on Consumer Personality , 2002 .

[189]  H. Giles,et al.  Communication accommodation theory: A look back and a look ahead , 2005 .

[190]  Christiane Gelitz You Are What You Like , 2011 .

[191]  Sharad Goel,et al.  Who Does What on the Web: A Large-Scale Study of Browsing Behavior , 2012, ICWSM.

[192]  Qiang Yang,et al.  Report of Task 3: Your Phone Understands You , 2012 .

[193]  S. Gosling,et al.  A room with a cue: personality judgments based on offices and bedrooms. , 2002, Journal of personality and social psychology.

[194]  Teruo Higashino,et al.  Twitter user profiling based on text and community mining for market analysis , 2013, Knowl. Based Syst..

[195]  Carlos Sarraute,et al.  A Bayesian approach to income inference in a communication network , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[196]  Dong Nguyen,et al.  "TweetGenie: automatic age prediction from tweets" by D. Nguyen, R. Gravel, D. Trieschnigg, and T. Meder; with Ching-man Au Yeung as coordinator , 2013, LINK.

[197]  Jon Oberlander,et al.  The Identity of Bloggers: Openness and Gender in Personal Weblogs , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[198]  Stratis Ioannidis,et al.  BlurMe: inferring and obfuscating user gender based on ratings , 2012, RecSys.

[199]  Philipp Schaer,et al.  Experimental IR Meets Multilinguality, Multimodality, and Interaction , 2017, Lecture Notes in Computer Science.

[200]  Steven Skiena,et al.  Name-ethnicity classification from open sources , 2009, KDD.

[201]  Mark Cieliebak,et al.  Author Profiling with Bidirectional RNNs using Attention with GRUs , 2017, CLEF.

[202]  S Thara,et al.  Ensemble Learning Approach for Author Profiling , 2014, CLEF.

[203]  Glen Szczypka,et al.  Are you Scared Yet?: Evaluating Fear Appeal Messages in Tweets about the Tips Campaign. , 2014, The Journal of communication.

[204]  Derek Ruths,et al.  Gender Inference of Twitter Users in Non-English Contexts , 2013, EMNLP.

[205]  Azucena Montes Rendón,et al.  Tweets Classification using Corpus Dependent Tags, Character and POS N-grams , 2015, CLEF.

[206]  Dawn O. Braithwaite,et al.  Explaining Communication : Contemporary Theories and Exemplars , 2013 .

[207]  Daniel Castro-Castro,et al.  Author Profiling, instance-based Similarity Classification , 2017, CLEF.

[208]  David Yarowsky,et al.  Improving Gender Prediction of Social Media Users via Weighted Annotator Rationales , 2014 .

[209]  A. Graesser,et al.  Pronoun Use Reflects Standings in Social Hierarchies , 2014 .

[210]  Jiebo Luo,et al.  The Eyes of the Beholder: Gender Prediction Using Images Posted in Online Social Networks , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[211]  J. Holmes,et al.  The handbook of language and gender , 2003 .

[212]  José María Gómez Hidalgo,et al.  Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling Notebook for PAN at CLEF 2013 , 2013, CLEF.

[213]  Carlos Sarraute,et al.  Inference of demographic attributes based on mobile phone usage patterns and social network topology , 2015, Social Network Analysis and Mining.

[214]  Renato Miranda,et al.  Inferring User Social Class in Online Social Networks , 2014, SNAKDD'14.

[215]  Dominique Estival,et al.  Author Profiling for English and Arabic Emails , 2008 .

[216]  Carolyn Penstein Rosé,et al.  Author Age Prediction from Text using Linear Regression , 2011, LaTeCH@ACL.

[217]  Sheila Kinsella,et al.  "I'm eating a sandwich in Glasgow": modeling locations with tweets , 2011, SMUC '11.

[218]  Carl Vogel,et al.  Style-based distance features for author verification - Notebook for PAN at CLEF 2013. , 2013 .

[219]  Yi-Hsuan Yang,et al.  Inferring personal traits from music listening history , 2012, MIRUM '12.

[220]  Virgílio A. F. Almeida,et al.  We know where you live: privacy characterization of foursquare behavior , 2012, UbiComp.

[221]  Nils Schaetti,et al.  Character-based Convolutional Neural Network and ResNet18 for Twitter Author Profiling: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[222]  George Giannakopoulos,et al.  Author Profiling using Stylometric and Structural Feature Groupings , 2015, CLEF.

[223]  Jun Ma,et al.  Gender Prediction Based on Data Streams of Smartphone Applications , 2015, BigCom.

[224]  Krishna P. Gummadi,et al.  You are who you know: inferring user profiles in online social networks , 2010, WSDM '10.

[225]  F. de Terlizzi,et al.  Quantitative ultrasound of the hand phalanges in a cohort of monozygotic twins: influence of genetic and environmental factors , 2005, Skeletal Radiology.

[226]  Xiang Yan,et al.  Gender Classification of Weblog Authors , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[227]  Desislava Zhekova,et al.  CAPS: A Cross-genre Author Profiling System , 2016, CLEF.

[228]  Trevor Cohn,et al.  A user-centric model of voting intention from Social Media , 2013, ACL.

[229]  Michael Granitzer,et al.  Stacked Gender Prediction from Tweet Texts and Images: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[230]  Martha-Alicia Rocha,et al.  Semantic-based Features for Author Profiling Identification: First insights Notebook for PAN at CLEF 2013 , 2013, CLEF.

[231]  Jeff Gavin,et al.  Social identity formation during the emergence of the occupy movement , 2015 .

[232]  Sudeshna Sarkar,et al.  Learning Age and Gender of Blogger from Stylistic Variation , 2009, PReMI.

[233]  Luke S Sloan Who Tweets in the United Kingdom? Profiling the Twitter Population Using the British Social Attitudes Survey 2015 , 2017 .

[234]  Shlomo Argamon,et al.  Finding Political Blogs and Their Political Leanings , 2008 .

[235]  Liviu P. Dinu,et al.  Including Dialects and Language Varieties in Author Profiling , 2017, CLEF.

[236]  Kalina Bontcheva,et al.  Where's @wally?: a classification approach to geolocating users based on their social ties , 2013, HT.

[237]  Vincent S. Tseng,et al.  Demographic Prediction Based on User's Mobile Behaviors , 2012 .

[238]  Lyle H. Ungar,et al.  Exploring Stylistic Variation with Age and Income on Twitter , 2016, ACL.

[239]  Dominique Estival,et al.  TAT: An Author Profiling Tool with Application to Arabic Emails , 2007, ALTA.

[240]  P. Eckert,et al.  Language and Gender: Introduction to the study of language and gender , 2013 .

[241]  Jalal Kawash Online Social Media Analysis and Visualization , 2014, Lecture Notes in Social Networks.

[242]  Luke S Sloan,et al.  Who Tweets with Their Location? Understanding the Relationship between Demographic Characteristics and the Use of Geoservices and Geotagging on Twitter , 2015, PloS one.

[243]  Prasant Mohapatra,et al.  Your Installed Apps Reveal Your Gender and More! , 2015, MOCO.

[244]  Jiebo Luo,et al.  Towards Lifestyle Understanding: Predicting Home and Vacation Locations from User's Online Photo Collections , 2015, ICWSM.

[245]  Ed H. Chi,et al.  Tweets from Justin Bieber's heart: the dynamics of the location field in user profiles , 2011, CHI.

[246]  George M. Mohay,et al.  Language and Gender Author Cohort Analysis of E-mail for Computer Forensics , 2002 .

[247]  Mario Baldi,et al.  Identifying Personal Information in Internet Traffic , 2015, COSN.

[248]  Teresa Gonçalves,et al.  Age and Gender Identification using Stacking for Classification , 2016, CLEF.

[249]  Richard Bonneau,et al.  Political Expression and Action on Social Media: Exploring the Relationship Between Lower- and Higher-Threshold Political Activities Among Twitter Users in Italy , 2015, J. Comput. Mediat. Commun..

[250]  Rao Muhammad Adeel Nawab,et al.  Author's Traits Prediction on Twitter Data using Content Based Approach , 2015, CLEF.

[251]  A. Arvidsson,et al.  Echo Chamber or Public Sphere? Predicting Political Orientation and Measuring Political Homophily in Twitter Using Big Data , 2014 .

[252]  Barry Smyth,et al.  Uncovering Measurements of Social and Demographic Behavior From Smartphone Location Data , 2013, IEEE Transactions on Human-Machine Systems.

[253]  Daniela Moctezuma,et al.  Gender Identification through Multi-modal Tweet Analysis using MicroTC and Bag of Visual Words: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[254]  James J. Bradac,et al.  Empirical Support for the Gender-as-Culture Hypothesis: An Intercultural Analysis of Male/Female Language Differences. , 2001 .

[255]  Jeffrey T. Hancock,et al.  On Lying and Being Lied To: A Linguistic Analysis of Deception in Computer-Mediated Communication , 2007 .

[256]  Marko Dragojevic,et al.  Communication Accommodation Theory , 2015 .

[257]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[258]  George Karypis,et al.  Content-Based Methods for Predicting Web-Site Demographic Attributes , 2010, 2010 IEEE International Conference on Data Mining.

[259]  Keith W. Ross,et al.  Estimating age privacy leakage in online social networks , 2012, 2012 Proceedings IEEE INFOCOM.

[260]  Somnath Banerjee,et al.  Automatic Author Profiling Based on Linguistic and Stylistic Features Notebook for PAN at CLEF 2013 , 2013, CLEF.

[261]  Benno Stein,et al.  Overview of the 4th Author Profiling Task at PAN 2016: Cross-Genre Evaluations , 2016, CLEF.

[262]  D. Rao Detecting Latent User Properties in Social Media , 2010 .

[263]  A. Joinson,et al.  Characterizing the Linguistic Chameleon: Personal and Social Correlates of Linguistic Style Accommodation , 2016 .

[264]  Antoine Boutet,et al.  What's in Your Tweets? I Know Who You Supported in the UK 2010 General Election , 2012, ICWSM.

[265]  Teresa Gonçalves,et al.  Multi-Language Neural Network Model with Advance Preprocessor for Gender Classification over Social Media: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[266]  Eduard H. Hovy,et al.  Weakly Supervised User Profile Extraction from Twitter , 2014, ACL.

[267]  Senja Pollak,et al.  PAN 2017: Author Profiling - Gender and Language Variety Prediction , 2017, CLEF.

[268]  M. Grossman,et al.  Linguistic Aspects of Primary Progressive Aphasia. , 2018, Annual review of linguistics.

[269]  Edson R. D. Weren Information Retrieval Features for Personality Traits , 2015, CLEF.

[270]  Jiliang Tang,et al.  Understanding and Predicting Weight Loss with Mobile Social Networking Data , 2017, CIKM.

[271]  Solee Kim,et al.  An on-device gender prediction method for mobile users using representative wordsets , 2016, Expert Syst. Appl..

[272]  Carlos Sarraute,et al.  Inference of Socioeconomic Status in a Communication Graph , 2016 .

[273]  J. Pennebaker,et al.  Linguistic styles: language use as an individual difference. , 1999, Journal of personality and social psychology.

[274]  Ben Verhoeven,et al.  Gender Profiling for Slovene Twitter communication: the Influence of Gender Marking, Content and Style , 2017, BSNLP@EACL.

[275]  Shlomo Argamon,et al.  Mining the Blogosphere: Age, gender and the varieties of self-expression , 2007, First Monday.

[276]  George Tzanetakis,et al.  Proceedings of the second international ACM workshop on Music information retrieval with user-centered and multimodal strategies , 2011, MM 2011.

[277]  Matthew Purver,et al.  Twitter Language Use Reflects Psychological Differences between Democrats and Republicans , 2015, PloS one.

[278]  P. Howard,et al.  Digital Media and the Arab Spring , 2013 .

[279]  Golnoosh Farnadi,et al.  Age, Gender and Personality Recognition using Tweets in a Multilingual setting , 2015, CLEF 2015.

[280]  Soroush Vosoughi,et al.  Twitter Demographic Classification Using Deep Multi-modal Multi-task Learning , 2017, ACL.

[281]  Rao Muhammad Adeel Nawab,et al.  Predicting an Author's Demographics from Text using Topic Modeling Approach , 2015, CLEF.

[282]  Steven Skiena,et al.  Exact Age Prediction in Social Networks , 2015, WWW.

[283]  Thomas Oshiobughie Ugheoke Detecting the Gender of a Tweet Sender , 2014 .

[284]  Lior Rokach,et al.  Predict Demographic Information Using Word2vec on Spatial Trajectories , 2018, UMAP.

[285]  Ferran Plà,et al.  Segmenting Target Audiences: Automatic Author Profiling using Tweets: Notebook for PAN at CLEF 2015 , 2015, CLEF.

[286]  Jing Tao,et al.  Predicting attributes and friends of mobile users from AP-Trajectories , 2018, Inf. Sci..

[287]  Erez Zadok,et al.  Unifying biological image formats with HDF5 , 2009, CACM.

[288]  George M. Mohay,et al.  Gender-preferential text mining of e-mail discourse , 2002, 18th Annual Computer Security Applications Conference, 2002. Proceedings..

[289]  Arjun Mukherjee,et al.  Improving Gender Classification of Blog Authors , 2010, EMNLP.

[290]  Cecilia Ovesdotter Alm,et al.  Toward inferring the age of Twitter users with their use of nonstandard abbreviations and lexicon , 2014, Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014).

[291]  Tsuhan Chen,et al.  Estimating age, gender, and identity using first name priors , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[292]  Craig H. Martell,et al.  Age Detection in Chat , 2009, 2009 IEEE International Conference on Semantic Computing.

[293]  Antoine Boutet,et al.  Member Classification and Party Characteristics in Twitter during UK Election , 2011 .

[294]  Benno Stein,et al.  Overview of the 3rd Author Profiling Task at PAN 2015 , 2015, CLEF.

[295]  José Palazzo Moreira de Oliveira,et al.  Using Simple Content Features for the Author Profiling Task Notebook for PAN at CLEF 2013 , 2013, CLEF.

[296]  Behram F. T. Mistree,et al.  Gaydar: Facebook Friendships Expose Sexual Orientation , 2009, First Monday.

[297]  Hüseyin Oktay,et al.  Demographic Breakdown of Twitter Users: An analysis based on names , 2014 .

[298]  Alexey Romanov,et al.  Language Variety and Gender Classification for Author Profiling in PAN 2017 , 2017, CLEF.

[299]  Eric Medvet,et al.  An Author Verification Approach Based on Differential Features: Notebook for PAN at CLEF 2015 , 2015, CLEF.

[300]  Michael Granitzer,et al.  INSA LYON and UNI PASSAU's Participation at PAN@CLEF'17: Author Profiling task , 2017, CLEF.

[301]  Malvina Nissim,et al.  N-GrAM: New Groningen Author-profiling Model , 2017, CLEF.

[302]  Shlomo Argamon,et al.  Political Leaning Categorization by Exploring Subjectivities in Political Blogs , 2008, DMIN.

[303]  W. Au,et al.  Effects of Age and Gender on Hand Motion Tasks , 2015, Parkinson's disease.

[304]  Paolo Rosso,et al.  On the Identification of Emotions and Authors' Gender in Facebook Comments on the Basis of their Writing Style , 2013, ESSEM@AI*IA.

[305]  L. Carstensen,et al.  Emotional experience in everyday life across the adult life span. , 2000, Journal of personality and social psychology.

[306]  Hsiu-Yuan Wang,et al.  User acceptance of mobile internet based on the Unified Theory of Acceptance and Use of Technology: Investigating the determinants and gender differences , 2010 .

[307]  Teresa Gonçalves,et al.  Author Profiling using SVMs and Word Embedding Averages , 2016, CLEF.

[308]  Zachary Miller,et al.  Gender Prediction on Twitter Using Stream Algorithms with N-Gram Character Features , 2012 .

[309]  Iryna Gurevych,et al.  Can We Hide in the Web? Large Scale Simultaneous Age and Gender Author Profiling in Social Media Notebook for PAN at CLEF 2013 , 2013, CLEF.

[310]  Aron Culotta,et al.  Inferring latent attributes of Twitter users with label regularization , 2015, NAACL.

[311]  Jyh-Shing Roger Jang,et al.  Gender Identification and Age Estimation of Users Based on Music Metadata , 2014, ISMIR.

[312]  Peiquan Jin,et al.  Predicting Age Range of Users over Microblog Dataset , 2013 .

[313]  Hua Li,et al.  Demographic prediction based on user's browsing behavior , 2007, WWW '07.

[314]  Lamia Hadrich Belguith,et al.  Author Profiling Using Style-based Features Notebook for PAN at CLEF 2013 , 2013, CLEF.

[315]  Son Bao Pham,et al.  Using Content-Based Features for Author Profiling of Vietnamese Forum Posts , 2016 .

[316]  Nikolaos Aletras,et al.  An analysis of the user occupational class through Twitter content , 2015, ACL.

[317]  Wenyi Huang,et al.  Inferring nationalities of Twitter users and studying inter-national linking , 2014, HT.

[318]  Magdalena Jankowska,et al.  CNG Text Classification for Authorship Profiling Task Notebook for PAN at CLEF 2013 , 2013, CLEF.

[319]  Yoram Bachrach,et al.  Studying User Income through Language, Behaviour and Affect in Social Media , 2015, PloS one.

[320]  Adrián Pastor López-Monroy,et al.  A Straightforward Multimodal Approach for Author Profiling: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[321]  Yuchun Guo,et al.  Tags and titles of videos you watched tell your gender , 2014, 2014 IEEE International Conference on Communications (ICC).

[322]  Lamia Hadrich Belguith,et al.  Machine learning for classifying authors of anonymous tweets, blogs, reviews and social media , 2014 .

[323]  Kyumin Lee,et al.  You are where you tweet: a content-based approach to geo-locating twitter users , 2010, CIKM.

[324]  Zachary Miller,et al.  Gender Identification on Twitter Using the Modified Balanced Winnow , 2012 .

[325]  José Carlos González,et al.  DAEDALUS at PAN 2014: Guessing Tweet Author's Gender and Age , 2014, CLEF.

[326]  M. Pinquart,et al.  Human development in times of social change: Theoretical considerations and research needs , 2004 .

[327]  Jing Tao,et al.  Inferring Demographics and Social Networks of Mobile Device Users on Campus From AP-Trajectories , 2017, WWW.

[328]  Chris Pool,et al.  Author Profiling based on Text and Images: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[329]  Detmar W. Straub,et al.  Gender Differences in the Perception and Use of E-Mail: An Extension to the Technology Acceptance Model , 1997, MIS Q..

[330]  Pawel Teisseyre,et al.  What Do Your Look-alikes Say about You? Exploiting Strong and Weak Similarities for Author Profiling , 2015, CLEF.

[331]  Philip S. Yu,et al.  Language independent gender classification on Twitter , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[332]  Teresa Gonçalves,et al.  Multilingual Author Profiling using LSTMs: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[333]  Vasudeva Varma,et al.  Author Profiling: Predicting Age and Gender from Blogs Notebook for PAN at CLEF 2013 , 2013, CLEF.

[334]  Iqra Ameer,et al.  Identification of Author Personality Traits using Stylistic Features: Notebook for PAN at CLEF 2015 , 2015, CLEF.

[335]  Cecilia Ovesdotter Alm,et al.  User-annotated microtext data for modeling and analyzing users' sociolinguistic characteristics and age grading , 2014, 2014 IEEE Eighth International Conference on Research Challenges in Information Science (RCIS).

[336]  M. Williams,et al.  Knowing the Tweeters: Deriving Sociologically Relevant Demographics from Twitter , 2013 .

[337]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[338]  Shrikanth S. Narayanan,et al.  A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle , 2012, ACL.

[339]  M. Saravanan,et al.  Predicting Customer Demographics in a Mobile Social Network , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[340]  A. Smeaton,et al.  On Using Twitter to Monitor Political Sentiment and Predict Election Results , 2011 .

[341]  Qiang Yang,et al.  User demographics prediction based on mobile data , 2013, Pervasive Mob. Comput..

[342]  Hugo Jair Escalante,et al.  INAOE's Participation at PAN'13: Author Profiling Task Notebook for PAN at CLEF 2013 , 2013, CLEF.

[343]  Scott Counts,et al.  The psychology of job loss: using social media data to characterize and predict unemployment , 2016, WebSci.

[344]  Benyuan Liu,et al.  Predicting Flu Trends using Twitter data , 2011, 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[345]  Takahide Hoshide,et al.  Demographic and Psychographic Estimation of Twitter Users Using Social Structures , 2014, Online Social Media Analysis and Visualization.

[346]  Khaled Alrifai,et al.  Arabic Tweeps Gender and Dialect Prediction , 2017, CLEF.

[347]  Qiaozhu Mei,et al.  Classifying the Political Leaning of News Articles and Users from User Votes , 2011, ICWSM.

[348]  Helena Gómez-Adorno,et al.  Language- and Subtask-Dependent Feature Selection and Classifier Parameter Tuning for Author Profiling , 2017, CLEF.

[349]  Graça Bressan,et al.  Age Groups Classification in Social Network Using Deep Learning , 2017, IEEE Access.

[350]  Frank Schweitzer,et al.  Online privacy as a collective phenomenon , 2014, COSN '14.

[351]  Moniek Nieuwenhuis,et al.  Twitter Text and Image Gender Classification with a Logistic Regression N-Gram Model: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[352]  Yaakov HaCohen-Kerner,et al.  Author Profiling: Gender Prediction from Tweets and Images: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[353]  David Yarowsky,et al.  Exploring Demographic Language Variations to Improve Multilingual Sentiment Analysis in Social Media , 2013, EMNLP.

[354]  Julia Baquero,et al.  Author Profiling Using Corpus Statistics, Lexicons and Stylistic Features Notebook for PAN at CLEF-2013 , 2013, CLEF.

[355]  Susana Ladra,et al.  Fast compressed-based strategies for author profiling of social media texts , 2016, CERI.

[356]  Paolo Rosso,et al.  Use of Language and Author Profiling : Identification of Gender and Age , 2013 .

[357]  Keith W. Ross,et al.  What's in a Name: A Study of Names, Gender Inference, and Gender Behavior in Facebook , 2011, DASFAA Workshops.

[358]  L. Youngblade,et al.  Agency and communion attributes in adults’ spontaneous self-representations , 2004, International journal of behavioral development.

[359]  Yassine Benajiba,et al.  Subword-based Deep Averaging Networks for Author Profiling in Social Media , 2017, CLEF.

[360]  Benjamin Van Durme,et al.  Using Conceptual Class Attributes to Characterize Social Media Users , 2013, ACL.

[362]  Carlos Sarraute,et al.  Comparison of Feature Extraction Methods and Predictors for Income Inference , 2018, ArXiv.

[363]  Isabell M. Welpe,et al.  Election Forecasts With Twitter , 2011 .

[364]  Teresa Gonçalves,et al.  Author Profiling Using Support Vector Machines , 2016, CLEF.

[365]  Preslav Nakov,et al.  SU@PAN'2015: Experiments in Author Profiling , 2015, CLEF.

[366]  Aron Culotta,et al.  Predicting Twitter User Demographics using Distant Supervision from Website Traffic Data , 2016, J. Artif. Intell. Res..

[367]  Svitlana Volkova,et al.  On Predicting Sociodemographic Traits and Emotions from Communications in Social Networks and Their Implications to Online Self-Disclosure , 2015, Cyberpsychology Behav. Soc. Netw..

[368]  Leandro Nunes de Castro,et al.  Gender Classification of Twitter Data Based on Textual Meta-Attributes Extraction , 2016, WorldCIST.

[369]  D. Sharma Language and woman’s place: Text and commentaries. Edited by Mary Bucholtz. Oxford: Oxford University Press, 2004. , 2007 .

[370]  Mitchell A. Thornton,et al.  Demographic Group Classification of Smart Device Users , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[371]  Dan Murray,et al.  Inferring Demographic Attributes of Anonymus Internet Users , 1999, WEBKDD.

[372]  David Bamman,et al.  Gender identity and lexical variation in social media , 2012, 1210.4567.

[373]  Yejin Choi,et al.  Gender Attribution: Tracing Stylometric Evidence Beyond Topic and Genre , 2011, CoNLL.

[374]  Ellen M. Voorhees,et al.  Using Replicates in Information Retrieval Evaluation , 2017, ACM Trans. Inf. Syst..

[375]  Pia Pichler The handbook of language and gender , 2005, Language in Society.

[376]  Jennifer Coates Language and Gender: A Reader , 2011 .

[377]  Rajarathnam Chandramouli,et al.  Gender identification from E-mails , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[378]  Vasudeva Varma,et al.  Author Profiling using LDA and Maximum Entropy Notebook for PAN at CLEF 2013 , 2013, CLEF.

[379]  Maarten Sap,et al.  Developing Age and Gender Predictive Lexica over Social Media , 2014, EMNLP.

[380]  Fahad Bin Muhaya,et al.  Estimating Twitter User Location Using Social Interactions--A Content Based Approach , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[381]  Virgílio A. F. Almeida,et al.  Beware of What You Share: Inferring Home Location in Social Networks , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[382]  R. Lakoff Language and woman's place , 1973, Language in Society.

[383]  Diana Inkpen,et al.  Gender Identification in Twitter using N-grams and LSA: Notebook for PAN at CLEF 2018 , 2018, CLEF.

[384]  Golnoosh Farnadi,et al.  Cross-Genre Age and Gender Identification in Social Media , 2016, CLEF.

[385]  Manuel Montes-y-Gómez,et al.  Author Profiling for English and Spanish Text Notebook for PAN at CLEF 2013 , 2013, CLEF.

[386]  Lesly Miculicich Werlen,et al.  Statistical Learning Methods for Profiling Analysis: Notebook for PAN at CLEF 2015 , 2015, CLEF.

[387]  Darnes Vilariño Ayala,et al.  Two Methodologies Applied to the Author Profiling Task , 2013, CLEF.

[388]  Anastasia Krithara,et al.  Author Profiling using Complementary Second Order Attributes and Stylometric Features , 2016, CLEF.

[389]  Eero Hyvönen,et al.  CEUR Workshop Proceedings , 2008 .

[390]  Taha Yasseri,et al.  Early Prediction of Movie Box Office Success Based on Wikipedia Activity Big Data , 2012, PloS one.