Shirtless and Dangerous: Quantifying Linguistic Signals of Gender Bias in an Online Fiction Writing Community

Imagine a princess asleep in a castle, waiting for her prince to slay the dragon and rescue her. Tales like the famous Sleeping Beauty clearly divide up gender roles. But what about more modern stories, borne of a generation increasingly aware of social constructs like sexism and racism? Do these stories tend to reinforce gender stereotypes, or counter them? In this paper, we present a technique that combines natural language processing with a crowdsourced lexicon of stereotypes to capture gender biases in fiction. We apply this technique across 1.8 billion words of fiction from the Wattpad online writing community, investigating gender representation in stories, how male and female characters behave and are described, and how authors' use of gender stereotypes is associated with the community's ratings. We find that male over-representation and traditional gender stereotypes (e.g., dominant men and submissive women) are common throughout nearly every genre in our corpus. However, only some of these stereotypes, like sexual or violent men, are associated with highly rated stories. Finally, despite women often being the target of negative stereotypes, female authors are equally likely to write such stereotypes as men.

[1]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[2]  Kevin F. Hallock,et al.  The Gender Gap in Top Corporate Jobs , 2000 .

[3]  A. Gooden,et al.  Gender Representation in Notable Children's Picture Books: 1995–1999 , 2001 .

[4]  J. Pennebaker,et al.  PERSONALITY PROCESSES AND INDIVIDUAL DIFFERENCES Words of Wisdom: Language Use Over the Life Span , 2003 .

[5]  T. Zimmerman,et al.  Images of Gender, Race, Age, and Sexual Orientation in Disney Feature-Length AnimatedFilms , 2004 .

[6]  Danielle M. Soulliere Wrestling with Masculinity: Messages about Manhood in the WWE , 2006 .

[7]  T. Zimmerman,et al.  A Feminist Analysis of Popular Music , 2007 .

[8]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[9]  M. Lauzen,et al.  Constructing Gender Stereotypes Through Social Roles in Prime-Time Television , 2008 .

[10]  Panagiotis G. Ipeirotis,et al.  Get another label? improving data quality and data mining using multiple, noisy labelers , 2008, KDD.

[11]  Marti A. Hearst,et al.  Assessing attractiveness in online dating profiles , 2008, CHI.

[12]  P. Emons,et al.  “He Works Outside the Home; She Drinks Coffee and Does the Dishes” Gender Roles in Fiction Programs on Dutch Television , 2010 .

[13]  Cynthia Carter,et al.  Women and news: A long and winding road , 2011 .

[14]  Eric Gilbert,et al.  Phrases that signal workplace hierarchy , 2012, CSCW.

[15]  Jennifer Jenson,et al.  What's 'choice' got to do with it?: avatar selection differences between novice and expert players of World of Warcraft and Rift , 2012, FDG.

[16]  Jahna Otterbacher,et al.  Gender, writing and ranking in review forums: a case study of the IMDb , 2013, Knowledge and Information Systems.

[17]  Jessica Rose,et al.  Face it: The Impact of Gender on Social Media Images , 2012 .

[18]  D. Barker Global gender disparities in science , 2013 .

[19]  V. Larivière Global gender disparities in science , 2013 .

[20]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[21]  Saeideh Bakhshi,et al.  "I need to try this"?: a statistical overview of pinterest , 2013, CHI.

[22]  Virgílio A. F. Almeida,et al.  Ladies First: Analyzing Gender Roles and Behaviors in Pinterest , 2013, ICWSM.

[23]  David García,et al.  Gender Asymmetries in Reality and Fiction: The Bechdel Test of Social Media , 2014, ICWSM.

[24]  Jeffrey T. Hancock,et al.  Experimental evidence of massive-scale emotional contagion through social networks , 2014, Proceedings of the National Academy of Sciences.

[25]  Michael S. Bernstein,et al.  We Are Dynamo: Overcoming Stalling and Friction in Collective Action for Crowd Workers , 2015, CHI.

[26]  David García,et al.  It's a Man's Wikipedia? Assessing Gender Inequality in an Online Encyclopedia , 2015, ICWSM.

[27]  Nello Cristianini,et al.  Measuring Gender Bias in News Images , 2015, WWW.

[28]  Sean A. Munson,et al.  Unequal Representation and Gender Stereotypes in Image Search Results for Occupations , 2015, CHI.

[29]  Jahna Otterbacher,et al.  Crowdsourcing Stereotypes: Linguistic Bias in Metadata Generated via GWAP , 2015, CHI.

[30]  C. Martin 2015 , 2015, Les 25 ans de l’OMC: Une rétrospective en photos.

[31]  Michael S. Bernstein,et al.  Empath: Understanding Topic Signals in Large-Scale Text , 2016, CHI.

[32]  S. Hewitt,et al.  2008 , 2018, Los 25 años de la OMC: Una retrospectiva fotográfica.

[33]  2001 , 2018, Wild Onion Nurse.