Gender differences in participation and reward on Stack Overflow

Programming is a valuable skill in the labor market, making the underrepresentation of women in computing an increasingly important issue. Online question and answer platforms serve a dual purpose in this field: they form a body of knowledge useful as a reference and learning tool, and they provide opportunities for individuals to demonstrate credible, verifiable expertise. Issues, such as male-oriented site design or overrepresentation of men among the site’s elite may therefore compound the issue of women’s underrepresentation in IT. In this paper we audit the differences in behavior and outcomes between men and women on Stack Overflow, the most popular of these Q&A sites. We observe significant differences in how men and women participate in the platform and how successful they are. For example, the average woman has roughly half of the reputation points, the primary measure of success on the site, of the average man. Using an Oaxaca-Blinder decomposition, an econometric technique commonly applied to analyze differences in wages between groups, we find that most of the gap in success between men and women can be explained by differences in their activity on the site and differences in how these activities are rewarded. Specifically, 1) men give more answers than women and 2) are rewarded more for their answers on average, even when controlling for possible confounders such as tenure or buy-in to the site. Women ask more questions and gain more reward per question. We conclude with a hypothetical redesign of the site’s scoring system based on these behavioral differences, cutting the reputation gap in half.

[1]  Premkumar T. Devanbu,et al.  How social Q&A sites are changing knowledge sharing in open source software communities , 2014, CSCW.

[2]  Teresa Lynch,et al.  Ten years of strategies to increase participation of women in computing programs: the Central Queensland University experience: 1999--2001 , 2002, SGCS.

[3]  Kakoli Roy,et al.  Examining the role of gender in career advancement at the Centers for Disease Control and Prevention. , 2010, American journal of public health.

[4]  Linda J. Sax,et al.  Women planning to major in computer science: Who are they and what makes them unique? , 2016, Comput. Sci. Educ..

[5]  L. R. Shade High noon on the electronic frontier: Conceptual issues in cyberspace , 1997 .

[6]  Latanya Sweeney,et al.  Discrimination in online ad delivery , 2013, CACM.

[7]  A. Blinder Wage Discrimination: Reduced Form and Structural Estimates , 1973 .

[8]  Margaret M. Burnett,et al.  Gender: An Important Factor in End-User Programming Environments? , 2004, 2004 IEEE Symposium on Visual Languages - Human Centric Computing.

[9]  Manju K. Ahuja Women in the information technology profession: a literature review, synthesis and research agenda , 2002, Eur. J. Inf. Syst..

[10]  Sean J. Taylor,et al.  Social Influence Bias: A Randomized Experiment , 2013, Science.

[11]  Alexander Schill,et al.  Workplace Psychology and Gamification: Theory and Application , 2015 .

[12]  Monica Stephens Gender and the GeoWeb: divisions in the production of user-generated cartographic information , 2013, GeoJournal.

[13]  William Aspray,et al.  Women and Information Technology : Research on Underrepresentation , 2010 .

[14]  David Neumark,et al.  Employers' Discriminatory Behavior and the Estimation of Wage Discrimination , 1987 .

[15]  Karrie Karahalios,et al.  Auditing Algorithms : Research Methods for Detecting Discrimination on Internet Platforms , 2014 .

[16]  Chris Parnin,et al.  "We Don't Do That Here": How Collaborative Editing with Mentors Improves Engagement in Social Q&A Communities , 2018, CHI.

[17]  Margaret M. Burnett,et al.  Tinkering and gender in end-user programmers' debugging , 2006, CHI.

[18]  Jim Hamilton,et al.  Are There Gender Differences in Professional Self-Promotion? An Empirical Case Study of LinkedIn Profiles Among Recent MBA Graduates , 2017, ICWSM.

[19]  Mohsen Jadidi,et al.  Gender Disparities in Science? Dropout, Productivity, Collaborations and Success of Male and Female Computer Scientists , 2017, Adv. Complex Syst..

[20]  Amanda Menking,et al.  The Heart Work of Wikipedia: Gendered, Emotional Labor in the World's Largest Online Encyclopedia , 2015, CHI.

[21]  Emerson Murphy-Hill,et al.  Gender differences and bias in open source: pull request acceptance of women versus men , 2017, PeerJ Comput. Sci..

[22]  Christopher R. Knittel,et al.  Racial and Gender Discrimination in Transportation Network Companies , 2016 .

[23]  David Lo,et al.  Multi-Factor Duplicate Question Detection in Stack Overflow , 2015, Journal of Computer Science and Technology.

[24]  B. Sen,et al.  Using the oaxaca–blinder decomposition as an empirical tool to analyze racial disparities in obesity , 2014, Obesity.

[25]  Loren G. Terveen,et al.  Avoiding the South Side and the Suburbs: The Geography of Mobile Crowdsourcing Markets , 2015, CSCW.

[26]  Eduardo Graells-Garrido,et al.  Women through the glass ceiling: gender asymmetries in Wikipedia , 2016, EPJ Data Science.

[27]  R. Oaxaca Male-Female Wage Differentials in Urban Labor Markets , 1973 .

[28]  R. Blanchard,et al.  Fraternal Birth Order, Family Size, and Male Homosexuality: Meta-Analysis of Studies Spanning 25 Years , 2018, Archives of sexual behavior.

[29]  Kathleen J. Lehman,et al.  Anatomy of an Enduring Gender Gap: The Evolution of Women’s Participation in Computer Science , 2017 .

[30]  J. Bentley,et al.  Gender Differences in the Careers of Academic Scientists and Engineers: A Literature Review. Special Report. , 2003 .

[31]  Muriel Niederle,et al.  Do Women shy away from Competition , 2004 .

[32]  Anat Ben-David,et al.  Platform Inequality: Gender in the Gig-Economy , 2017 .

[33]  Michael Szell,et al.  How women organize social networks different from men , 2012, Scientific Reports.

[34]  Markus Strohmaier,et al.  Inferring Gender from Names on the Web: A Comparative Evaluation of Gender Detection Methods , 2016, WWW.

[35]  Josh Lerner,et al.  The Simple Economics of Open Source , 2000 .

[36]  Bálint Daróczy,et al.  Why Do Men Get More Attention? Exploring Factors Behind Success in An Online Design Community , 2017, ICWSM.

[37]  Premkumar T. Devanbu,et al.  Gender and Tenure Diversity in GitHub Teams , 2015, CHI.

[38]  Amy Bruckman,et al.  Gender Swapping on the Internet , 1993 .

[39]  Aaron D. Shaw,et al.  The Pipeline of Online Participation Inequalities: The Case of Wikipedia Editing , 2018 .

[40]  Julita Vassileva,et al.  Does gamification work for boys and girls?: An exploratory study with a virtual learning environment , 2015, SAC.

[41]  Eryk Kopczynski,et al.  Programming Languages in GitHub: A Visualization in Hyperbolic Plane , 2017, ICWSM.

[42]  Magnus Lindelow,et al.  Analyzing Health Equity Using Household Survey Data: A Guide to Techniques and Their Implementation , 2007 .

[43]  Alexander Serebrenik,et al.  StackOverflow and GitHub: Associations between Software Development and Crowdsourced Knowledge , 2013, 2013 International Conference on Social Computing.

[44]  Joseph M. Reagle,et al.  Gender Bias in Wikipedia and Britannica , 2011 .

[45]  Alexander Serebrenik,et al.  Gender, Representation and Online Participation: A Quantitative Study of StackOverflow , 2012, SocialInformatics.

[46]  Jeffrey C. Carver,et al.  Building reputation in StackOverflow: An empirical investigation , 2013, 2013 10th Working Conference on Mining Software Repositories (MSR).

[47]  Francine D. Blau,et al.  The Gender Wage Gap: Extent, Trends, and Explanations , 2016, SSRN Electronic Journal.

[48]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[49]  Christopher T. Stanton,et al.  Landing the First Job: The Value of Intermediaries in Online Hiring , 2014 .

[50]  Alexander Serebrenik,et al.  Gender, Representation and Online Participation: A Quantitative Study of StackOverflow , 2012, 2012 International Conference on Social Informatics.

[51]  Marián Boguñá,et al.  Extracting the multiscale backbone of complex weighted networks , 2009, Proceedings of the National Academy of Sciences.

[52]  Robert Epstein,et al.  Estimates of Non-Heterosexual Prevalence: The Roles of Anonymity and Privacy in Survey Methodology , 2018, Archives of sexual behavior.

[53]  Chris Parnin,et al.  Someone like me: How does peer parity influence participation of women on stack overflow? , 2017, 2017 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC).

[54]  David García,et al.  Bias in Online Freelance Marketplaces: Evidence from TaskRabbit and Fiverr , 2017, CSCW.

[55]  Jure Leskovec,et al.  Discovering value from community activity on focused question answering sites: a case study of stack overflow , 2012, KDD.

[56]  Andrei Cimpian,et al.  Expectations of brilliance underlie gender distributions across academic disciplines , 2015, Science.

[57]  John Riedl,et al.  WP:clubhouse?: an exploration of Wikipedia's gender imbalance , 2011, Int. Sym. Wikis.