Comparison and benchmark of name-to-gender inference services

The increased interest in analyzing and explaining gender inequalities in tech, media, and academia highlights the need for accurate inference methods to predict a person’s gender from their name. Several such services exist that provide access to large databases of names, often enriched with information from social media profiles, culture-specific rules, and insights from sociolinguistics. We compare and benchmark five nameto-gender inference services by applying them to the classification of a test data set consisting of 7,076 manually labeled names. The compiled names are analyzed and characterized according to their geographical and cultural origin. We define a series of performance metrics to quantify various types of classification errors, and define a parameter tuning procedure to search for optimal values of the services’ free parameters. Finally, we perform benchmarks of all services under study regarding several scenarios where a particular metric is to be optimized. Subjects Data Mining and Machine Learning, Data Science, Databases, Digital Libraries

[1]  Adriana Valente,et al.  Scientific and Technological Performance by Gender , 2004 .

[2]  Marco Tullney,et al.  The Effect of Gender in the Publication Patterns in Mathematics , 2016, PloS one.

[3]  J. Nathan Matias,et al.  FollowBias: Supporting Behavior Change toward Gender Equality by Networked Gatekeepers on Social Media , 2017, CSCW.

[4]  Alexander Serebrenik,et al.  A Data Set for Social Diversity Studies of GitHub Teams , 2015, 2015 IEEE/ACM 12th Working Conference on Mining Software Repositories.

[5]  Cameron Blevins,et al.  Jane, John ... Leslie? A Historical Method for Algorithmic Gender Prediction , 2015, Digit. Humanit. Q..

[6]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[7]  Giovanni Filardo,et al.  Trends and comparison of female first authorship in high impact medical journals: observational study (1994-2014) , 2016, British Medical Journal.

[8]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[9]  Hanne Jørndrup,et al.  Who Makes The News?: Denmark - Global Media Monitoring Project 2015 National Report , 2015 .

[10]  Mohak Shah,et al.  Evaluating Learning Algorithms: Contents , 2011 .

[11]  J. Lisman,et al.  Serial representation of items during working memory maintenance at letter-selective cortical sites , 2017, bioRxiv.

[12]  Hariolf Grupp,et al.  Gender-specific patterns in patenting and publishing , 2009 .

[13]  Cindy E. Hauser,et al.  The gender gap in science: How long until women are equally represented? , 2018, PLoS biology.

[14]  Kevin S. Bonham,et al.  Women are underrepresented in computational biology: An analysis of the scholarly literature in biology, computer science and computational biology , 2017, PLoS Comput. Biol..

[15]  Kamil Wais,et al.  Gender Prediction Methods Based on First Names with genderizeR , 2016, R J..

[16]  Markus Strohmaier,et al.  Inferring Gender from Names on the Web: A Comparative Evaluation of Gender Detection Methods , 2016, WWW.

[17]  Carl T. Bergstrom,et al.  The Role of Gender in Scholarly Authorship , 2012, PloS one.

[18]  Alicia Karspeck,et al.  The use of ambient humidity conditions to improve influenza forecast , 2017, PLoS Comput. Biol..

[19]  E. Müller,et al.  Biological Maturity Status Strongly Intensifies the Relative Age Effect in Alpine Ski Racing , 2016, PloS one.

[20]  Cassidy R. Sugimoto,et al.  Bibliometrics: Global gender disparities in science , 2013, Nature.

[21]  M. HamidR.Jamali,et al.  Iranian women in science: a gender study of scientific productivity in an Islamic country , 2008, Aslib Proc..