Mitigating dataset harms requires stewardship: Lessons from 1000 papers

Machine learning datasets have elicited concerns about privacy, bias, and unethical applications, leading to the retraction of prominent datasets such as DukeMTMC, MS-Celeb-1M, and Tiny Images. In response, the machine learning community has called for higher ethical standards in dataset creation. To help inform these efforts, we studied three influential but ethically problematic face and person recognition datasets—Labeled Faces in the Wild (LFW), MS-Celeb-1M, and DukeMTMC— by analyzing nearly 1000 papers that cite them. We found that the creation of derivative datasets and models, broader technological and social change, the lack of clarity of licenses, and dataset management practices can introduce a wide range of ethical concerns. We conclude by suggesting a distributed approach to harm mitigation that considers the entire life cycle of a dataset.

[1]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[2]  Cordelia Schmid,et al.  Is that you? Metric learning approaches for face identification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[3]  Omkar M. Parkhi,et al.  VGGFace2: A Dataset for Recognising Faces across Pose and Age , 2017, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[4]  David J. Kriegman,et al.  From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Shaogang Gong,et al.  Unsupervised Tracklet Person Re-Identification , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Arvind Narayanan,et al.  Privacy, Ethics, and Data Access: A Case Study of the Fragile Families Challenge , 2018, Socius : sociological research for a dynamic world.

[7]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[8]  Luc Van Gool,et al.  Real-time facial feature detection using conditional regression forests , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Amandalynne Paullada,et al.  Data and its (dis)contents: A survey of dataset development and use in machine learning research , 2020, Patterns.

[10]  Timnit Gebru,et al.  Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification , 2018, FAT.

[11]  Mei Wang,et al.  Racial Faces in the Wild: Reducing Racial Bias by Information Maximization Adaptation Network , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[12]  Emily M. Bender,et al.  Data Statements for Natural Language Processing: Toward Mitigating System Bias and Enabling Better Science , 2018, TACL.

[13]  Liang Zheng,et al.  Improving Person Re-identification by Attribute and Identity Learning , 2017, Pattern Recognit..

[14]  Octavia I. Camps,et al.  DukeMTMC4ReID: A Large-Scale Multi-camera Person Re-identification Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[15]  Emily Denton,et al.  Bringing the People Back In: Contesting Benchmark Machine Learning Datasets , 2020, ArXiv.

[16]  Paul T. Groth,et al.  FAIR Data Reuse – the Path through Data Citation , 2020, Data Intelligence.

[17]  Stefanos Zafeiriou,et al.  Deformable Models of Ears in-the-Wild for Alignment and Recognition , 2017, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[18]  Ke Lu,et al.  Group Re-Identification: Leveraging and Integrating Multi-Grain Information , 2018, ACM Multimedia.

[19]  F. Quimby What's in a picture? , 1993, Laboratory animal science.

[20]  Tal Hassner,et al.  Effective Unconstrained Face Recognition by Combining Multiple Descriptors and Learned Background Statistics , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Yu Wu,et al.  Pose-Guided Feature Alignment for Occluded Person Re-Identification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[22]  Brendan T. O'Connor,et al.  Racial Disparity in Natural Language Processing: A Case Study of Social Media African-American English , 2017, ArXiv.

[23]  John R. Smith,et al.  Diversity in Faces , 2019, ArXiv.

[24]  Anil K. Jain,et al.  Unconstrained face recognition: Establishing baseline human performance via crowdsourcing , 2014, IEEE International Joint Conference on Biometrics.

[25]  Inioluwa Deborah Raji,et al.  About Face: A Survey of Facial Recognition Evaluation , 2021, ArXiv.

[26]  H. Sox,et al.  Research Misconduct, Retraction, and Cleansing the Medical Literature: Lessons from the Poehlman Case , 2006, Annals of Internal Medicine.

[27]  Debing Zhang,et al.  Lightweight Face Recognition Challenge , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[28]  Sebastian Benthall,et al.  Racial categories in machine learning , 2018, FAT.

[29]  Jodi Schneider,et al.  Continued post-retraction citation of a fraudulent clinical trial report, 11 years after it was retracted for falsifying data , 2020, Scientometrics.

[30]  Shan Li,et al.  Deep Facial Expression Recognition: A Survey , 2018, IEEE Transactions on Affective Computing.

[31]  Joseph Paul Cohen,et al.  Academic Torrents: A Community-Maintained Distributed Repository , 2014, XSEDE '14.

[32]  Jaime A. Teixeira da Silva,et al.  Why do some retracted papers continue to be cited? , 2016, Scientometrics.

[33]  Helen Nissenbaum,et al.  An Ethical Highlighter for People-Centric Dataset Creation , 2020, ArXiv.

[34]  Albert Gordo,et al.  Towards Measuring Fairness in AI: The Casual Conversations Dataset , 2021, IEEE Transactions on Biometrics, Behavior, and Identity Science.

[35]  Jian Sun,et al.  Face Alignment Via Component-Based Discriminative Search , 2008, ECCV.

[36]  Shiguang Shan,et al.  Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Olga Russakovsky,et al.  REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets , 2020, International Journal of Computer Vision.

[38]  Abeba Birhane,et al.  Algorithmic Injustices: Towards a Relational Ethics , 2019, ArXiv.

[39]  Xiaogang Wang,et al.  Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[40]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[41]  Christopher Joseph Pal,et al.  Towards Standardization of Data Licenses: The Montreal Data License , 2019, ArXiv.

[42]  Shengcai Liao,et al.  A benchmark study of large-scale unconstrained face recognition , 2014, IEEE International Joint Conference on Biometrics.

[43]  Vibhaakar Sharma,et al.  Masked Face Recognition , 2021, International Journal of Advanced Research in Science, Communication and Technology.

[44]  Aylin Caliskan,et al.  Image Representations Learned With Unsupervised Pre-Training Contain Human-like Biases , 2020, FAccT.

[45]  Antonio Torralba,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition , 2022 .

[46]  Alberto Ruano-Ravina,et al.  Does retraction after misconduct have an impact on citations? A pre–post study , 2020, BMJ Global Health.

[47]  Yuxiao Hu,et al.  MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition , 2016, ECCV.

[48]  Alberto Del Bimbo,et al.  Effective 3D based frontalization for unconstrained face recognition , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[49]  Brian C. Lovell,et al.  Multi-Region Probabilistic Histograms for Robust and Scalable Identity Inference , 2009, ICB.

[50]  Xin Zheng,et al.  A Survey of Deep Facial Attribute Analysis , 2018, International Journal of Computer Vision.

[51]  Erik G. Learned-Miller,et al.  Unsupervised Joint Alignment of Complex Images , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[52]  Carlos D. Castillo,et al.  Deep Features for Recognizing Disguised Faces in the Wild , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[53]  Vinay Uday Prabhu,et al.  Large image datasets: A pyrrhic win for computer vision? , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[54]  Inioluwa Deborah Raji,et al.  Saving Face: Investigating the Ethical Concerns of Facial Recognition Auditing , 2020, AIES.

[55]  Yu Wu,et al.  Exploit the Unknown Gradually: One-Shot Video-Based Person Re-identification by Stepwise Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[56]  Judit Bar-Ilan,et al.  Temporal characteristics of retracted articles , 2018, Scientometrics.

[57]  Ahmed Hosny,et al.  The Dataset Nutrition Label: A Framework To Drive Higher Data Quality Standards , 2018, Data Protection and Privacy.

[58]  Xiaogang Wang,et al.  Deep Convolutional Network Cascade for Facial Point Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[59]  Yun Fu,et al.  One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision , 2021, FAccT.

[60]  Tieniu Tan,et al.  A Light CNN for Deep Face Representation With Noisy Labels , 2015, IEEE Transactions on Information Forensics and Security.

[61]  Helmar Bornemann-Cimenti,et al.  Perpetuation of Retracted Publications Using the Example of the Scott S. Reuben Case: Incidences, Reasons and Possible Improvements , 2016, Sci. Eng. Ethics.

[62]  Honglak Lee,et al.  Learning to Align from Scratch , 2012, NIPS.

[63]  Emily Denton,et al.  Towards a critical race methodology in algorithmic fairness , 2019, FAT*.

[64]  Mei Wang,et al.  Deep Face Recognition: A Survey , 2018, Neurocomputing.

[65]  Grace Abuhamad,et al.  Like a Researcher Stating Broader Impact For the Very First Time , 2020, ArXiv.

[66]  Stefanos Zafeiriou,et al.  ArcFace: Additive Angular Margin Loss for Deep Face Recognition , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[67]  Chris Chambers,et al.  What next for registered reports , 2018, Nature Human Behaviour.

[68]  Marwan Mattar,et al.  Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .

[69]  Arvind Narayanan,et al.  Semantics derived automatically from language corpora contain human-like biases , 2016, Science.

[70]  Xiaoou Tang,et al.  Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[71]  Jian Sun,et al.  Face recognition with learning-based descriptor , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[72]  Enrique G. Ortiz,et al.  Evaluating Open-Universe Face Identification on the Web , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[73]  Timnit Gebru,et al.  Lessons from archives: strategies for collecting sociocultural data in machine learning , 2019, FAT*.

[74]  Tal Hassner,et al.  Effective face frontalization in unconstrained images , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[75]  Takeo Kanade,et al.  Multi-PIE , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[76]  Stefanos Zafeiriou,et al.  Marginal Loss for Deep Face Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[77]  Yi Yang,et al.  Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in Vitro , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[78]  Rachel K. E. Bellamy,et al.  AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias , 2018, ArXiv.

[79]  Hao Wu,et al.  Masked Face Recognition Dataset and Application , 2020, ArXiv.

[80]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[81]  Inioluwa Deborah Raji,et al.  Model Cards for Model Reporting , 2018, FAT.

[82]  Francesco Solera,et al.  Performance Measures and a Data Set for Multi-target, Multi-camera Tracking , 2016, ECCV Workshops.

[83]  Li Fei-Fei,et al.  A Study of Face Obfuscation in ImageNet , 2021, ICML.

[84]  Andrew Y. Ng,et al.  End-to-End People Detection in Crowded Scenes , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[85]  Shree K. Nayar,et al.  Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[86]  Timnit Gebru,et al.  Datasheets for datasets , 2018, Commun. ACM.