Integrating Ethics within Machine Learning Courses

This article identifies and addresses opportunities for integrating ethics into machine learning (ML) courses. Following a survey of the history of computing ethics and the current need for ethical consideration within ML, we examine the current state of ML ethics education through an exploratory analysis of course syllabi in computing programs. The results reveal that although ethics is part of the overall educational landscape in these programs, it is not often part of core technical ML courses. To help address this gap, we offer a preliminary framework, developed through a systematic literature review, of relevant ethics questions that should be addressed within an ML project. A pilot study with 85 students confirms that this framework helped them identify and articulate key ethical considerations within their ML projects. Building on this work, we also provide three example ML course modules that bring ethical thinking directly into the learning of core ML content. Collectively, this research demonstrates: (1) the need for ethics to be taught as an integrated part of ML coursework, (2) a structured set of questions useful for identifying and addressing potential issues within an ML project, and (3) novel course models that show how to teach ML ethics in practice without sacrificing core course content. An additional by-product of this research is the collection and integration of recent publications in the emerging field of ML ethics education.
