Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing

Dual use, the intentional, harmful reuse of technology and scientific artefacts, is a problem that has yet to be well defined within the context of Natural Language Processing (NLP). However, as NLP technologies continue to advance and become increasingly widespread in society, their inner workings have become increasingly opaque. Therefore, understanding dual use concerns and potential ways of limiting them is critical to minimising the potential harms of research and development. In this paper, we conduct a survey of NLP researchers and practitioners to understand the depth of the problem and their perspective on it, as well as to assess the support currently available to them. Based on the results of our survey, we offer a definition of dual use that is tailored to the needs of the NLP community. The survey revealed that a majority of researchers are concerned about the potential dual use of their research but take only limited action to address it. In light of the survey results, we discuss the current state of, and potential means for, mitigating dual use in NLP and propose a checklist that can be integrated into existing conference ethics frameworks, e.g., the ACL ethics checklist.
