Pretrained AI Models: Performativity, Mobility, and Change

The paradigm of pretrained deep learning models has recently emerged in artificial intelligence practice, allowing deployment in numerous societal settings with limited computational resources, but also embedding biases and enabling unintended negative uses. In this paper, we treat pretrained models as objects of study and discuss the ethical impacts of their sociological position. We discuss how pretrained models are developed and compared under the common task framework, but that this may make self-regulation inadequate. Further how pretrained models may have a performative effect on society that exacerbates biases. We then discuss how pretrained models move through actor networks as a kind of computationally immutable mobile, but that users also act as agents of technological change by reinterpreting them via fine-tuning and transfer. We further discuss how users may use pretrained models in malicious ways, drawing a novel connection between the responsible innovation and user-centered innovation literatures. We close by discussing how this sociological understanding of pretrained models can inform AI governance frameworks for fairness, accountability, and transparency.

[1]  T. Long,et al.  RÉFLEXIONS SUR LA PUISSANCE MOTRICE DU FEU, ET SUR LES MACHINES PROPRES A DÉVELOPPER CETTE PUISSANCE. , 1903 .

[2]  T. Good,et al.  The Self-Fulfilling Prophecy. , 1971 .

[3]  K. Arrow The Theory of Discrimination , 1971 .

[4]  M. Spence Job Market Signaling , 1973 .

[5]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[6]  B. Barnes Social Life as Bootstrapped Induction , 1983 .

[7]  R. Westrum The Social Construction of Technological Systems , 1989 .

[8]  J. E. Groves,et al.  Made in America: Science, Technology and American Modernist Poets , 1989 .

[9]  Carl W. Condit,et al.  Nature's Metropolis: Chicago and the Great West , 1991 .

[10]  Stephen Coate,et al.  Will Affirmative-Action Policies Eliminate Negative Stereotypes? , 1993 .

[11]  William M. Riggs,et al.  Incentives to innovate and the sources of innovation: the case of scientific instruments☆ , 1994 .

[12]  T. Pinch,et al.  Users as Agents of Technological Change: The Social Construction of the Automobile in the Rural United States , 1996, Technology and Culture.

[13]  Optima for Animals: Revised Edition , 1997 .

[14]  U. Neisser RISING SCORES ON INTELLIGENCE TESTS , 1997 .

[15]  R. Netz The Shaping of Deduction in Greek Mathematics: A Study in Cognitive History , 1999 .

[16]  J. Knowles,et al.  Racial Bias in Motor Vehicle Searches: Theory and Evidence , 1999, Journal of Political Economy.

[17]  László A. Székely,et al.  Inverting Random Functions II: Explicit Bounds for Discrete Maximum Likelihood Estimation, with Applications , 2002, SIAM J. Discret. Math..

[18]  E. A. Locke,et al.  Building a practically useful theory of goal setting and task motivation. A 35-year odyssey. , 2002, The American psychologist.

[19]  Trevor Pinch,et al.  How users and non-users matter , 2003 .

[20]  M. Hajer Policy without polity? Policy analysis and the institutional void , 2003 .

[21]  David Kaiser Drawing Theories Apart: The Dispersion of Feynman Diagrams in Postwar Physics , 2005 .

[22]  Carliss Y. Baldwin,et al.  How User Innovations Become Commercial Products: A Theoretical Investigation and Case Study , 2006 .

[23]  Sonali K. Shah From Innovation to Firm Formation: Contributions by Sports Enthusiasts to the Windsurfing, Snowboarding & Skateboarding Industries , 2006 .

[24]  J. Krige How Users Matter: The Co-Construction of Users and Technology , 2006 .

[25]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[26]  D. MacKenzie An Engine, Not a Camera: How Financial Models Shape Markets , 2006 .

[27]  Jock Given,et al.  The wealth of networks: How social production transforms markets and freedom , 2007, Inf. Econ. Policy.

[28]  D. Endy,et al.  Refinement and standardization of synthetic biological parts and devices , 2008, Nature Biotechnology.

[29]  Itamar Arel,et al.  Beyond the Turing Test , 2009, Computer.

[30]  M. Callon,et al.  Acting in an Uncertain World: An Essay on Technical Democracy , 2009 .

[31]  Risto Karinen,et al.  Toward Anticipatory Governance: The Experience with Nanotechnology , 2009 .

[32]  M. Schweitzer,et al.  Goals Gone Wild: The Systematic Side Effects of Overprescribing Goal Setting , 2009 .

[33]  Jacob G. Foster,et al.  Metaknowledge , 2011, Science.

[34]  Emilio Frazzoli,et al.  High-speed flight in an ergodic forest , 2012, 2012 IEEE International Conference on Robotics and Automation.

[35]  David Kaiser,et al.  Dual-use research: Self-censorship is not enough , 2012, Nature.

[36]  Bruno Latour,et al.  Visualisation and Cognition: Drawing Things Together , 2012 .

[37]  Sameep Mehta,et al.  Efficient multifaceted screening of job applicants , 2013, EDBT '13.

[38]  E. Hippel,et al.  User Community vs. Producer Innovation Development Efficiency: A First Empirical Study , 2013 .

[39]  J. Stilgoe,et al.  Developing a framework for responsible innovation* , 2013, The Ethics of Nanotechnology, Geoengineering and Clean Energy.

[40]  Miles Brundage,et al.  Artificial Intelligence and Responsible Innovation , 2013, PT-AI.

[41]  William H. Pierce Failure-Tolerant Computer Design , 2014 .

[42]  Oren Etzioni,et al.  My Computer Is an Honor Student - but How Intelligent Is It? Standardized Tests as a Measure of AI , 2016, AI Mag..

[43]  H. Sapolsky The Politics of Risk , 2016 .

[44]  Murray Campbell,et al.  I-athlon: Towards A Multidimensional Turing Test , 2016, AI Mag..

[45]  L. Floridi Faultless responsibility: on the nature and allocation of moral responsibility for distributed moral actions , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[46]  J. Hanley,et al.  Association of Off-label Drug Use and Adverse Drug Events in an Adult Population. , 2016, JAMA internal medicine.

[47]  E. von Hippel Free Innovation , 2016 .

[48]  Stuart M. Shieber Principles for Designing an AI Competition, or Why the Turing Test Fails as an Inducement Prize , 2016, AI Mag..

[49]  Lav R. Varshney,et al.  Fundamental Limits of Data Analytics in Sociotechnical Systems , 2016, Front. ICT.

[50]  Jieyu Zhao,et al.  Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints , 2017, EMNLP.

[51]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  A. Kerr,et al.  The limits of responsible innovation: Exploring care, vulnerability and precision medicine , 2017 .

[53]  Richard Socher,et al.  Learned in Translation: Contextualized Word Vectors , 2017, NIPS.

[54]  Kaiming He,et al.  Exploring the Limits of Weakly Supervised Pretraining , 2018, ECCV.

[55]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[56]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[57]  Omer Levy,et al.  GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding , 2018, BlackboxNLP@EMNLP.

[58]  Richard Socher,et al.  The Natural Language Decathlon: Multitask Learning as Question Answering , 2018, ArXiv.

[59]  Hyrum S. Anderson,et al.  The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation , 2018, ArXiv.

[60]  Yiling Chen,et al.  A Short-term Intervention for Long-term Fairness in the Labor Market , 2017, WWW.

[61]  Rachel K. E. Bellamy,et al.  AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias , 2018, ArXiv.

[62]  Roel Dobbe,et al.  A Broader View on Bias in Automated Decision-Making: Reflecting on Epistemology and Dynamics , 2018, ArXiv.

[63]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[64]  R. V. Schomberg Why responsible innovation? , 2019, International Handbook on Responsible Innovation.

[65]  Jimmy J. Lin,et al.  End-to-End Open-Domain Question Answering with BERTserini , 2019, NAACL.

[66]  Nathan Srebro,et al.  From Fair Decision Making To Social Equality , 2018, FAT.

[67]  R. von Schomberg,et al.  Introduction to the International Handbook on Responsible Innovation - A Global Resource , 2019, SSRN Electronic Journal.

[68]  Miles Brundage,et al.  Understanding the movement(s) for responsible innovation , 2019, International Handbook on Responsible Innovation.

[69]  Kush R. Varshney,et al.  Increasing Trust in AI Services through Supplier's Declarations of Conformity , 2018, IBM J. Res. Dev..

[70]  M. C. Elish,et al.  Moral Crumple Zones: Cautionary Tales in Human-Robot Interaction , 2019, Engaging Science, Technology, and Society.

[71]  Jon M. Kleinberg,et al.  Discrimination in the Age of Algorithms , 2018, SSRN Electronic Journal.

[72]  Chandler May,et al.  On Measuring Social Biases in Sentence Encoders , 2019, NAACL.

[73]  Stefan Lee,et al.  ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks , 2019, NeurIPS.

[74]  Inioluwa Deborah Raji,et al.  Model Cards for Model Reporting , 2018, FAT.

[75]  Quoc V. Le,et al.  Do Better ImageNet Models Transfer Better? , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[76]  Andrew Miller,et al.  ILC: a calculus for composable, computational cryptography , 2019, IACR Cryptol. ePrint Arch..

[77]  Ilya Sutskever,et al.  Language Models are Unsupervised Multitask Learners , 2019 .

[78]  Danah Boyd,et al.  Fairness and Abstraction in Sociotechnical Systems , 2019, FAT.

[79]  John C. Duchi,et al.  Comments on Michael Jordan’s Essay “The AI Revolution Hasn’t Happened Yet" , 2019, Issue 1.

[80]  D. Heaven Should we fear an AI super-troll? , 2019, New Scientist.

[81]  Andrew McCallum,et al.  Energy and Policy Considerations for Deep Learning in NLP , 2019, ACL.

[82]  Jieh Hsiang,et al.  PatentBERT: Patent Classification with Fine-Tuning a pre-trained BERT Model , 2019, ArXiv.

[83]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[84]  Yiming Yang,et al.  XLNet: Generalized Autoregressive Pretraining for Language Understanding , 2019, NeurIPS.

[85]  Lav R. Varshney Mathematical limit theorems for computational creativity , 2019, IBM J. Res. Dev..

[86]  Jaewoo Kang,et al.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining , 2019, Bioinform..

[87]  Omer Levy,et al.  SpanBERT: Improving Pre-training by Representing and Predicting Spans , 2019, TACL.

[88]  Thilo Hagendorff,et al.  The Ethics of AI Ethics: An Evaluation of Guidelines , 2019, Minds and Machines.

[89]  Oren Etzioni,et al.  Green AI , 2019, Commun. ACM.

[90]  International Handbook on Responsible Innovation — a Global Resource , 2021, Nanoethics.