论文信息 - Aligning artificial intelligence with human values: reflections from a phenomenological perspective - 字舞流文

Aligning artificial intelligence with human values: reflections from a phenomenological perspective

Artificial Intelligence (AI) must be directed at humane ends. The development of AI has produced great uncertainties of ensuring AI alignment with human values (AI value alignment) through AI operations from design to use. For the purposes of addressing this problem, we adopt the phenomenological theories of material values and technological mediation to be that beginning step. In this paper, we first discuss the AI value alignment from the relevant AI studies. Second, we briefly present what are material values and technological mediation and reflect on the AI value alignment through the lenses of these theories. We conclude that a set of finite human values can be defined and adapted to the stable life tasks that AI systems will be called upon to accomplish. The AI value alignment can also be fostered between designers and users through technological mediation. Upon that foundation, we propose a set of common principles to understand the AI value alignment through phenomenological theories. This paper contributes the unique knowledge of phenomenological theories to the discourse on AI alignment with human values.

Shahrokh Nikou | Eugene Kelly | Eric-Oluf Svee | Shengnan Han | Shahrokh Nikou | Shengnan Han | Eric-Oluf Svee | Eugene Kelly

[1] D. Ihde. Expanding Hermeneutics: Visualism in Science , 1998 .

[2] N. Rescher. Moral Issues Relating to the Economics of New Knowledge in the Biomedical Sciences , 1982 .

[3] G. Gordon Worley,et al. Robustness to fundamental uncertainty in AGI alignment , 2018, ArXiv.

[4] Kaj Sotala,et al. Defining Human Values for Value Learners , 2016, AAAI Workshop: AI, Ethics, and Society.

[5] M. Holbrook. Consumer Value: A Framework for Analysis and Research , 1999 .

[6] M. Scheler,et al. The human place in the cosmos , 2008 .

[7] Ben Taskar,et al. Learning structured prediction models: a large margin approach , 2005, ICML.

[8] S. Ulam. John von Neumann 1903-1957 , 1958 .

[9] Eugene Kelly. Revisiting Max Scheler's formalism in ethics: virtue-based ethics and moral rules in the non-formal ethics of value , 1997 .

[10] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.

[11] M. Rokeach. The Nature Of Human Values , 1974 .

[12] Guangyuan Liu,et al. Quantum Optimization and Quantum Learning: A Survey , 2020, IEEE Access.

[13] S. Schwartz,et al. Value Consensus and Importance , 2000 .

[14] Max Tegmark. Life 3.0: Being Human in the Age of Artificial Intelligence , 2017 .

[15] A. Schutz. Max Scheler’s Epistemology and Ethics , 1970 .

[16] Igor Aleksander,et al. Partners of humans: a realistic assessment of the role of robots in the foreseeable future , 2017, J. Inf. Technol..

[17] Eldad Davidov,et al. Refining the theory of basic individual values. , 2012, Journal of personality and social psychology.

[18] P. Bloom. Just Babies: The Origins of Good and Evil , 1836 .

[19] N. Bostrom. Astronomical Waste: The Opportunity Cost of Delayed Technological Development , 2003, Utilitas.

[20] C. Kluckhohn. 2. VALUES AND VALUE-ORIENTATIONS IN THE THEORY OF ACTION: AN EXPLORATION IN DEFINITION AND CLASSIFICATION , 1951 .

[21] Peter-Paul Verbeek,et al. Moralizing Technology: Understanding and Designing the Morality of Things , 2011 .

[22] Francesca Rossi,et al. AI4People—An Ethical Framework for a Good AI Society: Opportunities, Risks, Principles, and Recommendations , 2018, Minds and Machines.

[23] Ben Goertzel,et al. Contemporary Approaches to Artificial General Intelligence , 2007, Artificial General Intelligence.

[24] Iason Gabriel,et al. Artificial Intelligence, Values, and Alignment , 2020, Minds and Machines.

[25] David G. Hendry,et al. Value Sensitive Design , 2019 .

[26] Youngjin Yoo,et al. Digital First: The Ontological Reversal and New Challenges for Information Systems Research , 2020, MIS Q..

[27] Roman V. Yampolskiy,et al. The technological singularity , 2017 .

[28] M. Scheler,et al. Gesammelte Werke. II: Der Formalismus in der EthiK und die materiale Wertethik , 1955 .

[29] Eugene T. Kelly,et al. Material Ethics of Value: Max Scheler and Nicolai Hartmann , 2011 .

[30] Christopher L. Tucci,et al. Internet Business Models and Strategies: Text and Cases , 2002 .

[31] Virginia Dignum,et al. Responsible Artificial Intelligence: Designing Ai for Human Values , 2017 .

[32] Stuart Russell. Human Compatible: Artificial Intelligence and the Problem of Control , 2019 .

[33] D. Ihde. Technology and the lifeworld : from garden to earth , 1991 .

[34] D. Holmes,et al. Understanding human enhancement technologies through critical phenomenology , 2018, Nursing philosophy : an international journal for healthcare professionals.

[35] Andreas Trabesinger,et al. Quantum computing: towards reality , 2017, Nature.

[36] Jelena Zdravkovic,et al. Exploring business value models from the inter-organizational collaboration perspective , 2010, SAC '10.

[37] Oren Etzioni,et al. AI assisted ethics , 2016, Ethics and Information Technology.

[38] Vassilis Galanos,et al. Exploring expanding expertise: artificial intelligence as an existential threat and the role of prestigious commentators, 2014–2018 , 2018, Technol. Anal. Strateg. Manag..

[39] Dawn Song,et al. Aligning AI With Shared Human Values , 2020, ICLR.

[40] Martin Fishbein,et al. Theory-based Behavior Change Interventions: Comments on Hobbis and Sutton , 2005, Journal of health psychology.

[41] Mark O. Riedl,et al. Using Stories to Teach Human Values to Artificial Agents , 2016, AAAI Workshop: AI, Ethics, and Society.

[42] Oren Etzioni,et al. Designing AI systems that obey our laws and values , 2016, Commun. ACM.

[43] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.

[44] F. Warneken,et al. Costly fairness in children is influenced by who is watching. , 2020, Developmental psychology.

[45] M. Lynne Markus,et al. A Foundation for the Study of IT Effects: A New Look at DeSanctis and Poole's Concepts of Structural Features and Spirit , 2008, J. Assoc. Inf. Syst..

[46] Arthur I. Miller. The Artist in the Machine: The World of AI-Powered Creativity , 2019 .

[47] Gopal P. Sarma,et al. Mammalian Value Systems , 2016, Informatica.

[48] Kristina Höök,et al. Designing with the Body: Somaesthetic Interaction Design , 2018, CHIRA.

[49] S. Schwartz. Are There Universal Aspects in the Structure and Contents of Human Values , 1994 .

[50] Stuart J. Russell,et al. Research Priorities for Robust and Beneficial Artificial Intelligence , 2015, AI Mag..

[51] Kaj Sotala,et al. Responses to the Journey to the Singularity , 2017 .

[52] Rosalind W. Picard. Affective Computing , 1997 .

[53] Geoff Walsham,et al. Are we making a better world with ICTs? Reflections on a future agenda for the IS field , 2012, J. Inf. Technol..

[54] EtzioniOren,et al. Designing AI systems that obey our laws and values , 2016 .