Understanding User Attitudes Towards Negative Side Effects of AI Systems

Artificial Intelligence (AI) systems deployed in the open world may produce negative side effects: unanticipated, undesirable outcomes that occur in addition to the intended outcomes of the system's actions. These side effects affect users directly or indirectly by violating their preferences or by altering their environment in an undesirable, potentially harmful manner. While the existing literature has begun to explore techniques for mitigating the impacts of negative side effects in deployed systems, there have been no prior efforts to determine how users perceive and respond to them. We surveyed 183 participants in each of two domains, an autonomous vacuum cleaner and an autonomous vehicle, to develop an understanding of user attitudes towards side effects and of how side effects affect user trust in the system. The results indicate that users are willing to tolerate side effects that are not safety-critical but prefer that they be minimized as much as possible. Furthermore, users are willing to assist the system in mitigating negative side effects by providing feedback and by reconfiguring the environment. Trust in the system diminishes if it fails to minimize the impacts of negative side effects over time. These results support key assumptions underlying existing techniques and can facilitate the development of new methods for overcoming negative side effects of AI systems.
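The core notion, an outcome that lies outside the objective the system actually optimizes, can be made concrete with a small sketch. The snippet below is a hypothetical illustration only, not the paper's method or survey instrument: a cleaning agent ranks plans using its intended objective alone, while the value the user experiences also subtracts an unmodeled side-effect cost, so the agent and the user can prefer different plans. The Outcome fields, the plan names, and the side_effect_weight parameter are illustrative assumptions.

```python
# Hypothetical sketch: a negative side effect is an outcome the agent's
# objective never modeled. The vacuum agent plans only for "dirt cleaned",
# while each plan may also scuff the rug -- a cost invisible to its reward.

from dataclasses import dataclass

@dataclass
class Outcome:
    dirt_cleaned: float   # intended outcome the agent optimizes
    rug_damage: float     # unmodeled, undesirable side effect

def agent_value(outcome: Outcome) -> float:
    """The agent's own objective: it sees only the intended outcome."""
    return outcome.dirt_cleaned

def user_value(outcome: Outcome, side_effect_weight: float = 1.0) -> float:
    """What the user experiences: intended outcome minus side-effect cost."""
    return outcome.dirt_cleaned - side_effect_weight * outcome.rug_damage

plans = {
    "fast_sweep":    Outcome(dirt_cleaned=10.0, rug_damage=4.0),
    "careful_sweep": Outcome(dirt_cleaned=9.0,  rug_damage=0.5),
}

best_for_agent = max(plans, key=lambda p: agent_value(plans[p]))
best_for_user = max(plans, key=lambda p: user_value(plans[p]))
print(best_for_agent, best_for_user)  # fast_sweep vs. careful_sweep
```

Running the sketch, the agent prefers fast_sweep while the user prefers careful_sweep, which mirrors the survey finding that users tolerate non-safety-critical side effects yet still want them minimized.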
