论文信息 - Modeling and Interpreting Expert Disagreement About Artificial Superintelligence

Modeling and Interpreting Expert Disagreement About Artificial Superintelligence

Artificial superintelligence (ASI) is artificial intelligence (AI) with capabilities that are significantly greater than human capabilities across a wide range of domains. A hallmark of the ASI issue is disagreement among experts. This paper demonstrates and discusses methodological options for modeling and interpreting expert disagreement about the risk of ASI catastrophe. Using a new model called ASI-PATH, the paper models a well-documented recent disagreement between Nick Bostrom and Ben Goertzel, two distinguished ASI experts. Three points of disagreement are considered: (1) the potential for humans to evaluate the values held by an AI, (2) the potential for humans to create an AI with values that humans would consider desirable, and (3) the potential for an AI to create for itself values that humans would consider desirable. An initial quantitative analysis shows that accounting for variation in expert judgment can have a large effect on estimates of the risk of ASI catastrophe. The risk estimates can in turn inform ASI risk management strategies, which the paper demonstrates via an analysis of the strategy of AI confinement. The paper find the optimal strength of AI confinement to depend on the balance of risk parameters (1) and (2).

[1] Roman V. Yampolskiy,et al. Leakproofing the Singularity Artificial Intelligence Confinement Problem , 2012 .

[2] Ben Goertzel,et al. Superintelligence: Fears, Promises and Potentials Reflections on Bostrom's Superintelligence, Yudkowsky's From AI to Zombies, and Weaver and Veitas's "Open-Ended Intelligence" , 2015 .

[3] N. Oreskes. The Scientific Consensus on Climate Change , 2004, Science.

[4] Ben Goertzel,et al. Nine Ways to Bias Open-Source AGI Toward Friendliness , 2012 .

[5] Seth D. Baum,et al. Risk Analysis and Risk Management for the Artificial Superintelligence Research and Development Process , 2015 .

[6] Nick Bostrom,et al. Superintelligence: Paths, Dangers, Strategies , 2014 .

[7] Stuart Armstrong,et al. How We're Predicting AI - or Failing to , 2015 .

[8] Nick Bostrom,et al. Future Progress in Artificial Intelligence: A Survey of Expert Opinion , 2013, PT-AI.

[9] R. Penrose,et al. How Long Until Human-Level AI ? Results from an Expert Assessment , 2011 .

[10] Stuart Armstrong,et al. The errors, insights and lessons of famous AI predictions – and what they mean for the future , 2014, J. Exp. Theor. Artif. Intell..

[11] Ben Goertzel,et al. Infusing Advanced AGIs with Human-Like Value Systems , 2016, Journal of Ethics and Emerging Technologies.

[12] Anthony Michael Barrett,et al. A model of pathways to artificial superintelligence catastrophe for risk and decision analysis , 2016, J. Exp. Theor. Artif. Intell..