Human-aligned artificial intelligence is a multiobjective problem
Peter Vamplew | Jane Mummery | Richard Dazeley | Cameron Foale | Sally Firmin
[1] David A. Ferrucci,et al. Introduction to "This is Watson" , 2012, IBM J. Res. Dev..
[2] Shimon Whiteson,et al. A Survey of Multi-Objective Sequential Decision-Making , 2013, J. Artif. Intell. Res..
[3] H. Tavani,et al. Ethics and technology : controversies, questions, and strategies for ethical computing , 2010 .
[4] Daniel Angus,et al. Multiple objective ant colony optimisation , 2009, Swarm Intelligence.
[5] Mark O. Riedl,et al. Using Stories to Teach Human Values to Artificial Agents , 2016, AAAI Workshop: AI, Ethics, and Society.
[6] Peter Danielson. Can robots have a conscience? , 2009, Nature.
[7] F. Cushman. Action, Outcome, and Value , 2013, Personality and social psychology review : an official journal of the Society for Personality and Social Psychology, Inc.
[8] Michael P. Wellman. REASONING ABOUT PREFERENCE MODELS , 1985 .
[9] J. Altmann. Arms control for armed uninhabited vehicles: an ethical issue , 2013, Ethics and Information Technology.
[10] D. Dubois,et al. Beyond min aggregation in multicriteria decision: (ordered) weighted min, discri-min, leximin , 1997 .
[11] Noel E. Sharkey,et al. Death strikes from the sky: the calculus of proportionality , 2009, IEEE Technology and Society Magazine.
[12] Roman V. Yampolskiy,et al. Artificial Intelligence Safety and Cybersecurity: a Timeline of AI Failures , 2016, ArXiv.
[13] Ronald Leenes,et al. Laws on Robots, Laws by Robots, Laws in Robots: Regulating Robot Behaviour by Design , 2014 .
[14] Michael Anderson,et al. MedEthEx: A Prototype Medical Ethics Advisor , 2006, AAAI.
[15] Nick Bostrom,et al. Thinking Inside the Box: Controlling and Using an Oracle AI , 2012, Minds and Machines.
[16] C. Allen,et al. Moral Machines: Teaching Robots Right from Wrong , 2008 .
[17] Peter Vamplew,et al. Lego Mindstorms Robots as a Platform for Teaching Reinforcement Learning , 2004 .
[18] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[19] Ralph L. Keeney. Value-driven expert systems for decision support , 1987 .
[20] Mariarosaria Taddeo,et al. The ethics of algorithms: Mapping the debate , 2016, Big Data Soc..
[21] Peter Asaro,et al. A Body to Kick, but Still No Soul to Damn : Legal Perspectives on Robotics , 2012 .
[22] Patrick Lin,et al. Moral Machines: Contradiction in Terms or Abdication of Human Responsibility? , 2012 .
[23] George Reynolds. Ethics in Information Technology , 2002 .
[24] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[25] Ioannis P. Vlahavas,et al. Multiobjective heuristic state-space planning , 2003, Artif. Intell..
[26] Michael L. Littman,et al. Reinforcement learning improves behaviour from evaluative feedback , 2015, Nature.
[27] Daniel Dewey,et al. Reinforcement Learning and the Reward Engineering Principle , 2014, AAAI Spring Symposia.
[28] John Yearwood,et al. On the Limitations of Scalarisation for Multi-objective Reinforcement Learning of Pareto Fronts , 2008, Australasian Conference on Artificial Intelligence.
[29] W. D. Ross,et al. The Right and the Good , 1930 .
[30] Frank Dignum. Autonomous agents and social norms , 1996 .
[31] Ronald C. Arkin,et al. Governing lethal behavior: Embedding ethics in a hybrid deliberative/reactive robot architecture part I: Motivation and philosophy , 2008, 2008 3rd ACM/IEEE International Conference on Human-Robot Interaction (HRI).
[32] Peter R. Lewis,et al. A novel adaptive weight selection algorithm for multi-objective multi-agent reinforcement learning , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).
[33] J. Fieldsend. Multi-Objective Particle Swarm Optimisation Methods , 2004 .
[34] Dr. Tom Murphy. The First Level of Super Mario Bros. is Easy with Lexicographic Orderings and Time Travel , 2013 .
[35] H. Eyre. Introduction I , 2022 .
[36] Aimee van Wynsberghe,et al. Service robots, care ethics, and design , 2016, Ethics and Information Technology.
[37] Douglas B. Lenat,et al. EURISKO: A Program That Learns New Heuristics and Domain Concepts , 1983, Artif. Intell..
[38] Jim Blythe,et al. Decision-Theoretic Planning , 1999, AI Mag..
[39] Thomas G. Dietterich,et al. Letter to the Editor: Research Priorities for Robust and Beneficial Artificial Intelligence: An Open Letter , 2015, AI Mag..
[40] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[41] Yiannis Demiris,et al. Evolving policies for multi-reward partially observable markov decision processes (MR-POMDPs) , 2011, GECCO '11.
[42] Mehdi Dastani,et al. Goal generation in the BOID architecture , 2002 .
[43] Jessica Taylor,et al. Quantilizers: A Safer Alternative to Maximizers for Limited Optimization , 2016, AAAI Workshop: AI, Ethics, and Society.
[44] Carlos A. Coello Coello,et al. Evolutionary multi-objective optimization: a historical view of the field , 2006, IEEE Comput. Intell. Mag..
[45] Miles Brundage,et al. Limitations and risks of machine ethics , 2014, J. Exp. Theor. Artif. Intell..
[46] I. Kant,et al. Grounding for the metaphysics of morals , 1981 .
[47] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[48] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .
[49] R. Rosenfeld,et al. Ethics , 2008 .
[50] Patrick Lin,et al. The Rights and Wrongs of Robot Care , 2012 .
[51] Marcello Guarini,et al. Particularism and the Classification and Reclassification of Moral Cases , 2006, IEEE Intelligent Systems.
[52] David F. McAllister,et al. An Experimental Evaluation of Software Redundancy as a Strategy For Improving Reliability , 1991, IEEE Trans. Software Eng..
[53] Michael L. Littman,et al. Reinforcement Learning as a Framework for Ethical Decision Making , 2016, AAAI Workshop: AI, Ethics, and Society.
[54] P. Fishburn,et al. Utility theory , 1980, Cognitive Choice Modeling.
[55] Patrick Lin,et al. Killing Made Easy: From Joysticks to Politics , 2012 .
[56] J. Dennis,et al. A closer look at drawbacks of minimizing weighted sums of objectives for Pareto set generation in multicriteria optimization problems , 1997 .
[57] Andrew Critch. Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making , 2017, ArXiv.
[58] Oren Etzioni,et al. Designing AI systems that obey our laws and values , 2016, Commun. ACM.
[59] Itamar Elhanany,et al. On the Broad Implications of Reinforcement Learning based AGI , 2008, AGI.
[60] Hendrik Prakken. On how AI & Law can help autonomous systems obey the law: a position paper , 2016 .
[61] Kyarash Shahriari,et al. IEEE standard review — Ethically aligned design: A vision for prioritizing human wellbeing with artificial intelligence and autonomous systems , 2017, 2017 IEEE Canada International Humanitarian Technology Conference (IHTC).
[62] Michael Anderson,et al. An Approach to Computing Ethics , 2006, IEEE Intelligent Systems.
[63] Michael Anderson,et al. Machine Ethics: Creating an Ethical Intelligent Agent , 2007, AI Mag..
[64] Pablo Noriega,et al. Normative Multi-Agent Systems , 2013, Normative Multi-Agent Systems.
[65] Patrick Lin,et al. Robot Ethics: The Ethical and Social Implications of Robotics , 2011 .
[66] Victor R. Lesser,et al. Creating Socially Adaptive Electronic Partners: Interaction, Reasoning and Ethical Challenges , 2015, AAMAS.
[67] Benja Fallenstein,et al. Aligning Superintelligence with Human Interests: A Technical Research Agenda , 2015 .
[68] Frank Dignum,et al. Deliberative Normative Agents: Principles and Architecture , 1999, ATAL.
[69] Daniel Dewey,et al. Learning What to Value , 2011, AGI.
[70] Ingemar J. Cox,et al. Autonomous Robot Vehicles , 1990, Springer New York.
[71] John Schulman,et al. Concrete Problems in AI Safety , 2016, ArXiv.
[72] Salvatore Ruggieri,et al. A multidisciplinary survey on discrimination analysis , 2013, The Knowledge Engineering Review.
[73] Nick Bostrom,et al. Superintelligence: Paths, Dangers, Strategies , 2014 .
[74] Stephen M. Omohundro,et al. The Basic AI Drives , 2008, AGI.
[75] J. Trinkle,et al. Learning controllers for human-robot interaction , 2009 .