Supplementary Information
Reinforcement Learning Under Moral Uncertainty