Teaching AI Agents Ethical Values Using Reinforcement Learning and Policy Orchestration