Training and Transferring Safe Policies in Reinforcement Learning