Action selection methods using reinforcement learning