Temporal credit assignment in reinforcement learning