Discussion of 'Reinforcement learning behaviors in sponsored search'