A model of emergence of reward expectancy neurons using reinforcement learning and neural network