Reward-Related Learning via Multiple Memory