Factors of Influence of the Overestimation Bias of Q-Learning