QnR-Learning in Markov Games