Relative Upper Confidence Bound for the K-Armed Dueling Bandit Problem