KL-UCRL Revisited: Variance-Aware Regret Bound