Are sample means in multi-armed bandits positively or negatively biased?