Relaxing the i.i.d. assumption: Adaptively minimax optimal regret via root-entropic regularization