The dynamics of explore–exploit decisions reveal a signal-to-noise mechanism for random exploration