On finite memory solutions to the two-armed bandit problem (Corresp.)

The least upper bound on the asymptotic proportion of the choice of the correct coin, achievable by {\em expedient} finite-memory algorithms in certain two-armed bandit problems, is derived and schemes which achieve these bounds in a limiting sense are displayed. A deterministic automaton whose performance is close to optimal is also presented.