One- and Two-Armed Bandit Problems

[1]  D. Berry,et al.  Maximizing the length of a success run for many-armed bandits , 1983 .

[2]  D. Robinson,et al.  A comparison of sequential treatment allocation rules , 1983 .

[3]  P. Kumar,et al.  On the optimal solution of the one-armed bandit adaptive control problem , 1981 .

[4]  D. Berry,et al.  Bernoulli two-armed bandits with geometric termination , 1981 .

[5]  D. Berry,et al.  Two-armed bandits with a goal, II. Dependent arms , 1980, Advances in Applied Probability.

[6]  Donald A. Berry,et al.  Two-armed bandits with a goal, I. One arm known , 1980, Advances in Applied Probability.

[7]  D. Berry,et al.  Bernoulli One-Armed Bandits--Arbitrary Discount Sequences , 1979 .

[8]  L. Rodman On the Many-armed Bandit Problem , 1978 .

[9]  E. Nordbrock An improved play-the-winner sampling procedure for selecting the better of two binomial populations , 1976 .

[10]  M. Rothschild A two-armed bandit theory of market pricing , 1974 .

[11]  Thomas A. Kelley A Note on the Bernoulli Two-Armed Bandit Problem , 1974 .

[12]  D. Berry A Bernoulli Two-armed Bandit , 1972 .

[13]  Milton Sobel,et al.  Play-the-winner sampling for selecting the better of two binomial populations , 1970 .

[14]  Thomas M. Cover,et al.  The two-armed-bandit problem with time-invariant finite memory , 1970, IEEE Trans. Inf. Theory.

[15]  Carter Smith,et al.  The Robbins-Isbell Two-Armed-Bandit Problem with Finite Memory , 1965 .

[16]  Dorian Feldman Contributions to the "Two-Armed Bandit" Problem , 1962 .

[17]  J. Isbell On a Problem of Robbins , 1959 .

[18]  R. N. Bradt,et al.  On Sequential Designs for Maximizing the Sum of $n$ Observations , 1956 .

[19]  W. R. Thompson ON THE LIKELIHOOD THAT ONE UNKNOWN PROBABILITY EXCEEDS ANOTHER IN VIEW OF THE EVIDENCE OF TWO SAMPLES , 1933 .

[20]  D. Berry Bandit Problems with Random Discounting , 1983 .

[21]  Donald A. Berry,et al.  Modified Two-Armed Bandit Strategies for Certain Clinical Trials , 1978 .

[22]  I. Witten The apparent conflict between estimation and control—a survey of the two-armed bandit problem , 1976 .

[23]  T. Wallsten,et al.  Individual Decision Behavior , 1972 .