Addendum to ‘On an index policy for restless bandits'
暂无分享,去创建一个
We show that the fluid approximation to Whittle's index policy for restless bandits has a globally asymptotically stable equilibrium point when the bandits move on just three states. It follows that in this case the index policy is asymptotic optimal.
[1] D. Jordan,et al. Nonlinear Ordinary Differential Equations: An Introduction for Scientists and Engineers , 1979 .
[2] D. Jordan,et al. Nonlinear ordinary differential equations (2nd ed.) , 1987 .
[3] P. Whittle. Restless Bandits: Activity Allocation in a Changing World , 1988 .