论文信息 - Convergence Results for Single-Step On-PolicyReinforcement-Learning Algorithms

Convergence Results for Single-Step On-PolicyReinforcement-Learning Algorithms

An important application of reinforcement learning (RL) is to finite-state control problems and one of the most difficult problems in learning for control is balancing the exploration/exploitation ...

SinghSatinder | JaakkolaTommi | L LittmanMichael | SzepesváriCsaba