Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling

R. Munos | N. Korda | Prashanth L. A.