Geodesic Gaussian kernels for value function approximation
暂无分享,去创建一个
The least-squares policy iteration approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular and use...