Pierre-Yves Oudeyer | Olivier Sigaud | Cédric Colas
[1] B. L. Welch. THE GENERALIZATION OF ‘STUDENT'S’ PROBLEM WHEN SEVERAL DIFFERENT POPULATION VARIANCES ARE INVOLVED , 1947 .
[2] Philip Bachman,et al. Deep Reinforcement Learning that Matters , 2017, AAAI.
[3] Peter Henderson,et al. Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control , 2017, ArXiv.
[4] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[5] J. D. de Winter. Using the Student's t-test with extremely small sample sizes , 2013 .
[6] F. Wilcoxon. Individual Comparisons by Ranking Methods , 1945 .
[7] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[8] Herke van Hoof,et al. Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.
[9] Robert Tibshirani,et al. An Introduction to the Bootstrap , 1994 .
[10] Student. THE PROBABLE ERROR OF A MEAN , 1908 .
[11] N. Smirnov. Table for Estimating the Goodness of Fit of Empirical Distributions , 1948 .
[12] R. Iman,et al. Rank Transformations as a Bridge between Parametric and Nonparametric Statistics , 1981 .