Simon S. Du
发表
Ruosong Wang,
Jason D. Lee,
Simon S. Du,
2020
.
Jiajun Wu,
Wei Hu,
Zhao Song,
2020,
ArXiv.
Liwei Wang,
Jason D. Lee,
Simon S. Du,
2018,
ICML.
Jason D. Lee,
Simon S. Du,
S. Du,
2018,
ICML.
Michael I. Jordan,
Barnabás Póczos,
Chi Jin,
2017,
NIPS.
Sivaraman Balakrishnan,
Ruslan Salakhutdinov,
Aarti Singh,
2018,
NeurIPS.
Simon S. Du,
S. Du,
2019
.
Ruosong Wang,
Simon S. Du,
Hanrui Zhang,
2019,
NeurIPS.
Yi Wu,
Yining Wang,
Simon S. Du,
2018,
ArXiv.
Barnabás Póczos,
Simon S. Du,
Shashank Singh,
2016,
NIPS.
Xiao Zhang,
Quanquan Gu,
Simon S. Du,
2018,
ICML.
Yunbo Wang,
Fei-Fei Li,
Yuke Zhu,
2020,
IJCAI.
Ken-ichi Kawarabayashi,
Stefanie Jegelka,
Simon S. Du,
2020,
ArXiv.
Aarti Singh,
Yining Wang,
Simon S. Du,
2017,
NIPS.
Xi Chen,
Simon S. Du,
Xin T. Tong,
2019,
J. Mach. Learn. Res..
Wei Hu,
Jason D. Lee,
Simon S. Du,
2018,
NeurIPS.
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon
pdf
Xiangyang Ji,
Zihan Zhang,
Simon S. Du,
2020,
ArXiv.
Simon S. Du,
Surbhi Goel,
S. Du,
2018,
ArXiv.
Ruosong Wang,
Dingli Yu,
Ruslan Salakhutdinov,
2019,
ArXiv.
Sivaraman Balakrishnan,
Aarti Singh,
Simon S. Du,
2017,
ArXiv.
Wei Hu,
Simon S. Du,
S. Du,
2019,
ICML.
Barnabás Póczos,
Aarti Singh,
Simon S. Du,
2018,
ICLR.
Barnabás Póczos,
Aarti Singh,
Simon S. Du,
2016,
NIPS.
Sivaraman Balakrishnan,
Ruslan Salakhutdinov,
Aarti Singh,
2018,
NIPS 2018.
Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity
pdf
Ruosong Wang,
Jason D. Lee,
Simon S. Du,
2020,
ArXiv.
Yunbo Wang,
Fei-Fei Li,
Yuke Zhu,
2019,
ArXiv.
Xiaoxia Wu,
Rachel Ward,
Simon S. Du,
2019,
ArXiv.
Ruosong Wang,
Wei Hu,
Sanjeev Arora,
2019,
ICML.
Yuandong Tian,
Barnabás Póczos,
Jason D. Lee,
2017,
ICML.
Zhao Song,
Simon S. Du,
Xingguo Li,
2020,
NeurIPS.
Ken-ichi Kawarabayashi,
Stefanie Jegelka,
Simon S. Du,
2019,
ICLR.
Lin F. Yang,
Simon S. Du,
Kunhe Yang,
2020,
ArXiv.
Lihong Li,
Lin Xiao,
Jianshu Chen,
2017,
ICML.
Ruosong Wang,
Sham M. Kakade,
Lin F. Yang,
2020,
ICLR.
Ruosong Wang,
Lin F. Yang,
Simon S. Du,
2020,
NeurIPS.
Is Long Horizon Reinforcement Learning More Difficult Than Short Horizon Reinforcement Learning?
pdf
Ruosong Wang,
Sham M. Kakade,
Lin F. Yang,
2020,
ArXiv.
Michael I. Jordan,
Simon S. Du,
Bin Shi,
2019,
NeurIPS.
Ruosong Wang,
Barnabás Póczos,
Ruslan Salakhutdinov,
2019,
NeurIPS.
Ruosong Wang,
Ruslan Salakhutdinov,
Lin F. Yang,
2020,
NeurIPS.
Xi Chen,
Simon S. Du,
Xin T. Tong,
2019,
ArXiv.
Yuandong Tian,
Jason D. Lee,
Simon S. Du,
2017,
ICLR.
Jerry Li,
Sivaraman Balakrishnan,
Aarti Singh,
2017,
COLT.
Sivaraman Balakrishnan,
Pradeep Ravikumar,
Aarti Singh,
2018,
ArXiv.
Tuo Zhao,
Simon S. Du,
Enlu Zhou,
2019,
NeurIPS.
Ruosong Wang,
Dingli Yu,
Ruslan Salakhutdinov,
2019,
ICLR.
Barnabás Póczos,
David Wettergreen,
Simon S. Du,
2017,
FSR.
Ruosong Wang,
Sham M. Kakade,
Lin F. Yang,
2020,
NeurIPS.
Michael I. Jordan,
Simon S. Du,
Bin Shi,
2018,
Mathematical Programming.
Ruosong Wang,
Wei Hu,
Sanjeev Arora,
2019,
NeurIPS.
Nan Jiang,
John Langford,
Akshay Krishnamurthy,
2019,
ICML.
Ruslan Salakhutdinov,
Yining Wang,
Simon S. Du,
2018
.
Sivaraman Balakrishnan,
Aarti Singh,
Yining Wang,
2017,
AISTATS.
Linear Convergence of the Primal-Dual Gradient Method for Convex-Concave Saddle Point Problems without Strong Convexity
pdf
Wei Hu,
Simon S. Du,
S. Du,
2018,
AISTATS.
Sujay Sanghavi,
Bo Dai,
Simon S. Du,
2021,
ArXiv.
Lin F. Yang,
Simon S. Du,
Kunhe Yang,
2020,
AISTATS.
Simon S. Du,
Ruoqi Shen,
Wei Hu,
2020,
UAI.
Tengyu Ma,
Simon S. Du,
Haike Xu,
2021,
COLT.
Zihan Zhang,
Simon S. Du,
Xiangyang Ji,
2021,
NeurIPS.
Simon S. Du,
Wei Hu,
Jason D. Lee,
2020,
ICLR.
Ken-ichi Kawarabayashi,
Stefanie Jegelka,
Simon S. Du,
2020,
ICLR.
Simon S. Du,
Kevin G. Jamieson,
Kevin Jamieson,
2021,
ICML.
Simon S. Du,
Zhuoran Yang,
Zhaoran Wang,
2021,
AISTATS.
Simon S. Du,
Ruoqi Shen,
Qiwen Cui,
2021
.
Yi Wu,
Simon S. Du,
Yining Wang,
2021,
INFORMS J. Comput..
Alessandro Lazaric,
Simon S. Du,
Matteo Pirotta,
2021,
NeurIPS.
Alessandro Lazaric,
Liwei Wang,
Simon S. Du,
2021,
ArXiv.
Simon S. Du,
Tian Ye,
S. Du,
2021,
NeurIPS.
Simon S. Du,
Ruoqi Shen,
Zhihan Xiong,
2021,
NeurIPS.
Simon S. Du,
Xinlei Chen,
Yuandong Tian,
2021,
ArXiv.
Liwei Wang,
Simon S. Du,
Tianhao Wu,
2021,
ICML.
Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP
pdf
Xiangyang Ji,
Zihan Zhang,
Simon S. Du,
2021,
ArXiv.
Jiantao Jiao,
Simon S. Du,
Han Zhong,
2021,
ArXiv.
Simon S. Du,
Gao Huang,
Rui Lu,
2021,
ArXiv.
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach
pdf
Max Simchowitz,
Simon S. Du,
Kevin Jamieson,
2021,
ArXiv.
Simon S. Du,
Kevin Jamieson,
Yifang Chen,
2021,
ArXiv.
Yuandong Tian,
Jason D. Lee,
Simon S. Du,
2021,
ArXiv.
Xiangyang Ji,
Zihan Zhang,
Simon S. Du,
2021,
COLT.
Shachar Lovett,
Sham M. Kakade,
Simon S. Du,
2021,
ICML.
Simon S. Du,
Shaobo Han,
Zhili Feng,
2021,
ArXiv.
Jason D. Lee,
Simon S. Du,
Xiyu Zhai,
2018
.
Yichong Xu,
Pulkit Grover,
Simon S. Du,
2016
.