Simon S. Du

发表

Agnostic Q-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity

Ruosong Wang, Jason D. Lee, Simon S. Du, 2020 .

When is Particle Filtering Efficient for POMDP Sequential Planning? pdf

Jiajun Wu, Wei Hu, Zhao Song, 2020, ArXiv.

Gradient Descent Finds Global Minima of Deep Neural Networks pdf

Liwei Wang, Jason D. Lee, Simon S. Du, 2018, ICML.

On the Power of Over-parametrization in Neural Networks with Quadratic Activation

Jason D. Lee, Simon S. Du, S. Du, 2018, ICML.

Gradient Descent Can Take Exponential Time to Escape Saddle Points

Michael I. Jordan, Barnabás Póczos, Chi Jin, 2017, NIPS.

How Many Samples are Needed to Estimate a Convolutional Neural Network?

Sivaraman Balakrishnan, Ruslan Salakhutdinov, Aarti Singh, 2018, NeurIPS.

Gradient Descent for Non-convex Problems in Modern Machine Learning

Simon S. Du, S. Du, 2019 .

Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle

Ruosong Wang, Simon S. Du, Hanrui Zhang, 2019, NeurIPS.

Near-Linear Time Local Polynomial Nonparametric Estimation pdf

Yi Wu, Yining Wang, Simon S. Du, 2018, ArXiv.

Efficient Nonparametric Smoothness Estimation

Barnabás Póczos, Simon S. Du, Shashank Singh, 2016, NIPS.

Fast and Sample Efficient Inductive Matrix Completion via Multi-Phase Procrustes Flow pdf

Xiao Zhang, Quanquan Gu, Simon S. Du, 2018, ICML.

DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs

Yunbo Wang, Fei-Fei Li, Yuke Zhu, 2020, IJCAI.

How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks pdf

Ken-ichi Kawarabayashi, Stefanie Jegelka, Simon S. Du, 2020, ArXiv.

On the Power of Truncated SVD for General High-rank Matrix Estimation Problems

Aarti Singh, Yining Wang, Simon S. Du, 2017, NIPS.

On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics

Xi Chen, Simon S. Du, Xin T. Tong, 2019, J. Mach. Learn. Res..

Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced

Wei Hu, Jason D. Lee, Simon S. Du, 2018, NeurIPS.

Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon pdf

Xiangyang Ji, Zihan Zhang, Simon S. Du, 2020, ArXiv.

Improved Learning of One-hidden-layer Convolutional Neural Networks with Overlaps pdf

Simon S. Du, Surbhi Goel, S. Du, 2018, ArXiv.

Enhanced Convolutional Neural Tangent Kernels pdf

Ruosong Wang, Dingli Yu, Ruslan Salakhutdinov, 2019, ArXiv.

Computationally Efficient Robust Estimation of Sparse Functionals pdf

Sivaraman Balakrishnan, Aarti Singh, Simon S. Du, 2017, ArXiv.

Width Provably Matters in Optimization for Deep Linear Neural Networks

Wei Hu, Simon S. Du, S. Du, 2019, ICML.

Gradient Descent Provably Optimizes Over-parameterized Neural Networks pdf

Barnabás Póczos, Aarti Singh, Simon S. Du, 2018, ICLR.

Hypothesis Transfer Learning via Transformation Functions

Barnabás Póczos, Aarti Singh, Simon S. Du, 2016, NIPS.

How Many Samples are Needed to Learn a Convolutional Neural Network? pdf

Sivaraman Balakrishnan, Ruslan Salakhutdinov, Aarti Singh, 2018, NIPS 2018.

Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity pdf

Ruosong Wang, Jason D. Lee, Simon S. Du, 2020, ArXiv.

Dual Sequential Monte Carlo: Tunneling Filtering and Planning in Continuous POMDPs pdf

Yunbo Wang, Fei-Fei Li, Yuke Zhu, 2019, ArXiv.

Global Convergence of Adaptive Gradient Methods for An Over-parameterized Neural Network pdf

Xiaoxia Wu, Rachel Ward, Simon S. Du, 2019, ArXiv.

Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks

Ruosong Wang, Wei Hu, Sanjeev Arora, 2019, ICML.

Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima

Yuandong Tian, Barnabás Póczos, Jason D. Lee, 2017, ICML.

Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality pdf

Zhao Song, Simon S. Du, Xingguo Li, 2020, NeurIPS.

What Can Neural Networks Reason About? pdf

Ken-ichi Kawarabayashi, Stefanie Jegelka, Simon S. Du, 2019, ICLR.

Q-learning with Logarithmic Regret pdf

Lin F. Yang, Simon S. Du, Kunhe Yang, 2020, ArXiv.

Stochastic Variance Reduction Methods for Policy Evaluation

Lihong Li, Lin Xiao, Jianshu Chen, 2017, ICML.

Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning? pdf

Ruosong Wang, Sham M. Kakade, Lin F. Yang, 2020, ICLR.

Planning with General Objective Functions: Going Beyond Total Rewards

Ruosong Wang, Lin F. Yang, Simon S. Du, 2020, NeurIPS.

Is Long Horizon Reinforcement Learning More Difficult Than Short Horizon Reinforcement Learning? pdf

Ruosong Wang, Sham M. Kakade, Lin F. Yang, 2020, ArXiv.

Acceleration via Symplectic Discretization of High-Resolution Differential Equations

Michael I. Jordan, Simon S. Du, Bin Shi, 2019, NeurIPS.

Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels

Ruosong Wang, Barnabás Póczos, Ruslan Salakhutdinov, 2019, NeurIPS.

On Reward-Free Reinforcement Learning with Linear Function Approximation pdf

Ruosong Wang, Ruslan Salakhutdinov, Lin F. Yang, 2020, NeurIPS.

Hitting Time of Stochastic Gradient Langevin Dynamics to Stationary Points: A Direct Analysis pdf

Xi Chen, Simon S. Du, Xin T. Tong, 2019, ArXiv.

When is a Convolutional Filter Easy To Learn? pdf

Yuandong Tian, Jason D. Lee, Simon S. Du, 2017, ICLR.

Computationally Efficient Robust Sparse Estimation in High Dimensions

Jerry Li, Sivaraman Balakrishnan, Aarti Singh, 2017, COLT.

Robust Nonparametric Regression under Huber's ε-contamination Model pdf

Sivaraman Balakrishnan, Pradeep Ravikumar, Aarti Singh, 2018, ArXiv.

Towards Understanding the Importance of Shortcut Connections in Residual Networks

Tuo Zhao, Simon S. Du, Enlu Zhou, 2019, NeurIPS.

Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks pdf

Ruosong Wang, Dingli Yu, Ruslan Salakhutdinov, 2019, ICLR.

High-Throughput Robotic Phenotyping of Energy Sorghum Crops

Barnabás Póczos, David Wettergreen, Simon S. Du, 2017, FSR.

Is Long Horizon RL More Difficult Than Short Horizon RL?

Ruosong Wang, Sham M. Kakade, Lin F. Yang, 2020, NeurIPS.

Understanding the acceleration phenomenon via high-resolution differential equations pdf

Michael I. Jordan, Simon S. Du, Bin Shi, 2018, Mathematical Programming.

On Exact Computation with an Infinitely Wide Neural Net

Ruosong Wang, Wei Hu, Sanjeev Arora, 2019, NeurIPS.

Provably efficient RL with Rich Observations via Latent State Decoding pdf

Nan Jiang, John Langford, Akshay Krishnamurthy, 2019, ICML.

How Many Samples are Needed to Estimate a Convolutional or Recurrent Neural Network

Ruslan Salakhutdinov, Yining Wang, Simon S. Du, 2018 .

Stochastic Zeroth-order Optimization in High Dimensions

Sivaraman Balakrishnan, Aarti Singh, Yining Wang, 2017, AISTATS.

Linear Convergence of the Primal-Dual Gradient Method for Convex-Concave Saddle Point Problems without Strong Convexity pdf

Wei Hu, Simon S. Du, S. Du, 2018, AISTATS.

Nearly Horizon-Free Offline Reinforcement Learning pdf

Sujay Sanghavi, Bo Dai, Simon S. Du, 2021, ArXiv.

Q-learning with Logarithmic Regret

Lin F. Yang, Simon S. Du, Kunhe Yang, 2020, AISTATS.

When is particle filtering efficient for planning in partially observed linear dynamical systems?

Simon S. Du, Ruoqi Shen, Wei Hu, 2020, UAI.

Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap pdf

Tengyu Ma, Simon S. Du, Haike Xu, 2021, COLT.

Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP

Zihan Zhang, Simon S. Du, Xiangyang Ji, 2021, NeurIPS.

Impact of Representation Learning in Linear Bandits

Simon S. Du, Wei Hu, Jason D. Lee, 2020, ICLR.

How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks pdf

Ken-ichi Kawarabayashi, Stefanie Jegelka, Simon S. Du, 2020, ICLR.

Improved Corruption Robust Algorithms for Episodic Reinforcement Learning pdf

Simon S. Du, Kevin G. Jamieson, Kevin Jamieson, 2021, ICML.

Gap-Dependent Bounds for Two-Player Markov Games pdf

Simon S. Du, Zhuoran Yang, Zhaoran Wang, 2021, AISTATS.

Near-Optimal Randomized Exploration for Tabular MDP

Simon S. Du, Ruoqi Shen, Qiwen Cui, 2021 .

Near-Linear Time Local Polynomial Nonparametric Estimation with Box Kernels

Yi Wu, Simon S. Du, Yining Wang, 2021, INFORMS J. Comput..

Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret pdf

Alessandro Lazaric, Simon S. Du, Matteo Pirotta, 2021, NeurIPS.

A Unified Framework for Conservative Exploration pdf

Alessandro Lazaric, Liwei Wang, Simon S. Du, 2021, ArXiv.

Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization pdf

Simon S. Du, Tian Ye, S. Du, 2021, NeurIPS.

Near-Optimal Randomized Exploration for Tabular Markov Decision Processes pdf

Simon S. Du, Ruoqi Shen, Zhihan Xiong, 2021, NeurIPS.

Towards Demystifying Representation Learning with Non-contrastive Self-supervision pdf

Simon S. Du, Xinlei Chen, Yuandong Tian, 2021, ArXiv.

On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP

Liwei Wang, Simon S. Du, Tianhao Wu, 2021, ICML.

Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP pdf

Xiangyang Ji, Zihan Zhang, Simon S. Du, 2021, ArXiv.

Nearly Optimal Policy Optimization with Stable at Any Time Guarantee pdf

Jiantao Jiao, Simon S. Du, Han Zhong, 2021, ArXiv.

On the Power of Multitask Representation Learning in Linear MDP pdf

Simon S. Du, Gao Huang, Rui Lu, 2021, ArXiv.

First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach pdf

Max Simchowitz, Simon S. Du, Kevin Jamieson, 2021, ArXiv.

Corruption Robust Active Learning pdf

Simon S. Du, Kevin Jamieson, Yifang Chen, 2021, ArXiv.

Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games pdf

Yuandong Tian, Jason D. Lee, Simon S. Du, 2021, ArXiv.

Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon

Xiangyang Ji, Zihan Zhang, Simon S. Du, 2021, COLT.

Bilinear Classes: A Structural Framework for Provable Generalization in RL

Shachar Lovett, Sham M. Kakade, Simon S. Du, 2021, ICML.

Provable Adaptation across Multiway Domains via Representation Learning pdf

Simon S. Du, Shaobo Han, Zhili Feng, 2021, ArXiv.

N ov 2 01 8 Gradient Descent Finds Global Minima of Deep Neural Networks

Jason D. Lee, Simon S. Du, Xiyu Zhai, 2018 .

Novel Quantization Strategies for Linear Prediction with Guarantees

Yichong Xu, Pulkit Grover, Simon S. Du, 2016 .