Paper information - The Advantage of Doubling: A Deep Reinforcement Learning Approach to Studying the Double Team in the NBA

During the 2017 NBA playoffs, Celtics coach Brad Stevens was faced with a difficult decision when defending against the Cavaliers: "Do you double and risk giving up easy shots, or stay at home and do the best you can?" It's a tough call, but finding a good defensive strategy that effectively incorporates doubling can make all the difference in the NBA. In this paper, we analyze double teaming in the NBA, quantifying the trade-off between risk and reward. Using player trajectory data from over 643,000 possessions, we identified when the ball handler was double teamed. Given these data and the corresponding outcome (i.e., whether the defense was successful), we used deep reinforcement learning to estimate the quality of the defensive actions. We present qualitative and quantitative results summarizing our learned defensive strategy. We show that our policy value estimates are predictive of points per possession and win percentage. Overall, the proposed framework represents a step toward a more comprehensive understanding of defensive strategies in the NBA.
