Paper Information - Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger


We formulate the problem of defogging as state estimation and future state prediction from previous, partial observations in the context of real-time strategy games. We propose to employ encoder-decoder neural networks for this task, and introduce proxy tasks and baselines for evaluation to assess their ability to capture basic game rules and high-level dynamics. By combining convolutional neural networks and recurrent networks, we exploit spatial and sequential correlations and train well-performing models on a large dataset of human games of StarCraft: Brood War. Finally, we demonstrate the relevance of our models to downstream tasks by applying them to enemy unit prediction in a state-of-the-art, rule-based StarCraft bot. We observe improvements in win rates against several strong community bots.
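To make the problem statement concrete, the sketch below illustrates state estimation from partial observations with a simple carry-forward rule: each map cell keeps the last enemy unit count seen there while it is hidden by fog. This is an illustration only, not the authors' encoder-decoder model and not necessarily one of their baselines. The grid encoding (`None` for a fogged cell), the function name `carry_forward`, and the default of zero units before first sight are all assumptions made for the example.

```python
from typing import List, Optional

# One frame of observations: enemy unit count per cell, None if the cell is fogged.
# (Hypothetical encoding chosen for this sketch.)
Grid = List[List[Optional[int]]]


def carry_forward(observations: List[Grid]) -> List[List[List[int]]]:
    """Estimate the full state at each step by reusing the last value seen per cell."""
    height, width = len(observations[0]), len(observations[0][0])
    # Assumption: a cell that has never been seen holds zero enemy units.
    memory = [[0] * width for _ in range(height)]
    estimates = []
    for frame in observations:
        for y in range(height):
            for x in range(width):
                if frame[y][x] is not None:  # cell is currently visible
                    memory[y][x] = frame[y][x]
        # Store a copy so later updates do not alter earlier estimates.
        estimates.append([row[:] for row in memory])
    return estimates


frames = [
    [[2, None], [None, None]],
    [[None, 3], [None, None]],
    [[0, None], [None, 1]],
]
estimates = carry_forward(frames)
print(estimates[-1])
```

A rule like this cannot anticipate units that move or are produced while out of sight, which is the kind of dynamics a learned forward model is meant to capture.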
