Forecasting Player Behavioral Data and Simulating In-Game Events

Understanding player behavior is fundamental in game data science. Video games evolve as players interact with the game, so being able to foresee player experience would help to ensure a successful game development. In particular, game developers need to evaluate beforehand the impact of in-game events. Simulation optimization of these events is crucial to increase player engagement and maximize monetization. We present an experimental analysis of several methods to forecast game-related variables, with two main aims: to obtain accurate predictions of in-app purchases and playtime in an operational production environment, and to perform simulations of in-game events in order to maximize sales and playtime. Our ultimate purpose is to take a step towards the data-driven development of games. The results suggest that, even though the performance of traditional approaches such as ARIMA is still better, the outcomes of state-of-the-art techniques like deep learning are promising. Deep learning comes up as a well-suited general model that could be used to forecast a variety of time series with different dynamic behaviors.

[1]  Anna Guitart,et al.  Churn Prediction in Mobile Social Games: Towards a Complete Assessment Using Survival Ensembles , 2016, 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[2]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[3]  Peter W. Glynn,et al.  Stochastic Simulation: Algorithms and Analysis , 2007 .

[4]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[5]  Bin Yu,et al.  Boosting with early stopping: Convergence and consistency , 2005, math/0508276.

[6]  Colin Chen Generalized additive mixed models , 2000 .

[7]  Richard A. Davis,et al.  Introduction to time series and forecasting , 1998 .

[8]  R. Tibshirani,et al.  Generalized Additive Models , 1991 .

[9]  Greg Ridgeway,et al.  Generalized Boosted Models: A guide to the gbm package , 2006 .

[10]  Spyros Makridakis,et al.  The M3-Competition: results, conclusions and implications , 2000 .

[11]  M. Otto,et al.  Outliers in Time Series , 1972 .

[12]  S. Wood mgcv:Mixed GAM Computation Vehicle with GCV/AIC/REML smoothness estimation , 2012 .

[13]  Julian Togelius,et al.  Artificial Intelligence and Games , 2018, Springer International Publishing.

[14]  A. Maria,et al.  Simulation Optimization: Methods And Applications , 1997, Winter Simulation Conference Proceedings,.

[15]  Christian Bauckhage,et al.  Clustering Game Behavior Data , 2015, IEEE Transactions on Computational Intelligence and AI in Games.

[16]  Alain Saas,et al.  Discovering playing patterns: Time series clustering of free-to-play game data , 2016, 2016 IEEE Conference on Computational Intelligence and Games (CIG).

[17]  Christian Bauckhage,et al.  A comparison of methods for player clustering via behavioral telemetry , 2013, FDG.

[18]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[19]  John G. Cragg Estimation and testing in time-series regression models with heteroscedastic disturbances , 1982 .

[20]  Konstantinos Nikolopoulos Advances in business and management forecasting , 2005 .

[21]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[22]  Lutz Prechelt,et al.  Early Stopping-But When? , 1996, Neural Networks: Tricks of the Trade.

[23]  R. K. Agrawal,et al.  An Introductory Study on Time Series Modeling and Forecasting , 2013, ArXiv.

[24]  S. Wood Thin plate regression splines , 2003 .

[25]  Christian Bauckhage,et al.  The Playtime Principle: Large-scale cross-games interest modeling , 2014, 2014 IEEE Conference on Computational Intelligence and Games.

[26]  Magy Seif El-Nasr,et al.  Game Analytics , 2013, Springer London.

[27]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[28]  Yoshua Bengio,et al.  Classification using discriminative restricted Boltzmann machines , 2008, ICML '08.

[29]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[30]  Michael Y. Hu,et al.  Forecasting with artificial neural networks: The state of the art , 1997 .

[31]  Takehisa Yairi,et al.  Anomaly Detection Using Autoencoders with Nonlinear Dimensionality Reduction , 2014, MLSDA'14.

[32]  H. Akaike A new look at the statistical model identification , 1974 .

[33]  Alois Knoll,et al.  Gradient boosting machines, a tutorial , 2013, Front. Neurorobot..

[34]  Amy Loutfi,et al.  A review of unsupervised feature learning and deep learning for time-series modeling , 2014, Pattern Recognit. Lett..

[35]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[36]  S. Wood Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models , 2011 .

[37]  L. Breiman Arcing the edge , 1997 .

[38]  Rob J Hyndman,et al.  Another look at measures of forecast accuracy , 2006 .

[39]  Christian Bauckhage,et al.  Predicting player churn in the wild , 2014, 2014 IEEE Conference on Computational Intelligence and Games.

[40]  John Salvatier,et al.  Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.

[41]  Peter L. Bartlett,et al.  Boosting Algorithms as Gradient Descent , 1999, NIPS.

[42]  Udo Sglavo,et al.  Business Forecasting: Practical Problems and Solutions , 2015 .

[43]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[44]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[45]  Dong Yu,et al.  Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..

[46]  Christian Bauckhage,et al.  How players lose interest in playing a game: An empirical study based on distributions of total playing times , 2012, 2012 IEEE Conference on Computational Intelligence and Games (CIG).

[47]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[48]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[49]  Rob J Hyndman,et al.  Automatic Time Series Forecasting: The forecast Package for R , 2008 .

[50]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[51]  S. Wood Generalized Additive Models: An Introduction with R , 2006 .

[52]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[53]  A. Ng Feature selection, L1 vs. L2 regularization, and rotational invariance , 2004, Twenty-first international conference on Machine learning - ICML '04.

[54]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[55]  Anna Guitart,et al.  Games and big data: A scalable multi-dimensional churn prediction model , 2017, 2017 IEEE Conference on Computational Intelligence and Games (CIG).

[56]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[57]  Ian Osband,et al.  Deep Learning for Time Series Modeling CS 229 Final Project Report , 2012 .

[58]  Kim Larsen,et al.  GAM : The Predictive Modeling Silver Bullet , 2015 .

[59]  Christian Bauckhage,et al.  Guns, swords and data: Clustering of player behavior in computer games in the wild , 2012, 2012 IEEE Conference on Computational Intelligence and Games (CIG).

[60]  Rob J Hyndman,et al.  25 years of time series forecasting , 2006 .

[61]  R. Tibshirani,et al.  Generalized Additive Models: Some Applications , 1987 .