Predictive Cloud Computing with Big Data: Professional Golf and Tennis Forecasting [Application Notes]

Major Golf and Grand Slam Tennis tournaments such as Australian Open, The Masters, Roland Garros, United States Golf Association (USGA), Wimbledon, and United States Tennis Association (USTA) United States (US) Open provide real-time and historical sporting information to immerse a global fan base in the action. Each tournament provides realtime content, including streaming video, game statistics, scores, images, schedule of play, and text. Due to the game popularities, some of the web servers are heavily visited and some are not, therefore, we need a method to autonomously provision servers to provide a smooth user experience. Predictive Cloud Computing (PCC) has been developed to provide a smart allocation/deallocation of servers by combining ensembles of forecasts and predictive modeling to determine the future origin demand for web site content. PCC distributes processing through analytical pipelines that correlate streaming data, such as scores, media schedules, and player brackets with a future-simulated tournament state to measure predicted demand spikes for content. Social data streamed from Twitter provides social sentiment and popularity features used within predictive modeling. Data at rest, such as machine logs and web content, provide additional features for forecasting. While the duration of each tournament varies, the number of origin website requests range from 29,000 to 110,000 hits per minute. The PCC technology was developed and deployed to all Grand Slam tennis events and several major golf tournaments that took place in 2013 and to the present, which has decreased wasted computing consumption by over 50%. We propose a novel forecasting ensemble that includes residual, vector, historical, partial, adjusted, cubic and quadratic forecasters. In addition, we present several predictive models based on Multiple Regression as inputs into several of these forecasters. We conclude by empirically demonstrating that the predictive cloud technology is able to forecast the computing load on origin web servers for professional golf and tennis tournaments.

[1]  Guilherme Galante,et al.  A Survey on Cloud Computing Elasticity , 2012, 2012 IEEE Fifth International Conference on Utility and Cloud Computing.

[2]  Judith Kelner,et al.  Autonomic Cloud Computing: Giving Intelligence to Simpleton Nodes , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[3]  Naveen Sharma,et al.  Towards autonomic workload provisioning for enterprise Grids and clouds , 2009, 2009 10th IEEE/ACM International Conference on Grid Computing.

[4]  Navneet Kaur,et al.  Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[5]  Jeffrey O. Kephart,et al.  The Vision of Autonomic Computing , 2003, Computer.

[6]  Richard J. Bogdany,et al.  Resilient hosting in a continuously available virtualized environment , 2008, IBM Syst. J..

[7]  N. Prasath,et al.  Opinion mining and sentiment analysis on a Twitter data stream , 2012, International Conference on Advances in ICT for Emerging Regions (ICTer2012).

[8]  Kalyan Veeramachaneni,et al.  Bring Your Own Learner: A Cloud-Based, Data-Parallel Commons for Machine Learning , 2015, IEEE Computational Intelligence Magazine.

[9]  Zhenhuan Gong,et al.  PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[10]  Eric W. Brown,et al.  Making Watson fast , 2012, IBM J. Res. Dev..

[11]  P. O'Donoghue,et al.  A notational analysis of elite tennis strategy , 2001, Journal of sports sciences.

[12]  Arnold Neumaier,et al.  Introduction to Numerical Analysis , 2001 .

[13]  Xiaohui Gu,et al.  CloudScale: elastic resource scaling for multi-tenant cloud systems , 2011, SoCC.

[14]  Prashant J. Shenoy,et al.  Dynamic resource allocation for shared data centers using online measurements , 2003, IWQoS'03.

[15]  Steven Hand,et al.  Self-adaptive and self-configured CPU resource provisioning for virtualized servers using Kalman filters , 2009, ICAC '09.

[16]  Raymond Chiong,et al.  Forecasting interval time series using a fully complex-valued RBF neural network with DPSO and PSO algorithms , 2015, Inf. Sci..

[17]  Jerome A. Rolia,et al.  Capacity Management and Demand Prediction for Next Generation Data Centers , 2007, IEEE International Conference on Web Services (ICWS 2007).

[18]  Christopher Stewart,et al.  A Dollar from 15 Cents: Cross-Platform Management for Internet Services , 2008, USENIX Annual Technical Conference.

[19]  Shicong Meng,et al.  Tide: achieving self-scaling in virtualized datacenter management middleware , 2010, Middleware Industrial Track '10.

[20]  Rajkumar Buyya,et al.  The Aneka platform and QoS-driven resource provisioning for elastic applications on hybrid Clouds , 2012, Future Gener. Comput. Syst..

[21]  G. R. Lindsey Tatistical Data Useful for the Operation of a Baseball Team , 1959 .

[22]  Raymond Chiong,et al.  Hybrid filter-wrapper feature selection for short-term load forecasting , 2015, Eng. Appl. Artif. Intell..

[23]  T. Maclennan Moneyball: The Art of Winning an Unfair Game , 2005 .

[24]  Manish Marwah,et al.  Minimizing data center SLA violations and power consumption via hybrid resource provisioning , 2011, 2011 International Green Computing Conference and Workshops.

[25]  Graham J. Williams,et al.  Big Data Opportunities and Challenges: Discussions from Data Analytics Perspectives [Discussion Forum] , 2014, IEEE Computational Intelligence Magazine.

[26]  Jeffrey S. Chase,et al.  Learning Application Models for Utility Resource Planning , 2006, 2006 IEEE International Conference on Autonomic Computing.

[27]  Xiaohui Gu,et al.  AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service , 2013, ICAC.

[28]  Xiaoyun Zhu,et al.  1000 Islands: Integrated Capacity and Workload Management for the Next Generation Data Center , 2008, 2008 International Conference on Autonomic Computing.