Proactive Serving Decreases User Delay Exponentially: The Light-Tailed Service Time Case

In online service systems, the delay experienced by users from service request to service completion is one of the most critical performance metrics. To improve user delay experience, recent industrial practices suggest a modern system design mechanism: proactive serving, where the service system predicts future user requests and allocates its capacity to serve these upcoming requests proactively. This approach complements the conventional mechanism of capability boosting. In this paper, we propose queuing models for online service systems with proactive serving capability and characterize the user delay reduction by proactive serving. In particular, we show that proactive serving decreases average delay exponentially (as a function of the prediction window size) in the cases where service time follows light-tailed distributions. Furthermore, the exponential decrease in user delay is robust against prediction errors (in terms of miss detection and false alarm) and user demand fluctuation. Compared with the conventional mechanism of capability boosting, proactive serving is more effective in decreasing delay when the system is in the light-load regime. Our trace-driven evaluations demonstrate the practical power of proactive serving: for example, for the data trace of light-tailed YouTube videos, the average user delay decreases by 50% when the system predicts 60 s ahead. Our results provide, from a queuing-theoretical perspective, justifications for the practical application of proactive serving in online service systems.

[1]  Stephan Sigg,et al.  Development of a novel context prediction algorithm and analysis of context prediction schemes , 2008 .

[2]  Kevin Wilson The Kindle Fire , 2014 .

[3]  Kuang Xu,et al.  Using Future Information to Reduce Waiting Times in the Emergency Department via Diversion , 2016, Manuf. Serv. Oper. Manag..

[4]  Jason Flinn,et al.  Informed mobile prefetching , 2012, MobiSys '12.

[5]  J. Kingman On Queues in Heavy Traffic , 1962 .

[6]  Minghua Chen,et al.  When Backpressure Meets Predictive Scheduling , 2013, IEEE/ACM Transactions on Networking.

[7]  A. D. Rao,et al.  Modified Weibull distribution for maximum and significant wave height simulation and prediction , 2007 .

[8]  Minghua Chen,et al.  Effect of proactive serving on user delay reduction in service systems , 2014, SIGMETRICS '14.

[9]  Alois Ferscha,et al.  Recognizing and Predicting Context by Learning from User Behavior 1 , 2003 .

[10]  Xifeng Yan,et al.  Workload characterization and prediction in the cloud: A multiple time series approach , 2012, 2012 IEEE Network Operations and Management Symposium.

[11]  Joel H. Spencer,et al.  Queueing with future information , 2014, PERV.

[12]  Xin Chen,et al.  Coordinated data prefetching for web contents , 2005, Comput. Commun..

[13]  Xin Chen,et al.  A Popularity-Based Prediction Model for Web Prefetching , 2003, Computer.

[14]  G. J. A. Stern,et al.  Queueing Systems, Volume 2: Computer Applications , 1976 .

[15]  Ron Kohavi,et al.  Online Experiments: Lessons Learned , 2007, Computer.

[16]  Sarah Williams,et al.  Computer applications , 1988 .

[17]  Richard F. Gunst,et al.  Applied Regression Analysis , 1999, Technometrics.

[18]  Stathes Hadjiefthymiades,et al.  Predicting the location of mobile users: a machine learning approach , 2009, ICPS '09.

[19]  Madhu Sudan,et al.  Queuing with future information , 2012 .

[20]  Atilla Eryilmaz,et al.  Proactive Resource Allocation: Harnessing the Diversity and Multicast Gains , 2011, IEEE Transactions on Information Theory.

[21]  Atilla Eryilmaz,et al.  Proactive Content Download and User Demand Shaping for Data Networks , 2013, IEEE/ACM Transactions on Networking.

[22]  N. Draper,et al.  Applied Regression Analysis: Draper/Applied Regression Analysis , 1998 .

[23]  Atilla Eryilmaz,et al.  Joint Smart Pricing and Proactive Content Caching for Mobile Services , 2016, IEEE/ACM Transactions on Networking.

[24]  Stanley B. Zdonik,et al.  Fido: A Cache That Learns to Fetch , 1991, VLDB.

[25]  Jiangchuan Liu,et al.  Understanding the Characteristics of Internet Short Video Sharing: YouTube as a Case Study , 2007, ArXiv.

[26]  Haifeng Yu,et al.  DRAM-page based prediction and prefetching , 2000, Proceedings 2000 International Conference on Computer Design.

[27]  Vincent S. Tseng,et al.  Efficient mining and prediction of user behavior patterns in mobile web systems , 2006, Inf. Softw. Technol..

[28]  Joseph Bak,et al.  The Residue Theorem , 2010 .

[29]  Ibrahim Matta,et al.  Describing and forecasting video access patterns , 2011, 2011 Proceedings IEEE INFOCOM.

[30]  J. Kingman Inequalities in the Theory of Queues , 1970 .

[31]  Arnold O. Allen,et al.  Probability, statistics and queueing theory - with computer science applications (2. ed.) , 1981, Int. CMG Conference.

[32]  Shaoquan Zhang Proactive Serving Decreases User Delay Exponentially , 2015, PERV.

[33]  Carla Schlatter Ellis,et al.  Practical prefetching techniques for parallel file systems , 1991, [1991] Proceedings of the First International Conference on Parallel and Distributed Information Systems.

[34]  Leonard Kleinrock,et al.  Theory, Volume 1, Queueing Systems , 1975 .

[35]  Ye Xu,et al.  Preference, context and communities: a multi-faceted approach to predicting smartphone app usage patterns , 2013, ISWC '13.

[36]  Jiangchuan Liu,et al.  Understanding the Characteristics of Internet Short Video Sharing: A YouTube-Based Measurement Study , 2013, IEEE Transactions on Multimedia.