论文信息 - Proactive Serving Decreases User Delay Exponentially

Proactive Serving Decreases User Delay Exponentially

In online service systems, the delay experienced by a user from the service request to the service completion is one of the most critical performance metrics. To improve user delay experience, recent industrial practice suggests a modern system design mechanism: proactive serving, where the system predicts future user requests and allocates its capacity to serve these upcoming requests proactively. In this paper, we investigate the fundamentals of proactive serving from a theoretical perspective. In particular, we show that proactive serving decreases average delay exponentially (as a function of the prediction window size). Our results provide theoretical foundations for proactive serving and shed light on its application in practical systems.

Shaoquan Zhang

[1] Carla Schlatter Ellis,et al. Practical prefetching techniques for parallel file systems , 1991, [1991] Proceedings of the First International Conference on Parallel and Distributed Information Systems.

[2] Cheng-Hsin Hsu,et al. SmartTransfer: transferring your mobile multimedia contents at the "right" time , 2012, NOSSDAV '12.

[3] Ye Xu,et al. Preference, context and communities: a multi-faceted approach to predicting smartphone app usage patterns , 2013, ISWC '13.

[4] Ibrahim Matta,et al. Describing and forecasting video access patterns , 2011, 2011 Proceedings IEEE INFOCOM.

[5] Stathes Hadjiefthymiades,et al. Predicting the location of mobile users: a machine learning approach , 2009, ICPS '09.

[6] James R. Larus,et al. Branch prediction for free , 1993, PLDI '93.

[7] Vincenzo Grassi,et al. Modeling and evaluation of prefetching policies for context-aware information services , 1998, MobiCom '98.

[8] Joseph Bak,et al. The Residue Theorem , 2010 .

[9] Asuman E. Ozdaglar,et al. On the Delay and Throughput Gains of Coding in Unreliable Networks , 2008, IEEE Transactions on Information Theory.

[10] Vincent S. Tseng,et al. Efficient mining and prediction of user behavior patterns in mobile web systems , 2006, Inf. Softw. Technol..

[11] Stanley B. Zdonik,et al. Fido: A Cache That Learns to Fetch , 1991, VLDB.

[12] Jeffrey C. Mogul,et al. Using predictive prefetching to improve World Wide Web latency , 1996, CCRV.

[13] Dvir Shabtay,et al. Scheduling unit length jobs on parallel machines with lookahead information , 2011, J. Sched..

[14] Minghua Chen,et al. Effect of proactive serving on user delay reduction in service systems , 2014, SIGMETRICS '14.

[15] Richard W. Vuduc,et al. When Prefetching Works, When It Doesn’t, and Why , 2012, TACO.

[16] Atilla Eryilmaz,et al. Proactive Data Download and User Demand Shaping for Data Networks , 2013, ArXiv.

[17] D. Zats,et al. DeTail: reducing the flow completion time tail in datacenter networks , 2012, CCRV.

[18] Jason Flinn,et al. Informed mobile prefetching , 2012, MobiSys '12.

[19] S. Wittevrongel,et al. Queueing Systems , 2019, Introduction to Stochastic Processes and Simulation.

[20] Mark Handley,et al. The resource pooling principle , 2008, CCRV.

[21] Seung Jun Baek,et al. Reducing delays by network coding for wireless broadcasting in networks using relay stations , 2012, 2012 IEEE 23rd International Symposium on Personal, Indoor and Mobile Radio Communications - (PIMRC).

[22] G. Veciana,et al. Throughput optimality of delay-driven MaxWeight scheduler for a wireless system with flow dynamics , 2009, 2009 47th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[23] Michael J. Neely. Delay-Based Network Utility Maximization , 2013, IEEE/ACM Transactions on Networking.

[24] Ness B. Shroff,et al. Delay-based Back-Pressure scheduling in multi-hop wireless networks , 2011, INFOCOM.

[25] B. Chandrasekaran. Survey of Network Traffic Models , 2006 .

[26] Joel H. Spencer,et al. Queueing with future information , 2014, PERV.

[27] Xin Chen,et al. Coordinated data prefetching for web contents , 2005, Comput. Commun..

[28] Brian D. Noble,et al. BreadCrumbs: forecasting mobile connectivity , 2008, MobiCom '08.

[29] Haifeng Yu,et al. DRAM-page based prediction and prefetching , 2000, Proceedings 2000 International Conference on Computer Design.

[30] Matthew Andrews,et al. Providing quality of service over a shared wireless link , 2001, IEEE Commun. Mag..

[31] Rex K. Kincaid,et al. A look-ahead heuristic for scheduling jobs with release dates on a single machine , 1994, Comput. Oper. Res..

[32] Xifeng Yan,et al. Workload characterization and prediction in the cloud: A multiple time series approach , 2012, 2012 IEEE Network Operations and Management Symposium.

[33] Brian D. Noble,et al. Bobtail: Avoiding Long Tails in the Cloud , 2013, NSDI.

[34] Arnold O. Allen,et al. Probability, statistics and queueing theory - with computer science applications (2. ed.) , 1981, Int. CMG Conference.

[35] Xin Chen,et al. A Popularity-Based Prediction Model for Web Prefetching , 2003, Computer.

[36] Stephan Sigg,et al. Development of a novel context prediction algorithm and analysis of context prediction schemes , 2008 .

[37] Murray Hill,et al. SCHEDULING IN A QUEUING SYSTEM WITH ASYNCHRONOUSLY VARYING SERVICE RATES , 2004 .

[38] Minghua Chen,et al. When Backpressure Meets Predictive Scheduling , 2013, IEEE/ACM Transactions on Networking.

[39] G. J. A. Stern,et al. Queueing Systems, Volume 2: Computer Applications , 1976 .

[40] Terri Watson,et al. Application Design for Wireless Computing , 1994, 1994 First Workshop on Mobile Computing Systems and Applications.

[41] Hai Jin,et al. A Measurement Study of a Peer-to-Peer Video-on-Demand System , 2007, IPTPS.

[42] Milica Stojanovic,et al. On Coding for Delay—Network Coding for Time-Division Duplexing , 2012, IEEE Transactions on Information Theory.

[43] Ron Kohavi,et al. Online Experiments: Lessons Learned , 2007, Computer.

[44] Ben Coleman. Quality vs. Performance in Lookahead Scheduling , 2006, JCIS.

[45] Aarnout Brombacher,et al. Probability... , 2009, Qual. Reliab. Eng. Int..

[46] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[47] Klara Nahrstedt,et al. Peer-to-peer multimedia streaming and caching service , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[48] M. Medard,et al. On Delay Performance Gains From Network Coding , 2006, 2006 40th Annual Conference on Information Sciences and Systems.

[49] Jim Zelenka,et al. Informed prefetching and caching , 1995, SOSP.

[50] Alois Ferscha,et al. Recognizing and Predicting Context by Learning from User Behavior 1 , 2003 .

[51] Madhu Sudan,et al. Queuing with future information , 2012 .

[52] Kannan Ramchandran,et al. Codes can reduce queueing delay in data centers , 2012, 2012 IEEE International Symposium on Information Theory Proceedings.

[53] Atilla Eryilmaz,et al. Proactive Resource Allocation: Harnessing the Diversity and Multicast Gains , 2011, IEEE Transactions on Information Theory.

[54] Lizy Kurian John,et al. Store-Load-Branch (SLB) predictor: A compiler assisted branch prediction for data dependent branches , 2013, 2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA).

[55] Alexander L. Stolyar,et al. Scheduling for multiple flows sharing a time-varying channel: the exponential rule , 2000 .