Subscriber-Driven Interference Detection for Cloud-Based Web Services

Web services are now increasingly being hosted on public cloud infrastructure as a service platforms such as the Amazon Web service elastic compute cloud (EC2). However, previous studies have shown that the virtualized infrastructure used in public clouds can introduce contention among virtual machines (VMs) for shared physical host resources eventually leading to performance problems. Subscribers in a public cloud platform typically do not have access to metrics that can directly quantify the adverse impact of such inter-VM interference on Web service response times. We present a software probe based system to address this limitation. The probe is a lightweight application that runs on each Web service VM that needs to be monitored. We periodically measure the probe’s response time on a monitored VM. We then compare this response time with the probe’s previously recorded baseline no-interference response time when it executes in isolation on a VM of the same type. Statistically significant increase in the probe’s response time from the baseline is used to detect interference. The probe also indicates the type of contention at the physical host that causes the interference. This information can be exploited by a subscriber to mitigate the problem. Results show that our approach is quite effective over two different cloud platforms and a wide variety of workload scenarios. In particular, results indicate that Web service instances hosted on EC2 suffer from interference. Our probe was able to detect 93% of performance degradations triggered by such interference. In all these cases, the probe imposed an average overhead of only 3%–4% on the mean response time of the Web service being monitored.

[1]  Minaxi Gupta,et al.  Revisiting Web Server Workload Invariants in the Context of Scientific Web Sites , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[2]  Martin F. Arlitt,et al.  Web server workload characterization: the search for invariants , 1996, SIGMETRICS '96.

[3]  Tommaso Cucinotta,et al.  The effects of scheduling, workload type and consolidation scenarios on virtual machine performance and their prediction through optimized artificial neural networks , 2011, J. Syst. Softw..

[4]  Christina Delimitrou,et al.  Paragon: QoS-aware scheduling for heterogeneous datacenters , 2013, ASPLOS '13.

[5]  Anshul Gandhi,et al.  UIE: User-Centric Interference Estimation for Cloud Applications , 2016, 2016 IEEE International Conference on Cloud Engineering (IC2E).

[6]  Vladimir Vlassov,et al.  Stay-Away, protecting sensitive applications from performance interference , 2014, Middleware.

[7]  Margaret Fisher,et al.  Performance measurement tools , 2005 .

[8]  Diwakar Krishnamurthy,et al.  Performance Testing Web Applications on the Cloud , 2014, 2014 IEEE Seventh International Conference on Software Testing, Verification and Validation Workshops.

[9]  Ziming Zhang,et al.  Efficient and Accurate Anomaly Identification Using Reduced Metric Space in Utility Clouds , 2012, 2012 IEEE Seventh International Conference on Networking, Architecture, and Storage.

[10]  Kevin Skadron,et al.  Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations , 2011, 2011 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[11]  Alexandru Iosup,et al.  On the Performance Variability of Production Cloud Services , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[12]  Jerome A. Rolia,et al.  Resource Contention Detection in Virtualized Environments , 2015, IEEE Transactions on Network and Service Management.

[13]  Song Jiang,et al.  Workload analysis of a large-scale key-value store , 2012, SIGMETRICS '12.

[14]  Bowen Zhou,et al.  Mitigating interference in cloud services by middleware reconfiguration , 2014, Middleware.

[15]  Xiaohui Gu,et al.  PREPARE: Predictive Performance Anomaly Prevention for Virtualized Cloud Systems , 2012, 2012 IEEE 32nd International Conference on Distributed Computing Systems.

[16]  Alexandru Iosup,et al.  An Early Performance Analysis of Cloud Computing Services for Scientific Computing , 2008 .

[17]  Giuliano Casale,et al.  A Feasibility Study of Host-Level Contention Detection by Guest Virtual Machines , 2013, 2013 IEEE 5th International Conference on Cloud Computing Technology and Science.

[18]  Jie Liu,et al.  Algorithm Design for Performance Aware VM Consolidation , 2013 .

[19]  Ricardo Bianchini,et al.  DeepDive: Transparently Identifying and Managing Performance Interference in Virtualized Environments , 2013, USENIX Annual Technical Conference.

[20]  T. S. Eugene Ng,et al.  The Impact of Virtualization on Network Performance of Amazon EC2 Data Center , 2010, 2010 Proceedings IEEE INFOCOM.

[21]  Alexandra Fedorova,et al.  Contention-Aware Scheduling on Multicore Systems , 2010, TOCS.

[22]  Song Fu,et al.  Adaptive Anomaly Identification by Exploring Metric Subspace in Cloud Computing Infrastructures , 2013, 2013 IEEE 32nd International Symposium on Reliable Distributed Systems.

[23]  Feng Wang,et al.  A deep investigation into network performance in virtual machine based cloud environments , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[24]  Aman Kansal,et al.  Q-clouds: managing performance interference effects for QoS-aware clouds , 2010, EuroSys '10.

[25]  Guillaume Pierre,et al.  EC2 Performance Analysis for Resource Provisioning of Service-Oriented Applications , 2009, ICSOC/ServiceWave Workshops.

[26]  Song Fu,et al.  Performance Metric Selection for Autonomic Anomaly Detection on Cloud Computing Systems , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[27]  Jerome A. Rolia,et al.  Resource contention detection and management for consolidated workloads , 2013, 2013 IFIP/IEEE International Symposium on Integrated Network Management (IM 2013).

[28]  Diwakar Krishnamurthy,et al.  Managing Performance Interference in Cloud-Based Web Services , 2015, IEEE Transactions on Network and Service Management.

[29]  Diwakar Krishnamurthy,et al.  Detecting performance interference in cloud-based web services , 2015, 2015 IFIP/IEEE International Symposium on Integrated Network Management (IM).