Limplock: understanding the impact of limpware on scale-out cloud systems
暂无分享,去创建一个
Tanakorn Leesatapornwongsa | Haryadi S. Gunawi | Mingzhe Hao | Thanh Do | Tiratat Patana-anake | Thanh Do | Tanakorn Leesatapornwongsa | M. Hao | Tiratat Patana-anake
[1] David E. Culler,et al. SEDA: an architecture for well-conditioned, scalable internet services , 2001, SOSP.
[2] Scott Shenker,et al. Usenix Association 10th Usenix Symposium on Networked Systems Design and Implementation (nsdi '13) 185 Effective Straggler Mitigation: Attack of the Clones , 2022 .
[3] Van-Anh Truong,et al. Availability in Globally Distributed Storage Systems , 2010, OSDI.
[4] Evgenia Smirni,et al. Anomaly? application change? or workload change? towards automated detection of application performance anomaly and change , 2008, 2008 IEEE International Conference on Dependable Systems and Networks With FTCS and DCC (DSN).
[5] Shankar Pasupathy,et al. An analysis of latent sector errors in disk drives , 2007, SIGMETRICS '07.
[6] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[7] Albert G. Greenberg,et al. Reining in the Outliers in Map-Reduce Clusters using Mantri , 2010, OSDI.
[8] Michael I. Jordan,et al. Detecting large-scale system problems by mining console logs , 2009, SOSP '09.
[9] Adam Silberstein,et al. Benchmarking cloud serving systems with YCSB , 2010, SoCC '10.
[10] Randy H. Katz,et al. Improving MapReduce Performance in Heterogeneous Environments , 2008, OSDI.
[11] Benjamin Livshits,et al. AjaxScope: a platform for remotely monitoring the client-side behavior of web 2.0 applications , 2007, TWEB.
[12] Xiao Zhang,et al. CPI2: CPU performance isolation for shared compute clusters , 2013, EuroSys '13.
[13] Remzi H. Arpaci-Dusseau,et al. Run-time adaptation in river , 2003, TOCS.
[14] Haryadi S. Gunawi,et al. Impact of Limpware on HDFS: A Probabilistic Estimation , 2013, ArXiv.
[15] Rajeev Gandhi,et al. Black-Box Problem Diagnosis in Parallel File Systems , 2010, FAST.
[16] Haitao Wu,et al. ICTCP: Incast Congestion Control for TCP , 2010 .
[17] Archana Ganapathi,et al. The Case for Evaluating MapReduce Performance Using Workload Suites , 2011, 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems.
[18] Mona Attariyan,et al. X-ray: Automating Root-Cause Diagnosis of Performance Anomalies in Production Software , 2012, OSDI.
[19] Eduardo Pinheiro,et al. DRAM errors in the wild: a large-scale field study , 2009, SIGMETRICS '09.
[20] Gregory R. Ganger,et al. Diagnosing Performance Changes by Comparing Request Flows , 2011, NSDI.
[21] Byung-Gon Chun,et al. MegaPipe: A New Programming Interface for Scalable Network I/O , 2012, OSDI.
[22] Bianca Schroeder,et al. Disk Failures in the Real World: What Does an MTTF of 1, 000, 000 Hours Mean to You? , 2007, FAST.
[23] Tao Zou,et al. Making time-stepped applications tick in the cloud , 2011, SoCC.
[24] Gregory R. Ganger,et al. Argon: Performance Insulation for Shared Storage Servers , 2007, FAST.
[25] Shan Lu,et al. Understanding and detecting real-world performance bugs , 2012, PLDI.
[26] Lin Xiao,et al. YCSB++: benchmarking and performance debugging advanced features in scalable table stores , 2011, SoCC.
[27] Anees Shaikh,et al. Performance Isolation and Fairness for Multi-Tenant Cloud Storage , 2012, OSDI.
[28] Randy H. Katz,et al. Cake: enabling high-level SLOs on shared storage systems , 2012, SoCC '12.
[29] Navendu Jain,et al. Understanding network failures in data centers: measurement, analysis, and implications , 2011, SIGCOMM.
[30] Wilson C. Hsieh,et al. Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.
[31] Irfan Ahmad,et al. Pesto: online storage performance management in virtualized datacenters , 2011, SoCC.
[32] Erez Zadok,et al. DARC: dynamic analysis of root causes of latency distributions , 2008, SIGMETRICS '08.
[33] Mendel Rosenblum,et al. Fast crash recovery in RAMCloud , 2011, SOSP.
[34] Kashi Venkatesh Vishwanath,et al. Characterizing cloud computing hardware reliability , 2010, SoCC '10.
[35] Jeffrey S. Chase,et al. Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control , 2004, OSDI.
[36] Haitao Wu,et al. ICTCP: Incast Congestion Control for TCP in Data-Center Networks , 2010, IEEE/ACM Transactions on Networking.
[37] David E. Culler,et al. SEDA: An Architecture for Scalable, Well-Conditioned Internet Services , 2001 .
[38] Luiz André Barroso,et al. The tail at scale , 2013, CACM.
[39] Hairong Kuang,et al. The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).
[40] Helen J. Wang,et al. Automatic Misconfiguration Troubleshooting with PeerPressure , 2004, OSDI.
[41] Haryadi S. Gunawi,et al. The Case for Limping-Hardware Tolerant Clouds , 2013, HotCloud.
[42] Irfan Ahmad,et al. PARDA: Proportional Allocation of Resources for Distributed Storage Access , 2009, FAST.