Paper information - A Discrete Event Simulation Model for Understanding Kernel Lock Thrashing on Multi-core Architectures

A Discrete Event Simulation Model for Understanding Kernel Lock Thrashing on Multi-core Architectures

Multi-core architectures have become mainstream, and trends suggest that the number of cores integrated on a single chip will keep increasing. However, lock contention in operating systems can limit parallel scalability on multi-cores so severely that the speedup decreases as the number of cores increases (thrashing). Although the phenomenon is easy to reproduce experimentally, most existing lock models cannot reproduce it. To overcome this challenge, this paper develops a discrete event simulation model that captures both the sequential execution in critical sections and the contention for shared hardware resources. The model is evaluated using a series of typical parameter configurations that represent different degrees of lock contention. Experimental results suggest that the thrashing phenomenon can be observed when the model parameters are selected properly. To further understand this phenomenon, statistics such as the percentage of time spent waiting for locks and the number of cores waiting for a lock are used to characterize lock thrashing. In addition, the sensitivity of the model to changes in memory latency and hardware architecture is examined. Finally, we use this model to compare three methods that have been proposed for preventing lock thrashing.
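The kind of model the abstract describes can be illustrated with a minimal sketch. This is not the paper's model: it assumes a single FIFO lock, deterministic timings, and a hypothetical rule that each spinning core stretches the holder's critical section by a fraction `alpha` (a stand-in for contention on shared hardware). The names `simulate`, `speedup`, `think`, `cs`, and `alpha` are all illustrative assumptions.

```python
import heapq
from collections import deque


def simulate(n_cores, think=10.0, cs=1.0, alpha=0.1, horizon=10_000.0):
    """Return completed critical sections per unit time for n_cores.

    Assumed toy model: each core alternates `think` time outside the lock
    with one critical section. The hold time is cs * (1 + alpha * waiters),
    where `waiters` is the number of cores queued when the lock is granted.
    """
    events = []          # min-heap of (time, seq, kind, core)
    seq = 0              # tie-breaker so simultaneous events stay ordered
    for core in range(n_cores):
        heapq.heappush(events, (think, seq, "request", core))
        seq += 1

    waiting = deque()    # FIFO queue of cores spinning on the lock
    lock_held = False
    completed = 0

    def grant(now, core):
        """Give the lock to `core` and schedule its release."""
        nonlocal seq, lock_held
        lock_held = True
        hold = cs * (1.0 + alpha * len(waiting))
        heapq.heappush(events, (now + hold, seq, "release", core))
        seq += 1

    while events:
        now, _, kind, core = heapq.heappop(events)
        if now > horizon:
            break
        if kind == "request":
            if lock_held:
                waiting.append(core)
            else:
                grant(now, core)
        else:  # "release"
            completed += 1
            lock_held = False
            # The releasing core goes back to non-critical work.
            heapq.heappush(events, (now + think, seq, "request", core))
            seq += 1
            if waiting:
                grant(now, waiting.popleft())

    return completed / horizon


def speedup(n_cores, **params):
    """Throughput with n_cores relative to a single core."""
    return simulate(n_cores, **params) / simulate(1, **params)


for n in (1, 2, 4, 8, 16, 32):
    print(n, round(speedup(n), 2))
```

With these assumed parameters the speedup grows while the lock is unsaturated and then falls as more cores queue, which is the qualitative shape the abstract calls thrashing. Setting `alpha=0.0` removes the contention penalty, so the speedup flattens at saturation instead of falling.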
