Low-cost and power-ecient thread collision detection scheme for shared caches in a real-time multithreaded embedded processor

This paper addresses an important issue in a real-time multithreaded embedded processor where several active hardware threads share the critical resources such as caches in the processor. Thread interferences or collisions could lead to severe performance degradations on the real-time threads. Although the cache interference issue on real- time multithreaded processors has been studied before, no cost-eective and simpler hardware solutions were proposed to maintain the single thread performance of the real-time thread while still letting the low-priority threads progress. A novel technique called collision tag is proposed to address these issues. Progressively, the collision tag scheme can be reduced into a more power-ecient form called collision bit vector, with almost no impact in the overall performance of

[1]  Dean M. Tullsen,et al.  Simultaneous multithreading: Maximizing on-chip parallelism , 1995, Proceedings 22nd Annual International Symposium on Computer Architecture.

[2]  Donald Yeung,et al.  Transparent threads: resource sharing in SMT processors for high single-thread performance , 2002, Proceedings.International Conference on Parallel Architectures and Compilation Techniques.

[3]  Emre Özer,et al.  Thread Priority-Aware Random Replacement in TLBs for a High-Performance Real-Time SMT Processor , 2007, Asia-Pacific Computer Systems Architecture Conference.

[4]  Nobuyuki Yamasaki,et al.  Prioritized SMT Architecture with IPC Control Method for Real-Time Processing , 2007, 13th IEEE Real Time and Embedded Technology and Applications Symposium (RTAS'07).

[5]  Sascha Uhrig,et al.  How to Enhance a Superscalar Processor to Provide Hard Real-Time Capable In-Order SMT , 2010, ARCS.

[6]  Steven Raasch,et al.  Applications of Thread Prioritization in SMT Processors , 1999 .

[7]  Rohit Jain,et al.  Soft real-time scheduling on simultaneous multithreaded processors , 2002, 23rd IEEE Real-Time Systems Symposium, 2002. RTSS 2002..

[8]  Alex Settle,et al.  Understanding the Impact of Inter-Thread Cache Interference on ILP in Modern SMT Processors , 2005, J. Instr. Level Parallelism.

[9]  Emre Ozer,et al.  Low-cost Techniques for Reducing Branch Context Pollution in a Soft Realtime Embedded Multithreaded Processor , 2007 .

[10]  Francisco J. Cazorla,et al.  Predictable performance in SMT processors , 2004, CF '04.

[11]  Krisztián Flautner,et al.  Energy-Efficient Simultaneous Thread Fetch from Different Cache Levels in a Soft Real-Time SMT Processor , 2008, SAMOS.

[12]  Francisco J. Cazorla,et al.  Characterizing the resource-sharing levels in the UltraSPARC T2 processor , 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[13]  Eric Rotenberg,et al.  Virtual multiprocessor: an analyzable, high-performance architecture for real-time computing , 2005, CASES '05.