An energy-delay product study on chip multi-processors for variable stage pipelining

Power management is a major concern for computer architects and system designers. As reported by the International Technology Roadmap for Semiconductors (ITRS), energy consumption has become one of the most dominant issues for the semiconductor industry when the size of transistors scales down from 22 to 11 nm nodes. In this regard, current existing techniques such as dynamic voltage scaling, clock gating, and the Complementary metal-oxide semiconductor technology have shown their physical limits; therefore, scaling will no longer be a valid strategy for achieving power-performance improvement. To overcome this critical issue in energy-efficient processor design, there is a clear demand for alternative solution. In this paper, an approach that provides a promising solution for energy reduction is proposed, by using a micro-architectural technique referred to as variable stage pipelining, which can be further validated and extended to different application domains such as mobile and desktop. An analytical model for evaluating the relationship between the number of cores and the pipeline stage depth in a chip multi-processor is also proposed, based on which the optimal pipeline depth for various metrics are calculated.

[1]  Eric Sprangle,et al.  Increasing processor performance by implementing deeper pipelines , 2002, ISCA.

[2]  Michael Gschwind,et al.  Optimizing pipelines for power and performance , 2002, MICRO.

[3]  Julien Boucaron,et al.  Dynamic Variable Stage Pipeline: an Implementation of its Control , 2009 .

[4]  Diana Marculescu,et al.  Analysis of dynamic voltage/frequency scaling in chip-multiprocessors , 2007, Proceedings of the 2007 international symposium on Low power electronics and design (ISLPED '07).

[5]  Yuh-Fang Tsai,et al.  Tools and Techniques for Leakage Power Analysis , 2005 .

[6]  James E. Smith,et al.  Optimal Pipelining in Supercomputers , 1986, ISCA.

[7]  Mahmut T. Kandemir,et al.  Leakage Current: Moore's Law Meets Static Power , 2003, Computer.

[8]  Norman P. Jouppi,et al.  The optimal logic depth per pipeline stage is 6 to 8 FO4 inverter delays , 2002, ISCA.

[10]  Allan Hartstein,et al.  The optimum pipeline depth for a microprocessor , 2002, ISCA.

[11]  Hajime Shimada,et al.  Optimal pipeline depth with pipeline stage unification adoption , 2007, CARN.

[12]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[13]  Allan Hartstein,et al.  Optimum Power/Performance Pipeline Depth , 2003, MICRO.

[14]  Anantha Chandrakasan,et al.  Optimal supply and threshold scaling for subthreshold CMOS circuits , 2002, Proceedings IEEE Computer Society Annual Symposium on VLSI. New Paradigms for VLSI Systems Design. ISVLSI 2002.

[15]  Massoud Pedram,et al.  Minimizing the power consumption of a Chip Multiprocessor under an average throughput constraint , 2010, 2010 11th International Symposium on Quality Electronic Design (ISQED).

[16]  Alain C. Diebold,et al.  2012 Updates to the International Technology Roadmap for Semiconductors (ITRS) Metrology Chapter | NIST , 2013 .

[17]  Alagan Anpalagan,et al.  Power management in multi-core processors using automatic dynamic pipeline stage unification , 2013, 2013 International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS).

[18]  A. Wang,et al.  Modeling and sizing for minimum energy operation in subthreshold circuits , 2005, IEEE Journal of Solid-State Circuits.

[19]  Eric Rotenberg,et al.  A case for dynamic pipeline scaling , 2002, CASES '02.

[20]  Yuan Taur,et al.  Fundamentals of Modern VLSI Devices , 1998 .

[21]  Shekhar Y. Borkar,et al.  Design challenges of technology scaling , 1999, IEEE Micro.