A practical automated timing and physical design implementation methodology for the synchronous asynchronous interface and multi-voltage domain in high-speed synthesis

In a high-speed synthesis design environment, designers struggle to ensure that multi-clock and multi-power interfaces are designed, placed, connected and timed correctly. Identifying and applying proper timing constraints such as "no cycle stealing" at synchronous and asynchronous domain interfaces in macro synthesis, unit and chip timing are essential. Standard cell library characterization, for multi-power and timing challenges due to an additional delay for level translator circuitry, demand careful implementation in a high-speed synthesis methodology. We propose a pseudo algorithm and methodology for synthesis and timing that will correctly identify synchronous and asynchronous interfaces. Our proposed methodology shows how these interface paths should be excluded from "cycle stealing" and yet, take full advantage of slack borrowing for the rest of the design. We find ~28% and ~80% timing path improvement in two of the units for IBM's Power8TM (P8) microprocessor. As an alternative to high-effort custom design, we develop a synthesis-based physical design methodology that incorporates the use of a level translator, enabling designers to address major issues that encompass dual-voltage solutions in high-speed design. We find ~50% physical design effort savings using this methodology. P8 is a 12-core, 649 mm2, 4.2B transistor chip fabricated in IBM's 22-nm Silicon On Insulator (SOI) technology, which is fully functional to support a wide range of high performing systems with an operating frequency greater than 4.5GHz.

[1]  Hai Zhou,et al.  An efficient algorithm for multi-domain clock skew scheduling , 2011, 2011 Design, Automation & Test in Europe.

[2]  Renu Mehra Commercial low-power EDA tools: a review , 2012, ISLPED '12.

[3]  Yih-Lang Li,et al.  Routing congestion estimation with real design constraints , 2013, 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC).

[4]  Haifeng Qian,et al.  Design methodology for the IBM POWER7 microprocessor , 2011, IBM J. Res. Dev..

[5]  Li Li,et al.  Optimal multi-domain clock skew scheduling , 2011, 2011 48th ACM/EDAC/IEEE Design Automation Conference (DAC).

[6]  Vikas Agarwal,et al.  Synthesis-based design and implementation methodology of high-speed, high-performing unit: L2 cache unit design , 2015, Integr..

[7]  Andreas Kuehlmann,et al.  Multi-Domain Clock Skew Scheduling , 2003, ICCAD.

[8]  E. Fluhr,et al.  Design and Implementation of the POWER6 Microprocessor , 2008, IEEE Journal of Solid-State Circuits.

[9]  Wolfgang Ecker,et al.  The semantic of the power intent format UPF: Consistent power modeling from system level to implementation , 2013, 2013 23rd International Workshop on Power and Timing Modeling, Optimization and Simulation (PATMOS).

[10]  Robert C. Aitken,et al.  Low Power Methodology Manual - for System-on-Chip Design , 2007 .

[11]  Louise Trevillyan,et al.  An integrated environment for technology closure of deep-submicron IC designs , 2004, IEEE Design & Test of Computers.

[12]  H. Mahmoodi,et al.  Low power design flow based on Unified Power Format and Synopsys tool chain , 2013, 2013 3rd Interdisciplinary Engineering Design Education Conference.

[13]  Baris Taskin,et al.  Time borrowing and clock skew scheduling effects on multi-phase level-sensitive circuits , 2004, 2004 IEEE International Symposium on Circuits and Systems (IEEE Cat. No.04CH37512).

[14]  Chris J. Myers,et al.  Interfacing synchronous and asynchronous modules within a high-speed pipeline , 2000, IEEE Trans. Very Large Scale Integr. Syst..

[15]  Mozammel Hossain,et al.  Physical design and implementation of POWER8™ (P8) server class processor , 2015, 2015 IEEE 58th International Midwest Symposium on Circuits and Systems (MWSCAS).