Interconnect Criticality-Driven Delay Relaxation

Due to decreasing transistor sizes and increasing clock frequency, interconnect delay is a dominant factor in achieving timing closure in deep-submicrometer designs. In field programmable gate arrays (FPGA), interconnect delay is contributed by programmable routing switches. This increases the wire delay significantly. In FPGA devices, the interconnect delay is usually more than 40% of the total delay. Techniques like wire pipelining and retiming can manage delay of timing critical wires. However, the latency of the design limits the total pipelining in the design. Therefore, new techniques are needed at synthesis stage to consider the effect of critical wires in the design. In this paper, we propose an intuitive Critical Edge Reduction (CER) algorithm, which minimizes the number of critical wires on a maximal delay- budgeting solution under fixed latency constraint. We prove that this problem is NP-hard. We provide an integer linear programming formulation of the problem and an iterative heuristic algorithm (CER). During the course of our algorithm, we introduce multiple graph problems. We give a proof of NP-hardness of one such problem, which we call max arc-cost balancing problem. In our experiments, we present an in-depth analysis of tradeoff between various maximal budgetings and critical edge minimization. We implemented our design flow using a set of MediaBench datapaths on Xilinx VirtexE FPGA devices. Using our algorithm, the Xilinx Place-and-Route tool achieved timing closure, which is, on average, 2.8 times faster than using maximum budgeting. The resulting average clock period using CER algorithm outperforms the one using the maximum budgeting by 6%. Other results show similar advantages of critical edge minimization over traditional budgeting techniques.

[1]  How-Rern Lin,et al.  Power reduction by gate sizing with path-oriented slack calculation , 1995, Proceedings of ASP-DAC'95/CHDL'95/VLSI'95 with EDA Technofair.

[2]  David S. Johnson,et al.  The Planar Hamiltonian Circuit Problem is NP-Complete , 1976, SIAM J. Comput..

[3]  Alberto L. Sangiovanni-Vincentelli,et al.  On-chip communication design: roadblocks and avenues , 2003, First IEEE/ACM/IFIP International Conference on Hardware/ Software Codesign and Systems Synthesis (IEEE Cat. No.03TH8721).

[4]  Majid Sarrafzadeh,et al.  A unified theory of timing budget management , 2004, ICCAD 2004.

[5]  How-Rern Lin,et al.  Power reduction by gate sizing with path-oriented slack calculation , 1995, ASP-DAC '95.

[6]  Carl Ebeling,et al.  Exploration of pipelined FPGA interconnect structures , 2004, FPGA '04.

[7]  Miodrag Potkonjak,et al.  MediaBench: a tool for evaluating and synthesizing multimedia and communications systems , 1997, Proceedings of 30th Annual International Symposium on Microarchitecture.

[8]  Majid Sarrafzadeh,et al.  Unification of budgeting and placement , 1997, DAC.

[9]  Jason Cong,et al.  Architectural Synthesis Integrated with Global Placement for Multi-Cycle Communication , 2003, ICCAD 2003.

[10]  Jason Cong,et al.  Architecture-level synthesis for automatic interconnect pipelining , 2004, Proceedings. 41st Design Automation Conference, 2004..

[11]  Chak-Kuen Wong,et al.  An Algorithm to Compact a VLSI Symbolic Layout with Mixed Constraints , 1983, 20th Design Automation Conference Proceedings.

[12]  Elaheh Bozorgzadeh,et al.  Fast timing closure by interconnect criticality driven delay relaxation , 2005, ICCAD-2005. IEEE/ACM International Conference on Computer-Aided Design, 2005..

[13]  Ankur Srivastava,et al.  On effective slack management in postscheduling phase , 2005, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[14]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[15]  Hai Zhou,et al.  Retiming for wire pipelining in system-on-chip , 2003, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[16]  Ankur Srivastava,et al.  Budget Management with Applications , 2002, Algorithmica.

[17]  Monica S. Lam,et al.  Maximizing Multiprocessor Performance with the SUIF Compiler , 1996, Digit. Tech. J..

[18]  Majid Sarrafzadeh,et al.  A unified theory of timing budget management , 2004, IEEE/ACM International Conference on Computer Aided Design, 2004. ICCAD-2004..

[19]  Andrew B. Kahng,et al.  Min-max placement for large-scale timing optimization , 2002, ISPD '02.

[20]  Malgorzata Marek-Sadowska,et al.  Delay budgeting in sequential circuit with application on FPGA placement , 2003, DAC '03.

[21]  Wei Zhang,et al.  Exploiting VLIW schedule slacks for dynamic and leakage energy reduction , 2001, MICRO.

[22]  Majid Sarrafzadeh,et al.  Optimal integer delay budgeting on directed acyclic graphs , 2003, Proceedings 2003. Design Automation Conference (IEEE Cat. No.03CH37451).

[23]  Soheil Ghiasi,et al.  Efficient implementation selection via time budgeting: complexity analysis and leakage optimization case study , 2005, 2005 International Conference on Computer Design.

[24]  Endre Boros,et al.  Balancing Problems in Acyclic Networks , 1994, Discret. Appl. Math..

[25]  Ravi Nair,et al.  Generation of performance constraints for layout , 1989, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..