Interconnect effects in deep submicron implementation of high performance arithmetic architectures

The conventional trend in algorithm implementation has been the reliance on advancements in process technology in order to satisfy the ever-increasing demand for high-speed processors, and computational systems. As current device technology approaches sub-100nm minimum device size, not only does the device geometry decrease, but switching times, and operating voltages also scale down. These gains come at the expense of increased layout complexity, and a greater susceptibility to parasitic effects in the interconnections. In this paper we will briefly overview the challenges that digital designers will have to face in the imminent future, and will provide suggestions on algorithmic measures which may be taken in order to overcome some of these obstacles. To illustrate our point, we will present an analysis of a digital multiplication algorithm, which is predicted to outperform current schemes, for future technologies.

[1]  M.D. Ercegovac,et al.  Effect of wire delay on the design of prefix adders in deep-submicron technology , 2000, Conference Record of the Thirty-Fourth Asilomar Conference on Signals, Systems and Computers (Cat. No.00CH37154).

[2]  Bulent Basaran,et al.  Methodology for repeater insertion management in the RTL, layout, floorplan and fullchip timing databases of the Itanium microprocessor , 2000, ISPD '00.

[3]  Michael J. Flynn,et al.  Technology Scaling Effects on Multipliers , 1998, IEEE Trans. Computers.

[4]  L. Dadda,et al.  The evolution of computer architectures , 1991, [1991] Proceedings, Advanced Computer Technology, Reliable Systems and Applications.

[5]  Behrooz Parhami,et al.  Computer arithmetic - algorithms and hardware designs , 1999 .

[6]  Yao-Wen Chang,et al.  Optimal reliable crosstalk-driven interconnect optimization , 2000, ISPD '00.

[7]  Keith A. Jenkins,et al.  When are transmission-line effects important for on-chip interconnections? , 1997 .

[8]  Jan M. Rabaey Exploring the Power Dimension , 1996 .

[9]  Vikas Agarwal,et al.  Clock rate versus IPC: the end of the road for conventional microarchitectures , 2000, Proceedings of 27th International Symposium on Computer Architecture (IEEE Cat. No.RS00201).

[10]  E.E. Swartzlander,et al.  Interconnection effects in fast multipliers , 1999, Conference Record of the Thirty-Third Asilomar Conference on Signals, Systems, and Computers (Cat. No.CH37020).

[11]  Rob A. Rutenbar,et al.  Wire packing: a strong formulation of crosstalk-aware chip-level track/layer assignment with an efficient integer programming solution , 2000, ISPD '00.

[12]  Ken Mai,et al.  The future of wires , 2001, Proc. IEEE.

[13]  E.E. Swartzlander,et al.  A recursive fast multiplier , 1998, Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284).

[14]  Jan M. Rabaey,et al.  A partitioning scheme for optimizing interconnect power , 1997, IEEE J. Solid State Circuits.

[15]  Graham A. Jullien,et al.  On the use of 4:2 compressors for partial product reduction , 2003, CCECE 2003 - Canadian Conference on Electrical and Computer Engineering. Toward a Caring and Humane Technology (Cat. No.03CH37436).

[16]  Christof Paar,et al.  A New Architecture for a Parallel Finite Field Multiplier with Low Complexity Based on Composite Fields , 1996, IEEE Trans. Computers.

[17]  Anatolij A. Karatsuba,et al.  Multiplication of Multidigit Numbers on Automata , 1963 .

[18]  Malgorzata Marek-Sadowska,et al.  Crosstalk reduction for VLSI , 1997, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[19]  Eby G. Friedman,et al.  Optimizing Inductive Interconnect for Low Power , 2003 .

[20]  Kaustav Banerjee,et al.  Analysis and optimization of thermal issues in high-performance VLSI , 2001, ISPD '01.

[21]  Kaustav Banerjee,et al.  Inductance aware interconnect scaling , 2002, Proceedings International Symposium on Quality Electronic Design.

[22]  Andreas C. Cangellaris The interconnect bottleneck in multi-GHz processors; new opportunities for hybrid electrical/optical solutions , 1998, Proceedings. Fifth International Conference on Massively Parallel Processing (Cat. No.98EX182).

[23]  Kaustav Banerjee,et al.  Analysis of on-chip inductance effects for distributed RLC interconnects , 2002, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[24]  David J. Sager,et al.  A 0 . 18-m CMOS IA-32 Processor With a 4-GHz Integer Execution Unit , 2001 .

[25]  William J. Dally,et al.  Smart Memories: a modular reconfigurable architecture , 2000, ISCA '00.

[26]  Majid Ahmadi,et al.  A reconfigurable digital multiplier architecture , 2003, CCECE 2003 - Canadian Conference on Electrical and Computer Engineering. Toward a Caring and Humane Technology (Cat. No.03CH37436).

[27]  T. Trick,et al.  A simple formula for the estimation of the capacitance of two-dimensional interconnects in VLSI circuits , 1982, IEEE Electron Device Letters.

[28]  Earl E. Swartzlander,et al.  Analysis of column compression multipliers , 2001, Proceedings 15th IEEE Symposium on Computer Arithmetic. ARITH-15 2001.