Performance and Area Optimization of a Bundled-Data Intel Processor through Resynthesis

We describe a method that applies resynthesis to the bundled-data implementation of a production-level Intel architecture CPU (Quark) to improve performance and area. We present a two-step quadratic program that optimally adjusts the new flop-to-flop path constraints used for resynthesis. Experimental results show an average performance improvement of 25% at the same area cost.
