GPU-Accelerated Wire-Length Estimation for FPGA Placement

In the FPGA design flow, placement remains one of the most time-consuming stages, and is also crucial in terms of quality of result. HPWL and Star+ are widely used as cost metrics in FPGA placement for estimating the total wire-length of a candidate placement prior to routing. However, both wire-length models are expensive to compute requiring O(nm) time, where n is the number of nets and m is the average net cardinality. This paper proposes using the massively multi-threaded architecture provided by GPUs to reduce the time required to compute HPWL and Star+. First, a specialized set of data structures is developed for storing net-connectivity information on the GPU. Next, a study is performed to determine how to best map the data structures onto the GPU to exploit the heterogeneous memories and thread-level parallelism that are available. Finally, a study is performed to determine what effect circuit size and net cardinality have on the speedups that can be achieved. Overall, the results show that speedups of as much as 160x over a serial CPU implementation can be achieved for both models when tested using standard benchmarks.

[1]  Vaughn Betz,et al.  VPR: A new packing, placement and routing tool for FPGA research , 1997, FPL.

[2]  Pinaki Mazumder,et al.  VLSI cell placement techniques , 1991, CSUR.

[3]  Jason Cong,et al.  Parallel multi-level analytical global placement on graphics processing units , 2009, 2009 IEEE/ACM International Conference on Computer-Aided Design - Digest of Technical Papers.

[4]  Jason Luu,et al.  VPR 5.0: FPGA cad and architecture exploration tools with single-driver routing, heterogeneity and process scaling , 2009, FPGA '09.

[5]  Andrew A. Kennings,et al.  Improving Simulated Annealing-Based FPGA Placement With Directed Moves , 2009, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[6]  Jianwen Zhu,et al.  Towards scalable placement for FPGAs , 2010, FPGA '10.

[7]  Chih-Liang Eric Cheng RISA: accurate and efficient placement routability modeling , 1994, ICCAD.

[8]  Vaughn Betz,et al.  Architecture and CAD for Deep-Submicron FPGAS , 1999, The Springer International Series in Engineering and Computer Science.

[9]  Meng Yang,et al.  FPGA placement optimization by two-step unified Genetic Algorithm and Simulated Annealing algorithm , 2006 .

[10]  Ming Xu,et al.  Near-linear wirelength estimation for FPGA placement , 2009, 2009 Canadian Conference on Electrical and Computer Engineering.

[11]  Miguel A. Vega-Rodríguez,et al.  A FPGA Optimization Tool Based on a Multi-island Genetic Algorithm Distributed over Grid Environments , 2008, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID).