Many-Core Acceleration of Vertical Plane Radio Wave Propagation Prediction

We accelerate a vertical plane radio wave propagation prediction application on GPUs. The application has abundant parallelism, but it exhibits uncoalesced memory accesses, divergent thread execution and contention for GPU register and memory resources – factors that make it challenging to accelerate on a GPU. We describe three acceleration strategies and optimizations to improve performance for each strategy. We evaluate the performance of the three strategies and their associated optimizations on GTX275 and GTX480 GPUs. We show that it is possible to obtain kernel speedups of up to 76× and overall application speedups of up to 48 × over the sequential CPU implementation. However, the dynamic nature of data accesses in the algorithm makes the best strategy/optimizations dependent on input data and target GPU.

[1]  K. Yee Numerical solution of initial boundary value problems involving maxwell's equations in isotropic media , 1966 .

[2]  Jie Zhang,et al.  A GPU approach to FDTD for radio coverage prediction , 2008, 2008 11th IEEE Singapore International Conference on Communication Systems.

[3]  Michael Reyer,et al.  Accelerating Radio Wave Propagation Predictions by Implementation on Graphics Hardware , 2007, 2007 IEEE 65th Vehicular Technology Conference - VTC2007-Spring.

[4]  Wen-mei W. Hwu,et al.  Optimization principles and application performance evaluation of a multithreaded GPU using CUDA , 2008, PPoPP.

[5]  R.L. Hamilton,et al.  Ray tracing as a design tool for radio networks , 1991, IEEE Network.

[6]  Yi Yang,et al.  A GPGPU compiler for memory optimization and parallelism management , 2010, PLDI '10.

[7]  Yoann Corre,et al.  Three-Dimensional Urban EM Wave Propagation Model for Radio Network Planning and Optimization Over Large Areas , 2009, IEEE Transactions on Vehicular Technology.

[8]  Harry R. Anderson,et al.  Fixed Broadband Wireless System Design , 2003 .

[9]  David M. Nicol,et al.  Acceleration of wireless channel simulation using GPUs , 2010, 2010 European Wireless Conference (EW).

[10]  Ali Akoglu,et al.  Parallel Implementation of the Irregular Terrain Model (ITM) for Radio Transmission Loss Prediction Using GPU and Cell BE Processors , 2011, IEEE Transactions on Parallel and Distributed Systems.

[11]  J. Deygout Multiple knife-edge diffraction of microwaves , 1966 .

[12]  Zhengqing Yun,et al.  Propagation prediction models for wireless communication systems , 2002 .

[13]  Wen-mei W. Hwu,et al.  CUDA-Lite: Reducing GPU Programming Complexity , 2008, LCPC.

[14]  Bradford Nichols,et al.  Pthreads programming , 1996 .

[15]  J. Epstein,et al.  An Experimental Study of Wave Propagation at 850 MC , 1953, Proceedings of the IRE.

[16]  J. D. Parsons,et al.  The Mobile Radio Propagation Channel , 1991 .

[17]  A. G. Longley,et al.  PREDICTION OF TROPOSPHERIC RADIO TRANSMISSION LOSS OVER IRREGULAR TERRAIN. A COMPUTER METHOD-1968 , 1968 .

[18]  J. Ramanujam,et al.  Automatic C-to-CUDA Code Generation for Affine Programs , 2010, CC.