Power and performance trade-offs for Space Time Adaptive Processing

Power efficiency - performance relative to power - is one of the most important concerns when designing RADAR processing systems. This paper analyzes power and performance trade-offs for a typical Space Time Adaptive Processing (STAP) application. We study STAP implementations for CUDA and OpenMP on two architectures, Intel Haswell Core I7-4770TE and NVIDIA Kayla with a GK208 GPU. We analyze the power and performance of STAP's computationally intensive kernels across the two hardware testbeds. We discuss an efficient parallel implementation for the Haswell CPU architecture. We also show the impact and trade-offs of GPU optimization techniques. The GPU architecture is able to process large size data sets without increase in power requirement. The use of shared memory has a significant impact on the power requirement for the GPU. Finally, we show that a balance between the use of shared memory and main memory access leads to an improved performance in a typical STAP application.

[1]  Stéphan Jourdan,et al.  Haswell: The Fourth-Generation Intel Core Processor , 2014, IEEE Micro.

[2]  R. W. Linderman,et al.  Real-time STAP demonstration on an embedded high performance computer , 1997, Proceedings of the 1997 IEEE National Radar Conference.

[3]  John K. Antonio,et al.  Reconfigurable computing for space-time adaptive processing , 1998, Proceedings. IEEE Symposium on FPGAs for Custom Computing Machines (Cat. No.98TB100251).

[4]  Pramod K. Varshney,et al.  Performance Evaluation of a Parallel Pipeline Computational Model for Space-Time Adaptive Processing , 2005, The Journal of Supercomputing.

[5]  W.L. Melvin,et al.  A STAP overview , 2004, IEEE Aerospace and Electronic Systems Magazine.

[6]  L. Trefethen,et al.  Numerical linear algebra , 1997 .

[7]  Kurt Keutzer,et al.  A Predictive Model for Solving Small Linear Algebra Problems in GPU Registers , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium.

[8]  Ryan K. Hersey,et al.  GPU-based space-time adaptive processing (STAP) for radar , 2013, 2013 IEEE High Performance Extreme Computing Conference (HPEC).

[9]  D J-F.,et al.  GPU processors for space surveillance and airborne radars . Towards highly efficient ARM-GPU architecture for embedded radar systems , 2013 .

[10]  Dennis Braunreiter,et al.  GPU implementations for fast factorizations of STAP covariance matrices , 2008, Optical Engineering + Applications.

[11]  Ronald T. Williams,et al.  RT_STAP: Real-Time Space-Time Adaptive Processing Benchmark , 1997 .

[12]  M.C. Wicks,et al.  Space-time adaptive processing: a knowledge-based perspective for airborne radar , 2006, IEEE Signal Processing Magazine.

[13]  I. Reed,et al.  Rapid Convergence Rate in Adaptive Arrays , 1974, IEEE Transactions on Aerospace and Electronic Systems.

[14]  R.C. DiPietro,et al.  Extended factored space-time processing for airborne radar systems , 1992, [1992] Conference Record of the Twenty-Sixth Asilomar Conference on Signals, Systems & Computers.

[15]  L.E. Brennan,et al.  Theory of Adaptive Radar , 1973, IEEE Transactions on Aerospace and Electronic Systems.