Radioastronomy Image Synthesis on the Cell/B.E

Now that large radiotelescopes like SKA, LOFAR, or ASKAP, become available in different parts of the world, radioastronomers foresee a vast increase in the amount of data to gather, store and process. To keep the processing time bounded, parallelization and execution on (massively) parallel machines are required for the commonly-used radioastronomy software kernels. In this paper, we analyze data gridding and degridding, a very time-consuming kernel of radioastronomy image synthesis. To tackle its its dynamic behavior, we devise and implement a parallelization strategy for the Cell/B.E. multi-core processor, offering a cost-efficient alternative compared to classical supercomputers. Our experiments show that the application running on one Cell/B.E. is more than 20 times faster than the original application running on a commodity machine. Based on scalability experiments, we estimate the hardware requirements for a realistic radio-telescope. We conclude that our parallelization solution exposes an efficient way to deal with dynamic data-intensive applications on heterogeneous multi-core processors.

[1]  Samuel Williams,et al.  The potential of the cell processor for scientific computing , 2005, CF '06.

[2]  Tao Zhang,et al.  Supporting OpenMP on Cell , 2007, IWOMP.

[3]  I. Wald,et al.  Ray Tracing on the Cell Processor , 2006, 2006 IEEE Symposium on Interactive Ray Tracing.

[4]  Qiang Liu,et al.  Digital Media Indexing on the Cell Processor , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[5]  Michael Gschwind The Cell Broadband Engine: Exploiting Multiple Levels of Parallelism in a Chip Multiprocessor , 2007, International Journal of Parallel Programming.

[6]  Michael D. McCool Signal Processing and General-Purpose Computing on GPUs , 2007 .

[7]  John D. Bunton,et al.  A Radio Astronomy Correlator Optimized for the Xilinx Virtex-4 SX FPGA , 2007, 2007 International Conference on Field Programmable Logic and Applications.

[8]  H. J. Sips,et al.  The Performance of Gridding / Degridding on the Cell / B . , 2008 .

[9]  H. Peter Hofstee,et al.  Power efficient processor architecture and the cell processor , 2005, 11th International Symposium on High-Performance Computer Architecture.

[10]  William J. Dally,et al.  Sequoia: Programming the Memory Hierarchy , 2006, International Conference on Software Composition.

[11]  Chris Broekema,et al.  The Lofar Central Processing Facility Architecture , 2004 .

[12]  Fabrizio Petrini,et al.  Multicore Surprises: Lessons Learned from Optimizing Sweep3D on the Cell Broadband Engine , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[13]  Michael Mccool,et al.  Signal Processing and General-Purpose Computing and GPUs [Exploratory DSP] , 2007, IEEE Signal Processing Magazine.

[14]  Hiroshi Inoue,et al.  MPI microtask for programming the Cell Broadband Enginee , 2006 .

[15]  T. J. Cornwell SKA and EVLA Computing Costs for Wide Field Imaging , 2004 .

[16]  Alexandros Stamatakis,et al.  RAxML-Cell: Parallel Phylogenetic Tree Inference on the Cell Broadband Engine , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[17]  K. I. Kellermann,et al.  Preliminary Specifications for the Square Kilometre Array , 2008 .

[18]  K. Golap,et al.  W Projection: A New Algorithm for Wide Field Imaging with Radio Synthesis Arrays , 2005 .