3 D seismic imaging through reverse-time migration on homogeneous and heterogeneous multi-core processors

Reverse-Time Migration (RTM) is a state-of-the-art technique in seismic acoustic imaging, because of the quality and integrity of the images it provides. Oil and gas companies trust RTM with crucial decisions on multi-million-dollar drilling investments. But RTM requires vastly more computational power than its predecessor techniques, and this has somewhat hindered its practical success. On the other hand, despite multi-core architectures promise to deliver unprecedented computational power, little attention has been devoted to mapping efficiently RTM to multi-cores. In this paper, we present a mapping of the RTM computational kernel to the IBM Cell/B.E. processor that reaches close-tooptimal performance. The kernel proves to be memory-bound and it achieves a 98% utilization of the peak memory bandwidth. Our Cell/B.E. implementation outperforms a traditional processor (PowerPC 970MP) in terms of performance (with an 15.0× speedup) and energy-efficiency (with a 10.0× increase in the GFlops/W delivered). Also, it is the fastest RTM implementation available to the best of our knowledge. These results increase the practical usability of RTM. Also, the RTM-Cell/B.E. combination proves to be a strong competitor in the seismic arena.

[1]  Samuel Williams,et al.  The potential of the cell processor for scientific computing , 2005, CF '06.

[2]  Jesús Labarta,et al.  Interfacing Computer Aided Parallelization and Performance Analysis , 2003, International Conference on Computational Science.

[3]  Fabrizio Petrini,et al.  High-speed string searching against large dictionaries on the Cell/B.E. Processor , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[4]  Jesús Labarta,et al.  An Evaluation of Marenostrum Performance , 2008, Int. J. High Perform. Comput. Appl..

[5]  Monica S. Lam,et al.  A data locality optimizing algorithm , 1991, PLDI '91.

[6]  L. Dagum,et al.  OpenMP: an industry standard API for shared-memory programming , 1998 .

[7]  A. Ray,et al.  Developing a finite difference time domain parallel code for nuclear electromagnetic field Simulation , 2006, IEEE Transactions on Antennas and Propagation.

[8]  Srinivas Aluru,et al.  Parallel biological sequence alignments on the Cell Broadband Engine , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[9]  Fabrizio Petrini,et al.  Cell Multiprocessor Communication Network: Built for Speed , 2006, IEEE Micro.

[10]  Fabrizio Petrini,et al.  Peak-Performance DFA-based String Matching on the Cell Processor , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[11]  George A. McMechan,et al.  A review of seismic acoustic imaging by reverse‐time migration , 1989, Int. J. Imaging Syst. Technol..

[12]  Samuel Williams,et al.  Lattice Boltzmann simulation optimization on leading multicore platforms , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[13]  Leonid Oliker,et al.  Impact of modern memory subsystems on cache optimizations for stencil computations , 2005, MSP '05.

[14]  Sadaf R. Alam,et al.  Balancing productivity and performance on the cell broadband engine , 2007, 2007 IEEE International Conference on Cluster Computing.

[15]  Chau-Wen Tseng,et al.  Tiling Optimizations for 3D Scientific Computations , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[16]  Guillaume Houzeaux,et al.  WAVE ACOUSTIC PROPAGATION FOR GEOPHYSICS IMAGING , FINITE DIFFERENCE vs FINITE ELEMENT METHODS COMPARISON AND BOUNDARY CONDITION TREATMENT , 2007 .

[17]  Samuel Williams,et al.  Scientific Computing Kernels on the Cell Processor , 2007, International Journal of Parallel Programming.