论文信息 - 3 D seismic imaging through reverse-time migration on homogeneous and heterogeneous multi-core processors

3 D seismic imaging through reverse-time migration on homogeneous and heterogeneous multi-core processors

Reverse-Time Migration (RTM) is a state-of-the-art technique in seismic acoustic imaging, because of the quality and integrity of the images it provides. Oil and gas companies trust RTM with crucial decisions on multi-million-dollar drilling investments. But RTM requires vastly more computational power than its predecessor techniques, and this has somewhat hindered its practical success. On the other hand, despite multi-core architectures promise to deliver unprecedented computational power, little attention has been devoted to mapping efficiently RTM to multi-cores. In this paper, we present a mapping of the RTM computational kernel to the IBM Cell/B.E. processor that reaches close-tooptimal performance. The kernel proves to be memory-bound and it achieves a 98% utilization of the peak memory bandwidth. Our Cell/B.E. implementation outperforms a traditional processor (PowerPC 970MP) in terms of performance (with an 15.0× speedup) and energy-efficiency (with a 10.0× increase in the GFlops/W delivered). Also, it is the fastest RTM implementation available to the best of our knowledge. These results increase the practical usability of RTM. Also, the RTM-Cell/B.E. combination proves to be a strong competitor in the seismic arena.

[1] Samuel Williams,et al. The potential of the cell processor for scientific computing , 2005, CF '06.

[2] Jesús Labarta,et al. Interfacing Computer Aided Parallelization and Performance Analysis , 2003, International Conference on Computational Science.

[3] Fabrizio Petrini,et al. High-speed string searching against large dictionaries on the Cell/B.E. Processor , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[4] Jesús Labarta,et al. An Evaluation of Marenostrum Performance , 2008, Int. J. High Perform. Comput. Appl..

[5] Monica S. Lam,et al. A data locality optimizing algorithm , 1991, PLDI '91.

[6] L. Dagum,et al. OpenMP: an industry standard API for shared-memory programming , 1998 .

[7] A. Ray,et al. Developing a finite difference time domain parallel code for nuclear electromagnetic field Simulation , 2006, IEEE Transactions on Antennas and Propagation.

[8] Srinivas Aluru,et al. Parallel biological sequence alignments on the Cell Broadband Engine , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[9] Fabrizio Petrini,et al. Cell Multiprocessor Communication Network: Built for Speed , 2006, IEEE Micro.

[10] Fabrizio Petrini,et al. Peak-Performance DFA-based String Matching on the Cell Processor , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[11] George A. McMechan,et al. A review of seismic acoustic imaging by reverse‐time migration , 1989, Int. J. Imaging Syst. Technol..

[12] Samuel Williams,et al. Lattice Boltzmann simulation optimization on leading multicore platforms , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[13] Leonid Oliker,et al. Impact of modern memory subsystems on cache optimizations for stencil computations , 2005, MSP '05.

[14] Sadaf R. Alam,et al. Balancing productivity and performance on the cell broadband engine , 2007, 2007 IEEE International Conference on Cluster Computing.

[15] Chau-Wen Tseng,et al. Tiling Optimizations for 3D Scientific Computations , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[16] Guillaume Houzeaux,et al. WAVE ACOUSTIC PROPAGATION FOR GEOPHYSICS IMAGING , FINITE DIFFERENCE vs FINITE ELEMENT METHODS COMPARISON AND BOUNDARY CONDITION TREATMENT , 2007 .

[17] Samuel Williams,et al. Scientific Computing Kernels on the Cell Processor , 2007, International Journal of Parallel Programming.