A Study on Adaptive Algorithms for Numerical Quadrature on Heterogeneous GPU and Multicore Based Systems

In this work, a parallel adaptive algorithm for the computation of a multidimensional integral on heterogeneous GPU and multicore based systems is described. Two different strategies have been combined together in the algorithm: a first procedure is responsible for the load balancing among the threads on the multicore CPU and a second one is responsible for an efficient execution on the GPU of the computational kernel. The performance is analyzed and experimental results on a system with a quad-core CPUs and two GPUs have been achieved.

[1]  Jack Dongarra,et al.  Sourcebook of parallel computing , 2003 .

[2]  Marco Lapegna,et al.  PAMIHR. A Parallel FORTRAN Program for Multidimensional Quadrature on Distributed Memory Architectures , 1999, Euro-Par.

[3]  Christoph W. Ueberhuber,et al.  Numerical Integration on Advanced Computer Systems , 1994, Lecture Notes in Computer Science.

[4]  Marco Ceccarelli,et al.  Total Variation Regularization for Edge Preserving 3D SPECT Imaging in High Performance Computing Environments , 2002, International Conference on Computational Science.

[5]  J. Berntsen Practical error estimation in adaptive multidimensional quadrature routines , 1989 .

[6]  Paul Van Dooren,et al.  An adaptive algorithm for numerical integration over the n-cube , 1976 .

[7]  Almerico Murli,et al.  Integration of emerging computer technologies for an efficient image sequences analysis , 2011, Integr. Comput. Aided Eng..

[8]  A. Genz,et al.  An Imbedded Family of Fully Symmetric Numerical Integration Rules , 1983 .

[9]  Lucia Maddalena,et al.  A fusion-based approach to digital movie restoration , 2009, Pattern Recognit..

[10]  Mario Rosario Guarracino,et al.  A grid enabled PSE for medical imaging: experiences on MedIGrid , 2005, 18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05).

[11]  G. Laccetti,et al.  A Grid Enabled PSE for Medical Imaging: Experiences on , 2005 .

[12]  Almerico Murli,et al.  Towards a parallel component for imaging in PETSc programming environment: a case study in 3-D echocardiography , 2006, Parallel Comput..

[13]  A. Murli,et al.  Image sequence inpainting: towards numerical software for detection and removal of local missing data via motion estimation , 2007 .

[14]  Francesco Gregoretti,et al.  A multi-grained distributed implementation of the parallel Block Conjugate Gradient algorithm , 2010 .

[15]  Terje O. Espelid,et al.  Algorithm 698: DCUHRE: an adaptive multidemensional integration routine for a vector of integrals , 1991, TOMS.

[16]  R. Cools,et al.  Monomial cubature rules since “Stroud”: a compilation , 1993 .

[17]  Almerico Murli,et al.  A Double Adaptive Algorithm for Multidimensional Integration on Multicore Based HPC Systems , 2012, International Journal of Parallel Programming.