Adaptive multi-GPU Exchange Monte Carlo for the 3D Random Field Ising Model

Abstract This work presents an adaptive multi-GPU Exchange Monte Carlo approach for the simulation of the 3D Random Field Ising Model (RFIM). The design is based on a two-level parallelization. The first level, spin-level parallelism , maps the parallel computation as optimal 3D thread-blocks that simulate blocks of spins in shared memory with minimal halo surface, assuming a constant block volume. The second level, replica-level parallelism , uses multi-GPU computation to handle the simulation of an ensemble of replicas. CUDA’s concurrent kernel execution feature is used in order to fill the occupancy of each GPU with many replicas, providing a performance boost that is more notorious at the smallest values of L . In addition to the two-level parallel design, the work proposes an adaptive multi-GPU approach that dynamically builds a proper temperature set free of exchange bottlenecks. The strategy is based on mid-point insertions at the temperature gaps where the exchange rate is most compromised. The extra work generated by the insertions is balanced across the GPUs independently of where the mid-point insertions were performed. Performance results show that spin-level performance is approximately two orders of magnitude faster than a single-core CPU version and one order of magnitude faster than a parallel multi-core CPU version running on 16-cores. Multi-GPU performance is highly convenient under a weak scaling setting, reaching up to 99 % efficiency as long as the number of GPUs and L increase together. The combination of the adaptive approach with the parallel multi-GPU design has extended our possibilities of simulation to sizes of L = 32 , 64 for a workstation with two GPUs. Sizes beyond L = 64 can eventually be studied using larger multi-GPU systems.

[1]  P. Theodorakis,et al.  Random-field Ising model: Insight from zero-temperature simulations , 2014, 1501.02338.

[2]  B. Cipra An introduction to the Ising model , 1987 .

[3]  Frederic Magoules,et al.  Iterative Krylov Methods for Acoustic Problems on Graphics Processing Unit , 2014, 2014 13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science.

[4]  L. Santen,et al.  The critical exponents of the two-dimensional Ising spin glass revisited: Exact ground-state calculations and Monte Carlo simulations , 1996 .

[5]  Peter Virnau,et al.  Multi-GPU accelerated multi-spin Monte Carlo simulations of the 2D Ising model , 2010, Comput. Phys. Commun..

[6]  V. Martin-Mayor,et al.  Universality in the three-dimensional random-field Ising model. , 2013, Physical review letters.

[7]  Martin Weigel SIMULATING SPIN MODELS ON GPU: A TOUR , 2012 .

[8]  K. Hukushima,et al.  Exchange Monte Carlo Method and Application to Spin Glass Simulations , 1995, cond-mat/9512035.

[9]  T. Yavors’kii,et al.  Optimized GPU simulation of continuous-spin glass models , 2012, The European Physical Journal Special Topics.

[10]  Martin Weigel,et al.  Simulating spin models on GPU , 2010, Comput. Phys. Commun..

[11]  Wang,et al.  Replica Monte Carlo simulation of spin glasses. , 1986, Physical review letters.

[12]  J. D. de Pablo,et al.  Optimal allocation of replicas in parallel tempering simulations. , 2005, The Journal of chemical physics.

[13]  J. Ramanujam,et al.  Parallel tempering simulation of the three-dimensional Edwards-Anderson model with compact asynchronous multispin coding on GPU , 2013, Comput. Phys. Commun..

[14]  Massimo Bernaschi,et al.  Highly optimized simulations on single- and multi-GPU systems of the 3D Ising spin glass model , 2014, Comput. Phys. Commun..

[15]  Alan D. Sokal,et al.  Universal Amplitude Ratios in the Critical Two-Dimensional Ising Model on a Torus , 2000 .

[16]  Helmut G. Katzgraber,et al.  Universality in three-dimensional Ising spin glasses: A Monte Carlo study , 2006, cond-mat/0602212.

[17]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[18]  Frédéric Magoulès,et al.  Auto-tuned Krylov methods on cluster of graphics processing unit , 2015, Int. J. Comput. Math..

[19]  Nancy Hitschfeld-Kahler,et al.  A Survey on Parallel Computing and its Applications in Data-Parallel Problems Using GPU Architectures , 2014 .

[20]  Wang,et al.  Nonuniversal critical dynamics in Monte Carlo simulations. , 1987, Physical review letters.

[21]  Wolff,et al.  Collective Monte Carlo updating for spin systems. , 1989, Physical review letters.

[22]  M. Weigel Connected-component identification and cluster update on graphics processing units. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Frederic Magoules,et al.  Iterative Krylov Methods for Gravity Problems on Graphics Processing Unit , 2013, 2013 12th International Symposium on Distributed Computing and Applications to Business, Engineering & Science.

[24]  Anastasios Malakis,et al.  Lack of self-averaging of the specific heat in the three-dimensional random-field Ising model. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  N Prokof'ev,et al.  Worm algorithms for classical statistical models. , 2001, Physical review letters.

[26]  Coddington,et al.  Comparison of cluster algorithms for two-dimensional Potts models. , 1991, Physical review. B, Condensed matter.

[27]  Young,et al.  Quasicritical behavior and first-order transition in the d=3 random-field Ising model. , 1985, Physical review letters.

[28]  Martin Weigel,et al.  Performance potential for simulating spin models on GPU , 2010, J. Comput. Phys..

[29]  C. Geyer Markov Chain Monte Carlo Maximum Likelihood , 1991 .

[30]  Yutaka Okabe,et al.  Multi-GPU-based Swendsen-Wang multi-cluster algorithm for the simulation of two-dimensional q-state Potts model , 2012, Comput. Phys. Commun..

[31]  Frederic Magoules,et al.  Schwarz Method with Two-Sided Transmission Conditions for the Gravity Equations on Graphics Processing Unit , 2013, 2013 12th International Symposium on Distributed Computing and Applications to Business, Engineering & Science.

[32]  E. Ising Beitrag zur Theorie des Ferromagnetismus , 1925 .

[33]  Nicolás Wolovick,et al.  q-state Potts model metastability study using optimized GPU-based Monte Carlo algorithms , 2011, Comput. Phys. Commun..

[34]  Yutaka Okabe,et al.  GPU-based single-cluster algorithm for the simulation of the Ising model , 2012, J. Comput. Phys..

[35]  Barbara Chapman,et al.  Using OpenMP: Portable Shared Memory Parallel Programming (Scientific and Engineering Computation) , 2007 .

[36]  R. B. Potts Some generalized order-disorder transformations , 1952, Mathematical Proceedings of the Cambridge Philosophical Society.

[37]  D. Kofke,et al.  Selection of temperature intervals for parallel-tempering simulations. , 2005, The Journal of chemical physics.

[38]  Matthias Troyer,et al.  Feedback-optimized parallel tempering Monte Carlo , 2006, cond-mat/0602085.

[39]  Schwartz,et al.  Critical behavior of the random-field Ising model. , 1996, Physical review. B, Condensed matter.

[40]  A. Malakis,et al.  Phase diagram of the 3D bimodal random-field Ising model , 2008, 0802.0077.

[41]  Yutaka Okabe,et al.  GPU-based Swendsen-Wang multi-cluster algorithm for the simulation of two-dimensional classical spin systems , 2012, Comput. Phys. Commun..

[42]  Morgenstern,et al.  Critical behavior of three-dimensional Ising spin-glass model. , 1985, Physical review letters.

[43]  Wolfgang Paul,et al.  GPU accelerated Monte Carlo simulation of the 2D and 3D Ising model , 2009, J. Comput. Phys..

[44]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[45]  Kenneth A. Hawick,et al.  Parallel graph component labelling with GPUs and CUDA , 2010, Parallel Comput..

[46]  K. Adkins Theory of spin glasses , 1974 .

[47]  Wolfhard Janke,et al.  Make life simple: unleash the full power of the parallel tempering algorithm. , 2010, Physical review letters.

[48]  Yoseph Imry,et al.  Random-Field Instability of the Ordered State of Continuous Symmetry , 1975 .