TY - JOUR
T1 - NBODY6++GPU
T2 - Ready for the gravitational million-body problem
AU - Wang, Long
AU - Spurzem, Rainer
AU - Aarseth, Sverre
AU - Nitadori, Keigo
AU - Berczik, Peter
AU - Kouwenhoven, M. B.N.
AU - Naab, Thorsten
N1 - Publisher Copyright:
© 2015 The Authors.
PY - 2015/4/24
Y1 - 2015/4/24
N2 - Accurate direct N-body simulations help to obtain detailed information about the dynamical evolution of star clusters. They also enable comparisons with analytical models and Fokker- Planck or Monte Carlo methods. NBODY6 is a well-known direct N-body code for star clusters, and NBODY6++ is the extended version designed for large particle number simulations by supercomputers. We present NBODY6++GPU, an optimized version of NBODY6++ with hybrid parallelization methods (MPI, GPU, OpenMP, andAVX/SSE) to accelerate large direct N-body simulations, and in particular to solve the million-body problem. We discuss the new features of the NBODY6++GPU code, benchmarks, as well as the first results from a simulation of a realistic globular cluster initially containing a million particles. For million-body simulations, NBODY6++GPU is 400-2000 times faster than NBODY6 with 320 CPU cores and 32 NVIDIA K20X GPUs. With this computing cluster specification, the simulations of million-body globular clusters including 5 per cent primordial binaries require about an hour per half-mass crossing time.
AB - Accurate direct N-body simulations help to obtain detailed information about the dynamical evolution of star clusters. They also enable comparisons with analytical models and Fokker- Planck or Monte Carlo methods. NBODY6 is a well-known direct N-body code for star clusters, and NBODY6++ is the extended version designed for large particle number simulations by supercomputers. We present NBODY6++GPU, an optimized version of NBODY6++ with hybrid parallelization methods (MPI, GPU, OpenMP, andAVX/SSE) to accelerate large direct N-body simulations, and in particular to solve the million-body problem. We discuss the new features of the NBODY6++GPU code, benchmarks, as well as the first results from a simulation of a realistic globular cluster initially containing a million particles. For million-body simulations, NBODY6++GPU is 400-2000 times faster than NBODY6 with 320 CPU cores and 32 NVIDIA K20X GPUs. With this computing cluster specification, the simulations of million-body globular clusters including 5 per cent primordial binaries require about an hour per half-mass crossing time.
KW - Globular clusters: general
KW - Methods: numerical
UR - http://www.scopus.com/inward/record.url?scp=84937788274&partnerID=8YFLogxK
U2 - 10.1093/mnras/stv817
DO - 10.1093/mnras/stv817
M3 - Article
AN - SCOPUS:84937788274
SN - 0035-8711
VL - 450
SP - 4070
EP - 4080
JO - Monthly Notices of the Royal Astronomical Society
JF - Monthly Notices of the Royal Astronomical Society
IS - 4
ER -