Massively parallel algorithm and implementation of RI‐MP2 energy calculation for peta‐scale many‐core supercomputers |
| |
Authors: | Michio Katouda Akira Naruse Yukihiko Hirano Takahito Nakajima |
| |
Affiliation: | 1. Computational Molecular Science Research Team, RIKEN Advanced Institute for Computational Science, Chuo‐ku, Kobe, Japan;2. NVIDIA Corporation, Minato‐ku, Tokyo, Japan |
| |
Abstract: | A new parallel algorithm and its implementation for the RI‐MP2 energy calculation utilizing peta‐flop‐class many‐core supercomputers are presented. Some improvements from the previous algorithm (J. Chem. Theory Comput. 2013, 9, 5373) have been performed: (1) a dual‐level hierarchical parallelization scheme that enables the use of more than 10,000 Message Passing Interface (MPI) processes and (2) a new data communication scheme that reduces network communication overhead. A multi‐node and multi‐GPU implementation of the present algorithm is presented for calculations on a central processing unit (CPU)/graphics processing unit (GPU) hybrid supercomputer. Benchmark results of the new algorithm and its implementation using the K computer (CPU clustering system) and TSUBAME 2.5 (CPU/GPU hybrid system) demonstrate high efficiency. The peak performance of 3.1 PFLOPS is attained using 80,199 nodes of the K computer. The peak performance of the multi‐node and multi‐GPU implementation is 514 TFLOPS using 1349 nodes and 4047 GPUs of TSUBAME 2.5. © 2016 Wiley Periodicals, Inc. |
| |
Keywords: | massively parallel algorithm GPGPU electron correlation theory second‐order Mø ller– Plesset perturbation theory K computer TSUBAME 2 5 NTChem |
|
|