首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Massively parallel algorithm and implementation of RI‐MP2 energy calculation for peta‐scale many‐core supercomputers
Authors:Michio Katouda  Akira Naruse  Yukihiko Hirano  Takahito Nakajima
Institution:1. Computational Molecular Science Research Team, RIKEN Advanced Institute for Computational Science, Chuo‐ku, Kobe, Japan;2. NVIDIA Corporation, Minato‐ku, Tokyo, Japan
Abstract:A new parallel algorithm and its implementation for the RI‐MP2 energy calculation utilizing peta‐flop‐class many‐core supercomputers are presented. Some improvements from the previous algorithm (J. Chem. Theory Comput. 2013, 9, 5373) have been performed: (1) a dual‐level hierarchical parallelization scheme that enables the use of more than 10,000 Message Passing Interface (MPI) processes and (2) a new data communication scheme that reduces network communication overhead. A multi‐node and multi‐GPU implementation of the present algorithm is presented for calculations on a central processing unit (CPU)/graphics processing unit (GPU) hybrid supercomputer. Benchmark results of the new algorithm and its implementation using the K computer (CPU clustering system) and TSUBAME 2.5 (CPU/GPU hybrid system) demonstrate high efficiency. The peak performance of 3.1 PFLOPS is attained using 80,199 nodes of the K computer. The peak performance of the multi‐node and multi‐GPU implementation is 514 TFLOPS using 1349 nodes and 4047 GPUs of TSUBAME 2.5. © 2016 Wiley Periodicals, Inc.
Keywords:massively parallel algorithm  GPGPU  electron correlation theory  second‐order Mø  ller–  Plesset perturbation theory  K computer  TSUBAME 2  5  NTChem
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号