On optimal message vector length for block single parallel partition algorithm in a three-dimensional ADI solver |
| |
Authors: | Li Yuan Hong Guo |
| |
Affiliation: | a LSEC and Institute of Computational Mathematics and Scientific/Engineering Computing, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, PR China b National Microgravity Laboratory, Institute of Mechanics, Chinese Academy of Sciences, Beijing 100190, PR China |
| |
Abstract: | It has long been recognized that many direct parallel tridiagonal solvers are only efficient for solving a single tridiagonal equation of large sizes, and they become inefficient when naively used in a three-dimensional ADI solver. In order to improve the parallel efficiency of an ADI solver using a direct parallel solver, we implement the single parallel partition (SPP) algorithm in conjunction with message vectorization, which aggregates several communication messages into one to reduce the communication costs. The measured performances show that the longest allowable message vector length (MVL) is not necessarily the best choice. To understand this observation and optimize the performance, we propose an improved model that takes the cache effect into consideration. The optimal MVL for achieving the best performance is shown to depend on number of processors and grid sizes. Similar dependence of the optimal MVL is also found for the popular block pipelined method. |
| |
Keywords: | Tridiagonal equation Single parallel partition ADI scheme Message vectorization Optimal message vector length |
本文献已被 ScienceDirect 等数据库收录! |
|