首页 | 本学科首页   官方微博 | 高级检索  
     

大规模声学边界元法的GPU并行计算
引用本文:张锐,文立华,校金友. 大规模声学边界元法的GPU并行计算[J]. 计算物理, 2015, 32(3): 299-309
作者姓名:张锐  文立华  校金友
作者单位:西北工业大学航天学院, 陕西 西安 710072
基金项目:国家自然科学基金,教育部博士点基金
摘    要:提出一种大规模声学边界元法的高效率、高精度GPU并行计算方法.基于Burton-Miller边界积分方程,推导适于GPU的并行计算格式并实现了传统边界元法的GPU加速算法.为提高原型算法的效率,研究GPU数据缓存优化方法.由于GPU的双精度浮点运算能力较低,为了降低数值误差,研究基于单精度浮点运算实现的doublesingle精度算法.数值算例表明,改进的算法实现了最高89.8%的GPU使用效率,且数值精度与直接使用双精度数相当,而计算时间仅为其1/28,显存消耗也仅为其一半.该方法可在普通PC机(8GB内存,NVIDIA Ge Force 660 Ti显卡)上快速完成自由度超过300万的大规模声学边界元分析,计算速度和内存消耗均优于快速边界元法.

关 键 词:声学  边界元法  大规模问题  GPU计算  优化算法  
收稿时间:2014-03-21
修稿时间:2014-06-10

GPU-accelerated Boundary Element Method for Large-scale Problems in Acoustics
ZHANG Rui,WEN Lihua,XIAO Jinyou. GPU-accelerated Boundary Element Method for Large-scale Problems in Acoustics[J]. Chinese Journal of Computational Physics, 2015, 32(3): 299-309
Authors:ZHANG Rui  WEN Lihua  XIAO Jinyou
Affiliation:School of Astronautics, Northwestern Polytechnical University, Xi'an 710072, China
Abstract:A boundary element method (BEM) for large-scale acoustic analysis is accelerated efficiently and precisely with Graphics Processing Units (GPUs). Based on Burton-Miller boundary integral equation, an implementation scheme that can be handled efficiently in GPU is derived and applied to accelerate conventional BEM. Data caching techniques in GPU are introduced to improve efficiency of the prototype algorithm. A double-single precision algorithm implemented with single-precision floating-point numbers is employed to reduce numerical errors. It shows that the improved algorithm sustained a highest GPU efficiency of 89.8% for large-scale problems, and its accuracy was almost the same as that with double-precision numerals directly while costing only 1/28 in time and half in GPU memory consumption of the latter. The largest problem size up to 3 million unknowns was solved rapidly on a desktop PC (8GB RAM, NVIDIA GeForce 660 Ti) by the method. Its performance was better than the fast BEM algorithms in both time and memory consumption.
Keywords:acoustics  large-scale problems  GPU computing  optimization algorithm
本文献已被 万方数据 等数据库收录!
点击此处可从《计算物理》浏览原始摘要信息
点击此处可从《计算物理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号