大规模有限元系统的GPU加速计算研究 Solving large finite element system by GPU computation期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

大规模有限元系统的GPU加速计算研究

引用本文：	刘小虎,胡耀国,符伟.大规模有限元系统的GPU加速计算研究[J].计算力学学报,2012,29(1):146-152.

作者姓名：	刘小虎胡耀国符伟

作者单位：	华中科技大学力学系, 武汉 430074;华中科技大学力学系, 武汉 430074;华中科技大学力学系, 武汉 430074

基金项目：	国家自然科学基金(10672066)资助项目.

摘要：	研究了GPU(Graphics Processing Units)计算应用于有限元方法中的总刚计算和组装、稀疏矩阵与向量乘积运算、线性方程组求解问题,并基于CUDA(Compute Unified Device Architecture)平台利用GTX295GPU进行程序实现和测试。系统总刚采用CSR(Compressed Sparse Row)压缩格式存放于GPU显存中,用单元染色方法实现总刚并行计算组装,用共轭梯度迭代法求解大规模线性方程组。对300万自由度以内的空间桁架和平面问题算例,GPU有限元计算分别获得最高9.5倍和6.5倍的计算加速比,并且加速比随系统自由度的增加而近似线性增加,GFLOP/s峰值也有近10倍的增加。
关键词：	GPU计算有限元共轭梯度法并行计算 CUDA
收稿时间：	2010/8/20 0:00:00
修稿时间：	2011/3/18 0:00:00
Solving large finite element system by GPU computation

LIU Xiao-hu,HU Yao-guo and FU Wei.Solving large finite element system by GPU computation[J].Chinese Journal of Computational Mechanics,2012,29(1):146-152.

Authors:	LIU Xiao-hu HU Yao-guo and FU Wei

Institution:	Department of Mechanics, Hua Zhong University of Science and Technology, Wuhan 430074, China;Department of Mechanics, Hua Zhong University of Science and Technology, Wuhan 430074, China;Department of Mechanics, Hua Zhong University of Science and Technology, Wuhan 430074, China

Abstract:	Some techniques for applying GPU(Graphics Processing Units) computation in FEM(Finite Element Method) were investigated in this paper,which include element stiffness matrix parallel calculation and global stiffness matrix assembly method,unstructured sparse matrix-vector multiplication and large-scale linear system solving method.A FEM code was implemented by using CUDA(Compute Unified Device Architecture) platform and tested on nVidia GeForce GPU device.The system stiffness matrix was stored in the graphics memory in CSR(Compressed Sparse Row) format,and assembled via element coloring.Conjugate gradient method was used to solve FEM linear system iteratively.For the truss and 2D examples,the GPU-based FEM code gained speedups up to 9.5x and 6.5x,respectively.It is found that the GPU speedup values are roughly linear with system DOFs(Degree Of Freedoms),and the peak values of GFLOP/s increase approximately 10 times when comparing with those of CPU’s.

Keywords:	GPU computation finite element method conjugate gradient method parallel computation CUDA
本文献已被 CNKI 等数据库收录！
	点击此处可从《计算力学学报》浏览原始摘要信息
	点击此处可从《计算力学学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏