首页 | 本学科首页   官方微博 | 高级检索  
文章检索
  按 检索   检索词:      
出版年份:   被引次数:   他引次数: 提示:输入*表示无穷大
  收费全文   46篇
  免费   5篇
  国内免费   1篇
化学   19篇
力学   8篇
综合类   3篇
数学   7篇
物理学   15篇
  2020年   3篇
  2019年   1篇
  2018年   2篇
  2017年   5篇
  2016年   5篇
  2015年   4篇
  2014年   6篇
  2013年   4篇
  2012年   6篇
  2011年   11篇
  2010年   3篇
  2009年   1篇
  2007年   1篇
排序方式: 共有52条查询结果,搜索用时 15 毫秒
1.
We implemented accurate FFD in terms of triangular Bezier surfaces as matrix multiplications in CUDA and rendered them via OpenGL. Experimental results show that the proposed algorithm is more efficient than the previous GPU acceleration algorithm and tessel- lation shader algorithms.  相似文献   
2.
构建航天飞行器的结构有限元模型是准确模拟飞行仿真、完成飞行器在轨飞行阶段结构故障监测和诊断的基础。采用细长体飞行器简化梁模型,提出新的基于CUDA(Compute Unified Device Architecture)的有限元单元刚度矩阵生成和总刚度矩阵组装算法。依据梁单元矩阵的对称性,结合GPU硬件架构提出并行生成算法并进行改进。为有效减少装配时间,在装配过程中采用着色算法,提出了基于GPU(Graphics Processing Unit)共享内存的非零项组装策略,通过在不同计算平台下算例对比,验证了新算法的快速性。数值算例表明,本文算法的求解效率较高,针对一定计算规模内的模型可满足快速计算与诊断的实时性要求。  相似文献   
3.
The availability of state of art GPGPU cards can play a promising role in accelerating the calculations of computer generated holograms (CGH) where one of the main problems of generating such holograms is the need for massive amount of calculations. Both CUDA and Matlab can be used alone to fulfill this purpose but using both at the same time gives a number of benefits. This paper is dedicated to present a performance study of applying Fermi-Architecture CUDA-enabled GPGPU card for speeding up the calculations of binary detour phase holograms using both Matlab and CUDA programming.  相似文献   
4.
Nowadays, high performance computing (HPC) systems experience a disruptive moment with a variety of novel architectures and frameworks, without any clarity of which one is going to prevail. In this context, the portability of codes across different architectures is of major importance. This paper presents a portable implementation model based on an algebraic operational approach for direct numerical simulation (DNS) and large eddy simulation (LES) of incompressible turbulent flows using unstructured hybrid meshes. The strategy proposed consists in representing the whole time-integration algorithm using only three basic algebraic operations: sparse matrix–vector product, a linear combination of vectors and dot product. The main idea is based on decomposing the nonlinear operators into a concatenation of two SpMV operations. This provides high modularity and portability. An exhaustive analysis of the proposed implementation for hybrid CPU/GPU supercomputers has been conducted with tests using up to 128 GPUs. The main objective consists in understanding the challenges of implementing CFD codes on new architectures.  相似文献   
5.
Graphics Processing Unit (GPU), originally developed for real-time, high-definition 3D graphics in computer games, now provides great faculty in solving scientific applications. The basis of particle transport simulation is the time-dependent, multi-group, inhomogeneous Boltzmann transport equation. The numerical solution to the Boltzmann equation involves the discrete ordinates (Sn) method and the procedure of source iteration. In this paper, we present a GPU accelerated simulation of one energy group time-independent deterministic discrete ordinates particle transport in 3D Cartesian geometry (Sweep3D). The performance of the GPU simulations are reported with the simulations of vacuum boundary condition. The discussion of the relative advantages and disadvantages of the GPU implementation, the simulation on multi GPUs, the programming effort and code portability are also reported. The results show that the overall performance speedup of one NVIDIA Tesla M2050 GPU ranges from 2.56 compared with one Intel Xeon X5670 chip to 8.14 compared with one Intel Core Q6600 chip for no flux fixup. The simulation with flux fixup on one M2050 is 1.23 times faster than on one X5670.  相似文献   
6.
This paper presents a novel background prediction method for infrared small target detection (ISTD). Using a separable convolution template (SCT) to accelerate the traditional background prediction by graphic processing unit (GPU), the new method provides a significant improvement in the prediction speed, which enables the prediction process in real time. And experimental results show its high efficiency and practical application over previous work. The mathematical approach proposed here could be extended to accelerate the applications referred to image convolutions not only to the infrared field.  相似文献   
7.
综述了图形处理器(GPU)在计算化学中的应用和进展.首先简单介绍了GPU在科学计算中应用的发展,然后分别详细讲述了迄今几个使用GPU和CUDA(compute unified device architecture,显卡厂商Nvidia推出的计算平台)开发工具设计的量子化学计算和分子动力学(MD)模拟的算法和程序,尤其对目前唯一完全使用GPU技术开发的量子化学计算软件TeraChem做了完备的介绍,包括算法、实现的细节和程序目前的功能.此外,本文还对GPU在计算化学上将会发挥的作用做出了极为乐观的展望.  相似文献   
8.
In the present study, a GPU accelerated 1D space–time CESE method is developed and applied to shock tube problems with and without condensation. We have demonstrated how to implement the CESE algorithm to solve 1D shock tube problems using an older generation GPU (the NVIDIA 9800 GT) with relatively limited memory. To optimize the code performance, we used Shared Memory and solved the inter-Block boundary problem in two ways, namely the branch scheme and the overlapping scheme. The implementations of these schemes are discussed in detail and their performances are compared for the Sod shock tube problems. For the Sod problem without condensation, the speedup over an Intel CPU E7300 is 23 for the branch scheme and 41 for the overlapping scheme, respectively. While for problems with condensation, both schemes achieve higher acceleration ratios, 53 and 71, respectively. The higher speedup of the condensation case can be ascribed to the source term calculation which has a local dependence on the mesh point and the SOURCE kernel has a higher acceleration ratio.  相似文献   
9.
Modern graphics processing units (GPUs) are flexibly programmable and have peak computational throughput significantly faster than conventional CPUs. Herein, we describe the design and implementation of PAPER, an open‐source implementation of Gaussian molecular shape overlay for NVIDIA GPUs. We demonstrate one to two order‐of‐magnitude speedups on high‐end commodity GPU hardware relative to a reference CPU implementation of the shape overlay algorithm and speedups of over one order of magnitude relative to the commercial OpenEye ROCS package. In addition, we describe errors incurred by approximations used in common implementations of the algorithm. © 2009 Wiley Periodicals, Inc. J Comput Chem 2010  相似文献   
10.
During the past few years, general-purpose graphics processing units (GPGPUs) have become rather popular in the high performance computing community. In this study, we present an implementation of the simulation of dynamic nuclear magnetic resonance (DNMR) spectra. The algorithm is based on the kinetic Monte Carlo method and therefore can benefit from the multithreaded architecture of the GPGPU. By careful optimization of the algorithm a 30-100-fold speed increase could be achieved.  相似文献   
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号