Similar literature
 Found 20 similar documents; search took 31 ms
1.
An unstructured non‐nested multigrid method is presented for efficient simulation of unsteady incompressible Navier–Stokes flows. The Navier–Stokes solver is based on the artificial compressibility approach and a higher‐order characteristics‐based finite‐volume scheme on unstructured grids. Unsteady flow is calculated with an implicit dual time stepping scheme. For efficient computation of unsteady viscous flows over complex geometries, an unstructured multigrid method is developed to speed up the convergence rate of the dual time stepping calculation. The multigrid method is used to simulate the steady and unsteady incompressible viscous flows over a circular cylinder for validation and performance evaluation purposes. It is found that the multigrid method with three levels of grids results in a 75% reduction in CPU time for the steady flow calculation and a 55% reduction for the unsteady flow calculation, compared with its single‐grid counterpart. The results are compared with numerical solutions obtained by other researchers, as well as with experimental measurements wherever available, and good agreement is obtained. Copyright © 2004 John Wiley & Sons, Ltd.
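
The dual time stepping idea mentioned in this abstract can be illustrated on a scalar model problem. The sketch below is not the paper's solver: it assumes a backward Euler physical time step and a hypothetical linear residual R(u) = lam * u, and it drives the unsteady residual to zero by explicit marching in pseudo time. The names `dual_time_step`, `lam`, and `dtau` are illustrative only.

```python
def dual_time_step(u_n, dt, residual, dtau, tol=1e-12, max_iters=10_000):
    """Advance one physical step of du/dt + R(u) = 0 with backward Euler,
    solving the implicit equation by explicit pseudo-time iteration."""
    u = u_n  # initial guess: value at the previous physical time level
    for _ in range(max_iters):
        # Unsteady residual R*(u) = (u - u_n)/dt + R(u); it vanishes at the new level
        r_star = (u - u_n) / dt + residual(u)
        if abs(r_star) < tol:
            break
        u -= dtau * r_star  # explicit march in pseudo time
    return u


lam = 2.0  # hypothetical decay rate for the model problem R(u) = lam * u
u_next = dual_time_step(1.0, dt=0.1, residual=lambda u: lam * u, dtau=0.05)
exact_implicit = 1.0 / (1.0 + lam * 0.1)  # closed-form backward Euler update
```

For this linear model the pseudo-time iteration contracts by the factor 1 - dtau * (1/dt + lam) per sweep, so the inner loop converges to the backward Euler value. In a flow solver, the inner loop is the part that multigrid would accelerate.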

2.
This paper describes a modern free‐surface capturing strategy implemented in an unstructured finite‐volume viscous flow solver that can handle moving grids composed of arbitrary‐shaped control volumes. An adaptive mesh strategy is fully integrated in the code making it a single tool for dynamically maintaining a prescribed density of grid points around the steady or unsteady interface between air and water. The whole adaptive procedure is described in detail. The efficiency of the overall approach is examined on two‐ and three‐dimensional hydrodynamic applications. The adaptive strategy achieves interesting gains in terms of computational and human efforts compared to single‐mesh computations. Copyright © 2005 John Wiley & Sons, Ltd.

3.
An unstructured shock‐fitting algorithm, originally developed to simulate steady flows, has been further developed to make it capable of dealing with unsteady flows. The present paper discusses and analyses the additional features required to extend the steady algorithm to unsteady flows. The properties of the unsteady version of this novel, unstructured shock‐fitting technique are tested by reference to the inviscid interaction between a vortex and a planar shock: a comparative assessment of shock‐capturing and shock‐fitting is made for the same test problem. Copyright © 2015 John Wiley & Sons, Ltd.

4.
The present paper investigates the multigrid (MG) acceleration of compressible Reynolds‐averaged Navier–Stokes computations using Reynolds‐stress model 7‐equation turbulence closures, as well as lower‐level 2‐equation models. The basic single‐grid (SG) algorithm combines upwind‐biased discretization with a subiterative local‐dual‐time‐stepping time‐integration procedure. MG acceleration, using characteristic MG restriction and prolongation operators, is applied on meanflow variables only (MF–MG), turbulence variables being simply injected onto coarser grids. A previously developed non‐time‐consistent (for steady flows) full‐approximation‐multigrid (s–MG) is assessed for 3‐D anisotropy‐driven and/or separated flows, which are dominated by the convergence of turbulence variables. Even for these difficult test cases CPU‐speed‐ups r_CPUSUP ∈ [3, 5] are obtained. Alternative, potentially time‐consistent approaches (unsteady u–MG), where MG acceleration is applied at each subiteration, are also examined, using different subiterative strategies, MG cycles, and turbulence models. For 2‐D shock wave/turbulent boundary layer interaction, the fastest s–MG approach, with a V(2, 0) sawtooth cycle, systematically yields CPU‐speed‐ups of 5±½, quasi‐independent of the particular turbulence closure used. Copyright © 2008 John Wiley & Sons, Ltd.
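
To make the V(2, 0) sawtooth cycle concrete, here is a minimal sketch on a much simpler problem than the paper's: the 1‐D Poisson equation -u'' = f with zero boundary values. It uses a linear correction scheme rather than the full‐approximation scheme of the abstract, with Gauss–Seidel smoothing, full‐weighting restriction, and linear prolongation. All function names are illustrative assumptions.

```python
import math


def smooth(u, f, h, sweeps):
    """Gauss-Seidel sweeps for the 3-point discretization of -u'' = f."""
    for _ in range(sweeps):
        for i in range(1, len(u) - 1):
            u[i] = 0.5 * (u[i - 1] + u[i + 1] + h * h * f[i])


def residual(u, f, h):
    """Residual r = f - A u at interior points (boundary entries stay zero)."""
    r = [0.0] * len(u)
    for i in range(1, len(u) - 1):
        r[i] = f[i] - (2.0 * u[i] - u[i - 1] - u[i + 1]) / (h * h)
    return r


def restrict(r):
    """Full-weighting restriction from 2m+1 fine points to m+1 coarse points."""
    m = (len(r) - 1) // 2
    rc = [0.0] * (m + 1)
    for j in range(1, m):
        rc[j] = 0.25 * r[2 * j - 1] + 0.5 * r[2 * j] + 0.25 * r[2 * j + 1]
    return rc


def prolong(ec):
    """Linear interpolation from m+1 coarse points to 2m+1 fine points."""
    m = len(ec) - 1
    e = [0.0] * (2 * m + 1)
    for j in range(m + 1):
        e[2 * j] = ec[j]
    for j in range(m):
        e[2 * j + 1] = 0.5 * (ec[j] + ec[j + 1])
    return e


def v_cycle(u, f, h, pre=2, post=0):
    """One V(pre, post) cycle; post=0 gives the sawtooth variant."""
    n = len(u) - 1  # number of intervals; assumed to be a power of two
    if n == 2:
        u[1] = 0.5 * (u[0] + u[2] + h * h * f[1])  # single unknown: exact solve
        return u
    smooth(u, f, h, pre)
    rc = restrict(residual(u, f, h))
    ec = v_cycle([0.0] * len(rc), rc, 2.0 * h, pre, post)  # coarse error equation
    e = prolong(ec)
    for i in range(1, n):
        u[i] += e[i]  # coarse-grid correction
    smooth(u, f, h, post)
    return u


n = 64
h = 1.0 / n
x = [i * h for i in range(n + 1)]
f = [math.pi ** 2 * math.sin(math.pi * xi) for xi in x]  # exact solution: sin(pi x)
u = [0.0] * (n + 1)
for _ in range(20):
    v_cycle(u, f, h)
max_error = max(abs(ui - math.sin(math.pi * xi)) for ui, xi in zip(u, x))
```

With `post=0` all smoothing happens on the way down to the coarsest grid, which is what the V(2, 0) notation denotes. After the cycles have converged, the remaining `max_error` is the discretization error of the 3‐point stencil rather than an iteration error.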

5.
This paper describes a non‐iterative operator‐splitting algorithm for computing all‐speed flows in complex geometries. A pressure‐based algorithm is adopted as the base, in which pressure, instead of density, is a primary variable, thus allowing for a unified formulation for all Mach numbers. The focus is on adapting the method for (a) flows at all speeds, and (b) multiblock, non‐orthogonal, body‐fitted grids for very complex geometries. Key features of the formulation include special treatment of mass fluxes at control volume interfaces to avoid pressure–velocity decoupling for incompressible (low Mach number limit) flows and to provide robust pressure–velocity–density coupling for compressible (high‐speed) flows. The method is shown to be robust for all Mach number regimes for both steady and unsteady flows; it is found to be stable for CFL numbers of order ten, allowing large time steps to be taken for steady flows. Enhancements to the method which allow for stable solutions to be obtained on non‐orthogonal grids are also discussed. The method is found to be very reliable even in complex engineering applications such as unsteady rotor–stator interactions in turbulent, all‐speed turbomachinery flows. Copyright © 2004 John Wiley & Sons, Ltd.

6.
The implementation of an edge-based three-dimensional Reynolds-averaged Navier–Stokes solver for unstructured grids able to run on multiple graphics processing units (GPUs) is presented. Loops over edges, which are the most time-consuming part of the solver, have been written to exploit the massively parallel capabilities of GPUs. Non-blocking communications between parallel processes and between the GPU and the central processing unit (CPU) have been used to enhance code scalability. The code is written using a mixture of C++ and OpenCL, to allow the execution of the source code on GPUs. The Message Passing Interface (MPI) library is used to allow the parallel execution of the solver on multiple GPUs. A comparative study of the solver's parallel performance is carried out using a cluster of CPUs and another of GPUs. It is shown that a single GPU is up to 64 times faster than a single CPU core. The parallel scalability of the solver is mainly degraded by the loss of computing efficiency of the GPU when the size of the case decreases. However, for large enough grid sizes, the scalability is strongly improved. A cluster featuring commodity GPUs and a high-bandwidth network is ten times less costly and consumes 33% less energy than a CPU-based cluster with equivalent computational power.
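
The edge loop that this abstract identifies as the dominant cost has a simple serial shape, sketched below in Python for readability (the paper's code is C++ and OpenCL). The graph, the nodal values, and the flux function are made-up illustrations; the only point is the pattern of computing one flux per edge and scattering it with opposite signs to the two end nodes.

```python
def edge_residuals(num_nodes, edges, u, flux):
    """Accumulate nodal residuals with a single loop over edges."""
    res = [0.0] * num_nodes
    for i, j in edges:
        f = flux(u[i], u[j])  # flux leaving node i towards node j
        res[i] -= f           # node i loses what crosses the edge
        res[j] += f           # node j gains the same amount
    return res


# Hypothetical four-node graph and a diffusion-like flux, for illustration only
edges = [(0, 1), (1, 2), (2, 0), (2, 3)]
u = [1.0, 2.0, 4.0, 8.0]
res = edge_residuals(4, edges, u, lambda a, b: 0.5 * (a - b))
total = sum(res)  # zero: every edge contributes +f and -f
```

Because each edge adds and subtracts the same flux, the residuals sum to zero, which is the discrete conservation property of edge-based schemes. When such a loop runs in parallel, two edges sharing a node may write to the same entry at once, so implementations typically need edge colouring, atomic updates, or a gather formulation; the abstract does not say which approach the authors chose.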

7.
A nested multi‐grid solution algorithm has been developed for an adaptive Cartesian/Quad grid viscous flow solver. Body‐fitted adaptive Quad (quadrilateral) grids are generated around solid bodies through ‘surface extrusion’. The Quad grids are then overlapped with an adaptive Cartesian grid. Quadtree data structures are employed to record both the Quad and Cartesian grids. The Cartesian grid is generated through recursive sub‐division of a single root, whereas the Quad grids start from multiple roots, a forest of Quadtrees representing the coarsest possible Quad grids. Cell‐cutting is performed at the Cartesian/Quad grid interface to merge the Cartesian and Quad grids into a single unstructured grid with arbitrary cell topologies (i.e., arbitrary polygons). Because of the hierarchical nature of the data structure, many levels of coarse grids have already been built in. The coarsening of the unstructured grid is based on the Quadtree data structure through reverse tree traversal. Issues arising from grid coarsening are discussed and solutions are developed. The flow solver is based on a cell‐centered finite volume discretization, Roe's flux splitting, a least‐squares linear reconstruction, and a differentiable limiter developed by Venkatakrishnan in a modified form. A local time stepping scheme is used to handle very small cut cells produced in cell‐cutting. Several cycling strategies, such as the saw‐tooth, W‐ and V‐cycles, have been studied. The V‐cycle has been found to be the most efficient. In general, the multi‐grid solution algorithm has been shown to greatly speed up convergence to steady state, by one to two orders of magnitude. Copyright © 2000 John Wiley & Sons, Ltd.

8.
This paper contains a comparison of four SIMPLE‐type methods used as solver and as preconditioner for the iterative solution of the (Reynolds‐averaged) Navier–Stokes equations, discretized with a finite volume method for cell‐centered, colocated variables on unstructured grids. A matrix‐free implementation is presented, and special attention is given to the treatment of the stabilization matrix to maintain a compact stencil suitable for unstructured grids. We find SIMPLER preconditioning to be robust and efficient for academic test cases and industrial test cases. Compared with the classical SIMPLE solver, SIMPLER preconditioning reduces the number of nonlinear iterations by a factor of 5–20 and the CPU time by a factor of 2–5, depending on the case. The flow around a ship hull at Reynolds number 2E9, for example, on a grid with cell aspect ratio up to 1:1E6, can be computed in 3 instead of 15 h. Copyright © 2012 John Wiley & Sons, Ltd.

9.
We optimized the Arbitrary accuracy DErivatives Riemann problem (ADER) Discontinuous Galerkin (DG) numerical method using the CUDA‐C language to run the code on a graphics processing unit (GPU). We focus on solving linear hyperbolic partial differential equations, where the method can be expressed as a combination of precomputed matrix multiplications, making it a good candidate for GPU hardware. Moreover, the method is arbitrarily high order and involves intensive work on local data, a property that is also beneficial for the target hardware. We compare our GPU implementation against CPU versions of the same method, observing similar convergence properties up to a threshold where the error remains fixed. This behavior is in agreement with the CPU version, but the threshold is slightly larger than in the CPU case. We also observe a big difference between single and double precision: in single precision, the threshold error is significantly larger. Finally, we observe a speed‐up factor in computational time that depends on the order of the method and the size of the problem. In the best case, our novel GPU implementation runs 23 times faster than the CPU version. We used three partial differential equations to test the code: the linear advection equation, the seismic wave equation, and the linear shallow water equation, all with variable coefficients. Copyright © 2015 John Wiley & Sons, Ltd.

10.
11.
Finite‐volume methods normally use either simple or complicated mathematical expressions to interpolate the fluxes at the cell faces of their unstructured volumes. Alternatively, we benefit from the advantages of both finite‐volume and finite‐element methods and estimate the advection terms on the cell faces using an inclusive pressure‐weighted upwinding scheme extended to unstructured grids. The present pressure‐based method treats steady and unsteady flows on a collocated grid arrangement. However, to avoid a non‐physical spurious pressure field pattern, two mass flux per volume expressions are derived at the cell interfaces. The dual advantages of using an unstructured‐based discretization and a pressure‐weighted upwinding scheme result in highly accurate solutions, with noticeable progress in performance over the primitive method on structured grids. The accuracy and performance of the extended formulations are demonstrated by solving different standard and benchmark problems. The results show excellent agreement with both benchmark and analytical solutions as well as experimental data. Copyright © 2007 John Wiley & Sons, Ltd.

12.
We present a new modelling strategy for improving the efficiency of computationally intensive flow problems in environmental free‐surface flows. The approach combines a recently developed semi‐implicit subgrid method with a hierarchical grid solution strategy. The method allows the incorporation of high‐resolution data on subgrid scale to obtain a more accurate and efficient hydrodynamic model. The subgrid method improves the efficiency of the hierarchical grid method by providing better solutions on coarse grids. The method is applicable to both steady and unsteady flows, but we particularly focus on river flows with steady boundary conditions. There, the combined hierarchical grid–subgrid method reduces the computational effort to obtain a steady state by factors of up to 43. For unsteady models, the method can be used for efficiently generating accurate initial conditions on high‐resolution grids. Additionally, the method provides automatic insight into grid convergence. We demonstrate the efficiency and applicability of the method using a schematic test for the vortex shedding around a circular cylinder and a real‐world river case study. Copyright © 2015 John Wiley & Sons, Ltd.

13.
We implement and evaluate a massively parallel and scalable algorithm based on a multigrid preconditioned Defect Correction method for the simulation of fully nonlinear free surface flows. The simulations are based on a potential model that describes wave propagation over uneven bottoms in three space dimensions and is useful for fast analysis and prediction purposes in coastal and offshore engineering. A dedicated numerical model based on the proposed algorithm is executed in parallel on an affordable, modern, special‐purpose graphics processing unit (GPU). The model is based on a low‐storage flexible‐order accurate finite difference method that is known to be efficient and scalable on a CPU core (single thread). To achieve parallel performance of the relatively complex numerical model, we investigate a new trend in high‐performance computing where many‐core GPUs are utilized as high‐throughput co‐processors to the CPU. We describe and demonstrate how this approach makes it possible to do fast desktop computations for large nonlinear wave problems in numerical wave tanks (NWTs) with close to 50/100 million total grid points in double/single precision with 4 GB global device memory available. A new code base has been developed in C++ and Compute Unified Device Architecture (CUDA) C and is found to improve the runtime by more than an order of magnitude in double precision arithmetic for the same accuracy over an existing CPU (single thread) Fortran 90 code when executed on a single modern GPU. These significant improvements are achieved by carefully implementing the algorithm to minimize data transfer and take advantage of the massive multi‐threading capability of the GPU device. Copyright © 2011 John Wiley & Sons, Ltd.

14.
This paper presents a relaxation algorithm, which is based on the overset grid technology, an unsteady three‐dimensional Navier–Stokes flow solver, and an inner‐ and outer‐relaxation method, for simulation of the unsteady flows of moving high‐speed trains. The flow solutions on the overlapped grids can be accurately updated by introducing a grid tracking technique and the inner‐ and outer‐relaxation method. To evaluate the capability and solution accuracy of the present algorithm, the static pressure distribution of a single stationary TGV high‐speed train inside a long tunnel is computed and compared with experimental data from a low‐speed wind tunnel test. Further, the unsteady flows of two TGV high‐speed trains passing by each other inside a long tunnel and at the tunnel entrance are simulated. A series of time histories of pressure distributions and aerodynamic loads acting on the train and tunnel surfaces are depicted for detailed discussions. Copyright © 2004 John Wiley & Sons, Ltd.

15.
In this article, we apply Davis's second‐order predictor‐corrector Godunov‐type method to the numerical solution of the Savage–Hutter equations for modeling granular avalanche flows. The method uses monotone upstream‐centered schemes for conservation laws (MUSCL) reconstruction for conservative variables and the Harten–Lax–van Leer contact (HLLC) scheme for numerical fluxes. Static resistance conditions and stopping criteria are incorporated into the algorithm. The computation is implemented on a graphics processing unit (GPU) using the Compute Unified Device Architecture (CUDA) programming model. A practice of allocating memory for two‐dimensional arrays on the GPU is given, and the computational efficiency of two‐dimensional memory allocation is compared with that of one‐dimensional memory allocation. The effectiveness of the present simulation model is verified through several typical numerical examples. Numerical tests show that significant speedups of the GPU program over the CPU serial version can be obtained, and that Davis's method in conjunction with MUSCL and HLLC schemes is accurate and robust for simulating granular avalanche flows with shock waves. As an application example, a case with a teardrop‐shaped hydraulic jump in Johnson and Gray's granular jet experiment is reproduced by using specific friction coefficients given in the literature. Copyright © 2014 John Wiley & Sons, Ltd.
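
The MUSCL reconstruction named in this abstract can be sketched for a scalar quantity on a uniform 1‐D grid. The abstract does not state which slope limiter the authors use, so the minmod limiter below is an assumption chosen for simplicity, and `muscl_faces` is a hypothetical helper name. The left and right face states it returns are what a Riemann solver such as HLLC would take as input.

```python
def minmod(a, b):
    """Return the smaller-magnitude slope if a and b share a sign, else zero."""
    if a * b <= 0.0:
        return 0.0
    return a if abs(a) < abs(b) else b


def muscl_faces(q):
    """Left/right states at each interior face i+1/2 from cell averages q."""
    slopes = [0.0] * len(q)  # boundary cells keep a zero slope (first order)
    for i in range(1, len(q) - 1):
        slopes[i] = minmod(q[i] - q[i - 1], q[i + 1] - q[i])
    faces = []
    for i in range(len(q) - 1):
        q_left = q[i] + 0.5 * slopes[i]           # extrapolated from cell i
        q_right = q[i + 1] - 0.5 * slopes[i + 1]  # extrapolated from cell i+1
        faces.append((q_left, q_right))
    return faces


smooth_faces = muscl_faces([0.0, 1.0, 2.0, 3.0])  # linear data
peak_faces = muscl_faces([0.0, 1.0, 0.0])         # local extremum in the middle
```

On linear data the two states at an interior face coincide, which is the second‐order behaviour, while at a local extremum the limiter sets the slope to zero and the scheme falls back to first order there. That fallback is what suppresses spurious oscillations near shocks.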

16.
While new power-efficient computer architectures exhibit spectacular theoretical peak performance, they require specific conditions to operate efficiently, which makes porting complex algorithms a challenge. Here, we report results of the semi-implicit method for pressure linked equations (SIMPLE) and the pressure implicit with operator splitting (PISO) methods implemented on the graphics processing unit (GPU). We examine the advantages and disadvantages of the full porting over a partial acceleration of these algorithms run on unstructured meshes. We found that the full-port strategy requires adjusting the internal data structures to the new hardware and proposed a convenient format for storing internal data structures on GPUs. Our implementation is validated on standard steady and unsteady problems and its computational efficiency is checked by comparing its results and run times with those of some standard software (OpenFOAM) run on a central processing unit (CPU). The results show that a server-class GPU outperforms a server-class dual-socket multi-core CPU system running essentially the same algorithm by up to a factor of 4.

17.
An improved hybrid method for computing unsteady compressible viscous flows is presented. This method divides the computational domain into two zones. In the inner zone, the Navier–Stokes equations are solved using a diagonal form of an alternating‐direction implicit (ADI) approximate factorisation procedure. In the outer zone, the unsteady full‐potential equation (FPE) is solved. The two zones are tightly coupled so that steady and unsteady flows may be efficiently solved. Characteristic‐based viscous/inviscid interface boundary conditions are employed to avoid spurious reflections at that interface. The resulting CPU times are about 60% of the full Navier–Stokes CPU times for unsteady flows in non‐vector processing machines. Applications of the method are presented for an F‐5 wing in steady and unsteady transonic flows. Steady surface pressures are in very good agreement with experimental data and are essentially identical to the full Navier–Stokes predictions. Density contours show that shocks cross the viscous/inviscid interface smoothly, so that the accuracy of full Navier–Stokes equations can be retained with significant savings in computational time. Copyright © 1999 John Wiley & Sons, Ltd.

18.
A vertex‐centred finite‐volume/finite‐element method (FV/FEM) is developed for solving 2‐D shallow water equations (SWEs) with source terms written in a surface elevation splitting form, which balances the flux gradients and source terms. The method is implemented on unstructured grids and the numerical scheme is based on a second‐order MUSCL‐like upwind Godunov FV discretization for inviscid fluxes and a classical Galerkin FE discretization for the viscous gradients and source terms. The main advantages are: (1) the discretization of SWE written in surface elevation splitting form satisfies the exact conservation property (C‐property) naturally; (2) the simple centred‐type discretization can be used for the source terms; (3) the method is suitable for both steady and unsteady shallow water problems; and (4) complex topography can be handled based on unstructured grids. The accuracy of the method was verified for both steady and unsteady problems, including discontinuous cases. The results indicate that the new method is accurate, simple, and robust. Copyright © 2007 John Wiley & Sons, Ltd.

19.
Unstructured meshes make it easy to represent complex geometries and to refine regions of interest without adding control volumes where they are not needed. However, numerical schemes used on unstructured grids have to be properly defined in order to minimise numerical errors. An assessment of a low Mach algorithm for laminar and turbulent flows on unstructured meshes using collocated and staggered formulations is presented. For staggered formulations using cell‐centred velocity reconstructions, the standard first‐order method is shown to be inaccurate in low Mach flows on unstructured grids. A recently proposed least squares procedure for incompressible flows is extended to the low Mach regime and shown to significantly improve the behaviour of the algorithm. Regarding collocated discretisations, the odd–even pressure decoupling is handled through a kinetic energy conserving flux interpolation scheme. This approach is shown to efficiently handle variable‐density flows. In addition, different face interpolation schemes for unstructured meshes are analysed. A kinetic energy‐preserving scheme, namely the symmetry‐preserving scheme, is applied to the momentum equations. Furthermore, a new approach to define the far‐neighbouring nodes of the quadratic upstream interpolation for convective kinematics scheme is presented and analysed. The method is suitable for both structured and unstructured grids, either uniform or not. The proposed algorithm and the spatial schemes are assessed against a function reconstruction, a differentially heated cavity and a turbulent self‐igniting diffusion flame. It is shown that the proposed algorithm accurately represents unsteady variable‐density flows. Furthermore, the quadratic upstream interpolation for convective kinematics scheme shows close to second‐order behaviour on unstructured meshes, and the symmetry‐preserving scheme is reliably used in all computations. Copyright © 2016 John Wiley & Sons, Ltd.

20.
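A review of progress in dynamic grid generation techniques and unsteady computational methods   Total citations: 17, self-citations: 1, citations by others: 16
This paper reviews numerical methods for computing the unsteady motion of flight vehicles, covering dynamic grid techniques and the corresponding numerical discretization schemes. According to grid topology, it focuses on unsteady computational methods based on structured grids and on those based on unstructured/hybrid grids, and compares the advantages and disadvantages of each. For structured-grid methods, it highlights dynamic structured grid generation techniques, including rigid moving grids, transfinite interpolation dynamic grids, overset moving grids, and sliding moving grids; it also presents the governing equations in inertial and non-inertial frames and discusses unsteady time discretization and the geometric conservation law for moving-grid computations. For unstructured/hybrid-grid methods, it highlights overset unstructured moving grids, remeshing-based unstructured moving grids, deforming unstructured moving grids, and coupled deforming/remeshing dynamic hybrid grids, together with the corresponding computational schemes, including unsteady time discretization, methods for computing the geometric conservation law, computational methods for compressible and incompressible unsteady flows, and various convergence acceleration techniques. Alongside progress in China and abroad, the authors' own research and applications in dynamic hybrid grid generation and the associated unsteady methods are described.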
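
The geometric conservation law mentioned in this review can be checked on a small 1‐D moving‐grid example. The sketch below is an illustration under stated assumptions, not a scheme taken from the review: it uses a first‐order upwind arbitrary Lagrangian–Eulerian update for the advection equation u_t + a u_x = 0, with face velocities computed from the actual node displacement over the step, and zero‐gradient values at the two boundary faces. The function name `ale_upwind_step` is hypothetical.

```python
def ale_upwind_step(u, x, x_new, dt, a):
    """One conservative upwind step for u_t + a u_x = 0 on a moving 1-D grid."""
    n = len(u)
    # Face velocities taken from the node motion, so that the swept volume
    # dt * (w[i+1] - w[i]) equals the change in cell size exactly (discrete GCL)
    w = [(xn - xo) / dt for xo, xn in zip(x, x_new)]
    flux = []
    for i in range(n + 1):
        left = u[max(i - 1, 0)]    # zero-gradient value at the left boundary
        right = u[min(i, n - 1)]   # zero-gradient value at the right boundary
        rel = a - w[i]             # advection speed relative to the moving face
        flux.append(rel * (left if rel >= 0.0 else right))
    u_new = []
    for i in range(n):
        vol_old = x[i + 1] - x[i]
        vol_new = x_new[i + 1] - x_new[i]
        u_new.append((u[i] * vol_old - dt * (flux[i + 1] - flux[i])) / vol_new)
    return u_new


x_old = [0.0, 1.0, 2.0, 3.0]
x_moved = [0.0, 1.1, 1.9, 3.0]  # interior nodes move, outer boundaries stay fixed
u_uniform = ale_upwind_step([2.0, 2.0, 2.0], x_old, x_moved, dt=0.1, a=1.0)
```

A uniform state stays uniform after the step even though every cell changes size, which is the property the geometric conservation law is meant to guarantee. If the face velocities were prescribed independently of the node displacement, this preservation would generally be lost.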
