期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A new implementation of the CMRH method for solving dense linear systems

M. Heyouni H. Sadok 《Journal of Computational and Applied Mathematics》2008

The CMRH method [H. Sadok, Méthodes de projections pour les systèmes linéaires et non linéaires, Habilitation thesis, University of Lille1, Lille, France, 1994; H. Sadok, CMRH: A new method for solving nonsymmetric linear systems based on the Hessenberg reduction algorithm, Numer. Algorithms 20 (1999) 303–321] is an algorithm for solving nonsymmetric linear systems in which the Arnoldi component of GMRES is replaced by the Hessenberg process, which generates Krylov basis vectors which are orthogonal to standard unit basis vectors rather than mutually orthogonal. The iterate is formed from these vectors by solving a small least squares problem involving a Hessenberg matrix. Like GMRES, this method requires one matrix–vector product per iteration. However, it can be implemented to require half as much arithmetic work and less storage. Moreover, numerical experiments show that this method performs accurately and reduces the residual about as fast as GMRES. With this new implementation, we show that the CMRH method is the only method with long-term recurrence which requires not storing at the same time the entire Krylov vectors basis and the original matrix as in the GMRES algorithm. A comparison with Gaussian elimination is provided. 相似文献

2.

Algorithms for the CMRH method for dense linear systems

Sébastien Duminil Mohammed Heyouni Philippe Marion Hassane Sadok 《Numerical Algorithms》2016,71(2):383-394

The CMRH (Changing Minimal Residual method based on the Hessenberg process) method is a Krylov subspace method for solving large linear systems with non-symmetric coefficient matrices. CMRH generates a (non orthogonal) basis of the Krylov subspace through the Hessenberg process, and minimizes a quasi-residual norm. On dense matrices, the CMRH method is less expensive and requires less storage than other Krylov methods. In this work, we describe Matlab codes for the best of these implementations. Fortran codes for sequential and parallel implementations are also presented. 相似文献

3.

A flexible CMRH algorithm for nonsymmetric linear systems

Ke Zhang Chuanqing Gu 《Journal of Applied Mathematics and Computing》2014,45(1-2):43-61

A flexible version of the CMRH algorithm is presented that allows varying preconditioning at every step of the algorithm. A consequence of the flexibility of this new variant is that any iterative methods can be incorporated as a preconditioner in the inner steps. Theoretical results that relate the residual norm of the new algorithm and the flexible GMRES, the new algorithm with CMRH itself, are given. Numerical experiments are carried out to illustrate the effectiveness of the proposed algorithm in comparison with the standard CMRH algorithm, ILU-preconditioned CMRH variants and the flexible GMRES algorithm. 相似文献

4.

CMRH: A new method for solving nonsymmetric linear systems based on the Hessenberg reduction algorithm

H. Sadok 《Numerical Algorithms》1999,20(4):303-321

The Generalized Minimal Residual (GMRES) method and the Quasi-Minimal Residual (QMR) method are two Krylov methods for solving linear systems. The main difference between these methods is the generation of the basis vectors for the Krylov subspace. The GMRES method uses the Arnoldi process while QMR uses the Lanczos algorithm for constructing a basis of the Krylov subspace. In this paper we give a new method similar to QMR but based on the Hessenberg process instead of the Lanczos process. We call the new method the CMRH method. The CMRH method is less expensive and requires slightly less storage than GMRES. Numerical experiments suggest that it has behaviour similar to GMRES. This revised version was published online in June 2006 with corrections to the Cover Date. 相似文献

5.

A parallel implementation of the restarted GMRES iterative algorithm for nonsymmetric systems of linear equations

Rudnei Dias da Cunha Tim Hopkins 《Advances in Computational Mathematics》1994,2(3):261-277

We describe the parallelisation of the GMRES(c) algorithm and its implementation on distributed-memory architectures, using both networks of transputers and networks of workstations under the PVM message-passing system. The test systems of linear equations considered are those derived from five-point finite-difference discretisations of partial differential equations. A theoretical model of the computation and communication phases is presented which allows us to decide for which values of the parameterc our implementation executes efficiently. The results show that for reasonably large discretisation grids the implementations are effective on a large number of processors. 相似文献

6.

A parallel implementation of Wang’s method for solving tridiagonal systems

《Chaos, solitons, and fractals》2007,31(3):580-585

In this paper, a parallel implementation of Wang’s method for solving tridiagonal system of equations on the multiprocessor machine using occam language is presented. The parallel algorithm has been designed for shared and distributed memory machine that support data parallel and message passing. The over all performance of this implementation on 9 each of processors is given. The communication times are very important and any improvement on this communication would have a significant performance of the implementation. The significance of these results are discussed. 相似文献

7.

The global Hessenberg and CMRH methods for linear systems with multiple right-hand sides 总被引：1，自引：0，他引：1

M. Heyouni 《Numerical Algorithms》2001,26(4):317-332

In this paper, we introduce two new methods for solving large sparse nonsymmetric linear systems with several right-hand sides. These methods are the global Hessenberg and global CMRH methods. Using the global Hessenberg process, these methods are less expensive than the global FOM and global GMRES methods [9]. Theoretical results about the new methods are given, and experimental results that show good performances of these new methods are presented. 相似文献

8.

A data parallel augmenting path algorithm for the dense linear many-to-one assignment problem

Olof Damberg Sverre Storøy Tor Sørevik 《Computational Optimization and Applications》1996,6(3):251-272

The purpose of this study is to describe a data parallel primal-dual augmenting path algorithm for the dense linear many-to-one assignment problem also known as semi-assignment. This problem could for instance be described as assigning n persons to m(n) job groups.The algorithm is tailored specifically for massive SIMD parallelism and employs, in this context, a new efficient breadth-first-search augmenting path technique which is found to be faster than the shortest augmenting path search normally used in sequential algorithms for this problem. We show that the best known sequential computational complexity of O(mn ²) for dense problems, is reduced to the parallel complexity of O(mn), on a machine with n processors supporting reductions in O(1) time. The algorithm is easy to implement efficiently on commercially available massively parallel computers. A range of numerical experiments are performed on a Connection Machine CM200 and a MasPar MP-2. The tests show the good performance of the proposed algorithm. 相似文献

9.

A parallel balance scheme for banded linear systems

Gene H. Golub Ahmed H. Sameh Vivek Sarin 《Numerical Linear Algebra with Applications》2001,8(5):297-316

A parallel algorithm is proposed for the solution of narrow banded non‐symmetric linear systems. The linear system is partitioned into blocks of rows with a small number of unknowns common to multiple blocks. Our technique yields a reduced system defined only on these common unknowns which can then be solved by a direct or iterative method. A projection based extension to this approach is also proposed for computing the reduced system implicitly, which gives rise to an inner–outer iteration method. In addition, the product of a vector with the reduced system matrix can be computed efficiently on a multiprocessor by concurrent projections onto subspaces of block rows. Scalable implementations of the algorithm can be devized for hierarchical parallel architectures by exploiting the two‐level parallelism inherent in the method. Our experiments indicate that the proposed algorithm is a robust and competitive alternative to existing methods, particularly for difficult problems with strong indefinite symmetric part. Copyright © 2001 John Wiley & Sons, Ltd. 相似文献

10.

A parallel algorithm for solving Toeplitz linear systems 总被引：1，自引：0，他引：1

L.E. Garey R.E. Shaw 《Applied mathematics and computation》1999,100(2-3):241-247

Numerical methods of solution are considered for systems which are Toeplitz and symmetric. In our case, the coefficient matrix is essentially tridiagonal and sparse. There are two distinct approaches to be considered each of which is efficient in its own way. Here we will combine the two approaches which will allow application of the cyclic reduction method to coefficient matrices of more general forms. The convergence of the approximations to the exact solution will also be examined. Solving linear systems by the adapted cyclic reduction method can be parallel processed. 相似文献

11.

A parallel implementation of overlapping Schwarz method for the dual reciprocity method

Ke Chen Kamal Shanazari 《Applied mathematics and computation》2010,216(11):3355-3364

We present a parallel algorithm for the overlapping domain decomposition boundary integral equation method for two dimensional partial differential equations. In addition to the improvement of the ill-conditioning and the computational efficiency achieved by domain partitioning, using a parallel computer with p processors can offer up to p times efficiency. Assuming direct solution is used throughout, partitioning the domain into p subregions and employing a processor for each subproblem, overall, result in p² times efficiency over using a single domain and a single processor, taking into account that a sequential algorithm of the underlying method can improve the computational efficiency at least p times over using a single domain. Some numerical results showing the efficiency of the parallel technique will be presented. 相似文献

12.

An implementation of a parallel primal-dual interior point method for block-structured linear programs

Irvin J. Lustig Guangye Li 《Computational Optimization and Applications》1992,1(2):141-161

An implementation of the primal-dual predictor-corrector interior point method is specialized to solve block-structured linear programs with side constraints. The block structure of the constraint matrix is exploited via parallel computation. The side constraints require the Cholesky factorization of a dense matrix, where a method that exploits parallelism for the dense Cholesky factorization is used. For testing, multicommodity flow problems were used. The resulting implementation is 65%–90% efficient, depending on the problem instance. For a problem with K commodities, an approximate speedup for the interior point method of 0.8K is realized. 相似文献

13.

A parallel implementation of the nested decomposition algorithm for multistage stochastic linear programs

John R. Birge Christopher J. Donohue Derek F. Holmes Oleg G. Svintsitski 《Mathematical Programming》1996,75(2):327-352

Multistage stochastic linear programs can represent a variety of practical decision problems. Solving a multistage stochastic program can be viewed as solving a large tree of linear programs. A common approach for solving these problems is the nested decomposition algorithm, which moves up down the tree by solving nodes and passing information among nodes. The natural independence of subtrees suggests that much of the computational effort of the nested decomposition algorithm can run in parallel across small numbers of fast processors. This paper explores the advantages of such parallel implementations over serial implementations and compares alternative sequencing protocols for parallel processors. Computational experience on a large test set of practical problems with up to 1.5 million constraints and almost 5 million variables suggests that parallel implementations may indeed work well, but they require careful attention to processor load balancing. Supported in part by the National Science Foundation under Grants DDM-9215921 and SES-9211937. 相似文献

14.

Matrix-free iterative solution strategies for large dense linear systems

S. A. Goreinov E. E. Tyrtyshnikov A. Yu. Yeremin 《Numerical Linear Algebra with Applications》1997,4(4):273-294

A purely algebric approach to solving very large general unstructured dense linear systems, in particular, those that arise in 3D boundary integral applications is suggested. We call this technique the matrix-free approach because it allows one to avoid the necessity of storing the whole coefficient matrix in any form, which provides significant memory and arithmetic savings. We propose to approximate a non-singular coefficient matrix A by a block low-rank matrix Ã and to use the latter when performing matrix–vector multiplications in iterative solution algorithms. Such approximations are shown to be easily computable, and a reliable a posteriori accuracy estimate of ‖A − Ã‖₂ is derived. We prove that block low-rank approximations are sufficiently accurate for some model cases. However, even in the absence of rigorous proof of the existence of accurate approximations, one can apply the algorithm proposed to compute a block low-rank approximation and then make a decision on its practical suitability. We present numerical examples for the 3D CEM and CFD integral applications, which show that, at least for some industrial applications, the matrix-free approach is robust and cost-effective. © 1997 John Wiley & Sons, Ltd. 相似文献

15.

A parallel projection method for solving generalized linear least-squares problems

Gang Lou Shih-Ping Han 《Numerische Mathematik》1988,53(3):255-264

Summary A parallel projection algorithm is proposed to solve the generalized linear least-squares problem: find a vector to minimize the 2-norm distance from its image under an affine mapping to a closed convex cone. In each iteration of the algorithm the problem is decomposed into several independent small problems of finding projections onto subspaces, which are simple and can be tackled parallelly. The algorithm can be viewed as a dual version of the algorithm proposed by Han and Lou [8]. For the special problem under consideration, stronger convergence results are established. The algorithm is also related to the block iterative methods of Elfving [6], Dennis and Steihaug [5], and the primal-dual method of Springarn [14].This material is based on work supported in part by the National Science foundation under Grant DMS-8602416 and by the Center for Supercomputing Research and Development, University of Illinois at Urbana-Champaign 相似文献

16.

A discretization method for systems of linear inequalities

K. H. Meyn 《Journal of Optimization Theory and Applications》1981,34(3):355-369

We investigate a method for approximating a convex domainCR ⁿ described by a (possibly infinite) set of linear inequalities. In contrast to other methods, the approximating domains (polyhedrons) lie insideC. We discuss applications to semi-infinite programming and present numerical examples.The paper was written at the Institut für Angewandte Mathematik, Universität Hamburg, Hamburg, West Germany. The author thanks Prof. U. Eckhardt, Dr. K. Roleff, and Prof. B. Werner for helpful discussions. 相似文献

17.

LU implementation of the modified minimal polynomial extrapolation method for solving linear and nonlinear systems

Jbilou K; Sadok H 《IMA Journal of Numerical Analysis》1999,19(4):549-561

We give an efficient implementation of the modified minimalpolynomial extrapolation (MMPE) method for solving linear andnonlinear systems. We will show how to choose the auxiliaryvectors in the MMPE method such that the resulting approximationsare always defined. This new implementation, which is basedon an LU factorization with a pivoting strategy, is inexpensiveboth in time and storage as compared with other extrapolationmethods. 相似文献

18.

A preconditioned GMRES for complex dense linear systems from electromagnetic wave scattering problems

Angelika Bunse-Gerstner 《Linear algebra and its applications》2006,416(1):135-147

We consider the numerical solution of linear systems arising from the discretization of the electric field integral equation (EFIE). For some geometries the associated matrix can be poorly conditioned making the use of a preconditioner mandatory to obtain convergence. The electromagnetic scattering problem is here solved by means of a preconditioned GMRES in the context of the multilevel fast multipole method (MLFMM). The novelty of this work is the construction of an approximate hierarchically semiseparable (HSS) representation of the near-field matrix, the part of the matrix capturing interactions among nearby groups in the MLFMM, as preconditioner for the GMRES iterations. As experience shows, the efficiency of an ILU preconditioning for such systems essentially depends on a sufficient fill-in, which apparently sacrifices the sparsity of the near-field matrix. In the light of this experience we propose a multilevel near-field matrix and its corresponding HSS representation as a hierarchical preconditioner in order to substantially reduce the number of iterations in the solution of the resulting system of equations. 相似文献

19.

A modified ASOR-like method for augmented linear systems

Feng Ting-Ting Guo Xue-Ping Chen Guo-Liang 《Numerical Algorithms》2019,82(3):1097-1115

Numerical Algorithms - For solving an augmented linear system, Njeru and Guo presented an accelerated SOR-like (ASOR) method in (P. N. Njeru and X.-P. Guo. Accelerated SOR-like method for augmented... 相似文献

20.

A parallel projection method for overdetermined nonlinear systems of equations 总被引：1，自引：0，他引：1

Maria A. Diniz-Ehrhardt José Mario Martínez 《Numerical Algorithms》1993,4(2):241-262

We consider overdetermined nonlinear systems of equationsF(x)=0, whereF: ⁿ ^m,mn. For this type of systems we define weighted least square distance (WLSD) solutions, which represent an alternative to classical least squares solutions and to other solutions based on residual normas. We introduce a generalization of the classical method of Cimmino for linear systems and we prove local convergence results. We introduce a practical strategy for improving the global convergence properties of the method. Finally, numerical experiments are presented.Work supported by FAPESP (Grant 90/3724/6), FINEP, CNPq and FAEP-UNICAMP. 相似文献