首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A Faster and Simpler Recursive Algorithm for the LAPACK Routine DGELS
Authors:Erik Elmroth  Fred G Gustavson
Institution:(1) Department of Computing Science and HPC2N, Umeå University, SE-901 87 Umeå, Sweden;(2) IBM T.J. Watson Research Center, P.O. Box 218, Yorktown Heights, NY 10598, USA
Abstract:We present new algorithms for computing the linear least squares solution to overdetermined linear systems and the minimum norm solution to underdetermined linear systems. For both problems, we consider the standard formulation min VerbarAXBVerbar F and the transposed formulation min VerbarA T XBVerbar F , i.e, four different problems in all. The functionality of our implementation corresponds to that of the LAPACK routine DGELS. The new implementation is significantly faster and simpler. It outperforms the LAPACK DGELS for all matrix sizes tested. The improvement is usually 50–100% and it is as high as 400%. The four different problems of DGELS are essentially reduced to two, by use of explicit transposition of A. By explicit transposition we avoid computing Householder transformations on vectors with large stride. The QR factorization of block columns of A is performed using a recursive level-3 algorithm. By interleaving updates of B with the factorization of A, we reduce the number of floating point operations performed for the linear least squares problem. By avoiding redundant computations in the update of B we reduce the work needed to compute the minimum norm solution. Finally, we outline fully recursive algorithms for the four problems of DGELS as well as for QR factorization.This revised version was published online in October 2005 with corrections to the Cover Date.
Keywords:Overdetermined systems  underdetermined systems  linear least squares  minimum norm solution  recursion  high performance library software
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号