论文标题
通过示例理解CG和GMRE
Towards understanding CG and GMRES through examples
论文作者
论文摘要
当大约70年前,兰开斯(Lanczos),赫斯滕斯(Hestenes)和斯蒂芬(Stiefel)提出了求解线性代数系统的CG方法时,它被认为是具有数学有限终止特性的迭代过程。 CG被置于丰富的数学环境中,包括与高斯正交和持续分数的联系。 CG的最佳性能是通过归一化的加权多项式最小二乘近似为零描述的。这个高度非线性的问题解释了CG迭代对给定数据的适应。卡鲁什(Karush)和海斯(Hayes)立即考虑了无限尺寸希尔伯特(Hilbert)空间中的CG,并研究了其超线性收敛。从那时起,CG和其他Krylov子空间方法的视图发生了变化。如今,这些方法主要用作计算工具,通常使用基于特征值聚类的线性上限或启发式方法来表征它们的行为。这种简化限制了数学理解,也会对其实际应用产生负面影响。 本文提供了不同的观点。它专注于CG和GMRE,它提出了数学上重要且实际相关的现象,这些现象通过讨论计算的示例来揭示其行为。这些示例提供了一种易于访问的方法,可以理解这些方法,同时给出了文献中更详细的分析。这种方法使读者可以选择适合其意图的深度和彻底的水平。本文提出的一些观点说明了众所周知的事实。其他人则挑战主流观点并解释现有的误解。有几点指的是导致开放问题的最新结果。我们认为CG和GMRE对于其他Krylov子空间方法的数学理解,进一步的发展以及实际应用至关重要。
When the CG method for solving linear algebraic systems was formulated about 70 years ago by Lanczos, Hestenes, and Stiefel, it was considered an iterative process possessing a mathematical finite termination property. CG was placed into a rich mathematical context, including links with Gauss quadrature and continued fractions. The optimality property of CG was described via a normalized weighted polynomial least squares approximation to zero. This highly nonlinear problem explains the adaptation of CG iterates to the given data. Karush and Hayes immediately considered CG in infinite dimensional Hilbert spaces and investigated its superlinear convergence. Since then, the view of CG and other Krylov subspace methods has changed. Today these methods are primarily used as computational tools, and their behavior is typically characterized using linear upper bounds or heuristics based on clustering of eigenvalues. Such simplifications limit the mathematical understanding and also negatively affect their practical application. This paper offers a different perspective. Focusing on CG and GMRES, it presents mathematically important and practically relevant phenomena that uncover their behavior through a discussion of computed examples. These examples provide an easily accessible approach that enables understanding of the methods, while pointers to more detailed analyses in the literature are given. This approach allows readers to choose the level of depth and thoroughness appropriate for their intentions. Some of the points made in this paper illustrate well known facts. Others challenge mainstream views and explain existing misunderstandings. Several points refer to recent results leading to open problems. We consider CG and GMRES crucially important for the mathematical understanding, further development, and practical applications also of other Krylov subspace methods.