Abstract
In this study, we introduce algorithms optimized for GPU architectures, aimed at efciently solving large sparse linear systems, a central challenge in Navier-Stokes pressure projection problems. Our approach includes an adaptation of the GMRES algorithm, drawing inspiration from the merged vector operations frst proposed by Bielich et al. [Parallel Comput., 112 (2022), 102940]. This adaptation increases computational intensity on GPU platforms through optimized vector update strategies. The algorithm incorporates modifed and classical Gram-Schmidt methods with an algebraic multigrid (AMG) preconditioner, each tailored for GPU performance. A key innovation in our work is the development of a Gram-Schmidt projector Pk employing a rank-1 perturbation of the identity matrix. Designed to maximize the high memory bandwidth utilization of the AMD MI-250X GPU, this approach includes a strategy for treating the unit diagonal that minimizes memory reads, leading to a 2 5 \ increase in computational efciency. The application of perturbation \ theory further ensures that orthogonality loss is limited to \ (\) k, where k is the number of iterations. Additionally, we introduce a mixed AMG V -cycle strategy combining ILU(0) and \1-Jacobi smoothers, which achieves a 30-50\ reduction in GPU compute times compared to conventional methods, while maintaining low backward error. This strategy, alongside our novel treatment of the diagonal in triangular matrices, marks a substantial increase in AMG efcicency for GPU systems. We believe that these contributions represent a signifcant advance in optimizing GMRES+AMG algorithms for GPU computations. The empirical results demonstrate notable speed increments and maintain rigorous backward error bounds, underscoring the potential of our methods to substantially increase computational efciency in large-scale scientifc applications.
| Original language | English |
|---|---|
| Pages (from-to) | S246-S270 |
| Journal | SIAM Journal on Scientific Computing |
| Volume | 46 |
| Issue number | 5 |
| DOIs | |
| State | Published - 2024 |
| Externally published | Yes |
Keywords
- GMRES
- Gram-Schmidt
- algebraic multigrid
- composite
- mixed V-cycle
- smoothers