Abstract
Least Squares with QR-factorization (LSQR) method is a widely used Krylov subspace algorithm to solve sparse rectangular linear systems for tomographic problems. Traditional parallel implementations of LSQR have the potential, depending on the non-zero structure of the matrix, to have significant communication cost. The communication cost can dramatically limit the scalability of the algorithm at large core counts. We describe a scalable parallel LSQR algorithm that utilizes the particular non-zero structure of matrices that occurs in tomographic problems. In particular, we specially treat the kernel component of the matrix, which is relatively dense with a random structure, and the damping component, which is very sparse and highly structured separately. The resulting algorithm has a scalable communication volume with a bounded number of communication neighbors regardless of core count. We present scaling studies from real seismic tomography datasets that illustrate good scalability up to O(10, 000) cores on a Cray XT cluster.
| Original language | English |
|---|---|
| Pages (from-to) | 581-590 |
| Number of pages | 10 |
| Journal | Procedia Computer Science |
| Volume | 18 |
| DOIs | |
| State | Published - 2013 |
| Event | 13th Annual International Conference on Computational Science, ICCS 2013 - Barcelona, Spain Duration: Jun 5 2013 → Jun 7 2013 |
Keywords
- LSQR
- MPI
- Matrix vector multiplication
- Parallel scientific computing
- Scalable communication
- Seismic tomography
- Structural seismology
- Tomographic problems