Analysis of MURaM, a Solar Physics Application, for Scalability, Performance and Portability

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

With the advent of GPUs in parallel computing, several languages, tools, and compilers are being developed. Many impactful applications can benefit from the performance capabilities these GPUs provide, but moving large, complex code bases to GPU execution often poses many hurdles and growing pains as developers adapt to unfamiliar programming models and interface with increasingly complex, but powerful, hardware. One such advanced model is OpenACC, a directive-based parallel programming model designed to target various architectures using a single source code. In this paper we present our experiences using OpenACC to bring GPU acceleration to MURaM, a state-of-the-art solar physics application jointly developed and used by the National Center for Atmospheric Research (NCAR) and the Max Planck Institute for Solar System Research (MPS). Our work presents several challenges we faced in mapping general parallel concepts to low-level GPU hardware, along with the corresponding performance penalties inherent to these models. While OpenACC provides architecture portability, it lacks performance portability in some cases. We discuss possible solutions to this performance portability problem and create prototypes to explore what these solutions could gain for MURaM. We then provide scaling results and findings from transitioning to current-generation GPU architectures, with strong and weak scaling on up to 512 NVIDIA A100 GPUs, observing that several portions of the code could perform and scale significantly better with the inclusion of more advanced hardware features in OpenACC. On our HPC systems, the current performance of MURaM shows that one A100 GPU provides roughly as much throughput as 90-100 CPU cores, while also scaling further than CPU runs are capable of.

Original language: English
Title of host publication: Proceedings of 2023 SC Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC Workshops 2023
Publisher: Association for Computing Machinery
Pages: 1929-1938
Number of pages: 10
ISBN (Electronic): 9798400707858
State: Published - Nov 12 2023
Event: 2023 International Conference on High Performance Computing, Network, Storage, and Analysis, SC Workshops 2023 - Denver, United States
Duration: Nov 12 2023 to Nov 17 2023

Publication series

Name: ACM International Conference Proceeding Series

Conference

Conference: 2023 International Conference on High Performance Computing, Network, Storage, and Analysis, SC Workshops 2023
Country/Territory: United States
City: Denver
Period: 11/12/23 to 11/17/23

Keywords

  • GPU, solar physics
  • directive-based programming models
  • magnetohydrodynamics
  • radiation transport
