Optimizing Weather Model Radiative Transfer Physics for Intel's Many Integrated Core (MIC) Architecture

John Michalakes, Michael J. Iacono, Elizabeth R. Jessup

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

Large numerical weather prediction (NWP) codes such as the Weather Research and Forecast (WRF) model and the NOAA Nonhydrostatic Multiscale Model (NMM-B) port easily to Intel's Many Integrated Core (MIC) architecture. But for NWP to significantly realize MIC's one- to two-TFLOP/s peak computational power, we must expose and exploit thread and fine-grained (vector) parallelism while overcoming memory system bottlenecks that starve oating-point performance. We report on our work to improve the Rapid Radiative Transfer Model (RRTMG), responsible for 10-20 percent of total NMM-B run time. We isolated a standalone RRTMG benchmark code and workload from NMM-B and then analyzed performance using hardware performance counters and scaling studies. We restructured the code to improve vectorization, thread parallelism, locality, and thread contention. The restructured code ran three times faster than the original on MIC and, also importantly, 1.3x faster than the original on the host Xeon Sandybridge.

Original languageEnglish
Article number1650019
JournalParallel Processing Letters
Volume26
Issue number4
DOIs
StatePublished - Dec 1 2016

Fingerprint

Dive into the research topics of 'Optimizing Weather Model Radiative Transfer Physics for Intel's Many Integrated Core (MIC) Architecture'. Together they form a unique fingerprint.

Cite this