TY - GEN
T1 - Performance portability of an intermediate-complexity atmospheric research model in coarray Fortran
AU - Rouson, Damian
AU - Gutmann, Ethan D.
AU - Fanfarillo, Alessandro
AU - Friesen, Brian
N1 - Publisher Copyright:
© 2017 Copyright held by the owner/author(s).
PY - 2017/11/12
Y1 - 2017/11/12
N2 - We examine the scalability and performance of an open-source, coarray Fortran (CAF) mini-application (mini-app) that implements the parallel, numerical algorithms that dominate the execution of The Intermediate Complexity Atmospheric Research (ICAR) [4] model developed at the the National Center for Atmospheric Research (NCAR). The Fortran 2008 mini-app includes one Fortran 2008 implementation of a collective subroutine defined in the Committee Draft of the upcoming Fortran 2018 standard. The ability of CAF to run atop various communication layers and the increasing CAF compiler availability facilitated evaluating several compilers, runtime libraries and hardware platforms. Results are presented for the GNU and Cray compilers, each of which offers different parallel runtime libraries employing one or more communication layers, including MPI, OpenSHMEM, and proprietary alternatives. We study performance on multi- and many-core processors in distributed memory. The results show promising scaling across a range of hardware, compiler, and runtime choices on up to ∼100,000 cores.
AB - We examine the scalability and performance of an open-source, coarray Fortran (CAF) mini-application (mini-app) that implements the parallel, numerical algorithms that dominate the execution of The Intermediate Complexity Atmospheric Research (ICAR) [4] model developed at the the National Center for Atmospheric Research (NCAR). The Fortran 2008 mini-app includes one Fortran 2008 implementation of a collective subroutine defined in the Committee Draft of the upcoming Fortran 2018 standard. The ability of CAF to run atop various communication layers and the increasing CAF compiler availability facilitated evaluating several compilers, runtime libraries and hardware platforms. Results are presented for the GNU and Cray compilers, each of which offers different parallel runtime libraries employing one or more communication layers, including MPI, OpenSHMEM, and proprietary alternatives. We study performance on multi- and many-core processors in distributed memory. The results show promising scaling across a range of hardware, compiler, and runtime choices on up to ∼100,000 cores.
KW - Coarray Fortran
KW - Computational hydrometeorology
UR - https://www.scopus.com/pages/publications/85044155723
U2 - 10.1145/3144779.3169104
DO - 10.1145/3144779.3169104
M3 - Conference contribution
AN - SCOPUS:85044155723
T3 - Proceedings of PAW 2017: 2nd Annual PGAS Applications Workshop - Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage and Analysis
SP - 1
EP - 4
BT - Proceedings of PAW 2017
PB - Association for Computing Machinery, Inc
T2 - 2nd Annual PGAS Applications Workshop, PAW 2017
Y2 - 12 November 2017 through 17 November 2017
ER -