Automatically Parallelizing Batch Inference on Deep Neural Networks Using Fiats and Fortran 2023 “Do Concurrent”

  • Damian Rouson
  • , Zhe Bai
  • , Dan Bonachea
  • , Kareem Ergawy
  • , Ethan Gutmann
  • , Michael Klemm
  • , Katherine Rasmussen
  • , Brad Richardson
  • , Sameer Shende
  • , David Torres
  • , Yunhao Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper introduces novel programming strategies that leverage features of the Fortran 2023 standard of the International Standards Organization (ISO) to automatically parallelize computations on deep neural networks. The paper focuses on the interplay of object-oriented, parallel, and functional programming paradigms in the Fiats deep learning library. We demonstrate how several infrequently-used language features play a role in enabling efficient, parallel execution. Specifically, the ability to explicitly declare that a procedure is pure facilitates inference in the context of the language’s loop-parallelism construct do concurrent. Also, explicitly prohibiting the overriding of a parent type’s type-bound procedures eliminates the need for dynamic dispatch in performance-critical code. Finally, this paper uses batch inference calculations on a neural network surrogate for atmospheric aerosol dynamics to demonstrate that LLVM Flang compiler’s automatic parallelization of do concurrent achieves roughly the same performance and scalability as achieved by OpenMP compiler directives. We also demonstrate that double-precision inference costs 37–72% longer runtime than default-real precision with most values in the range 57–60%.

Original languageEnglish
Title of host publicationHigh Performance Computing - ISC High Performance 2025 International Workshops, Revised Selected Papers
EditorsSarah Neuwirth, Arnab Kumar Paul, Tobias Weinzierl, Erin Claire Carson
PublisherSpringer Science and Business Media Deutschland GmbH
Pages135-147
Number of pages13
ISBN (Print)9783032076113
DOIs
StatePublished - 2026
Externally publishedYes
Event40th International Conference on High Performance Computing, ISC High Performance 2025 - Hamburg, Germany
Duration: Jun 10 2025Jun 13 2025

Publication series

NameLecture Notes in Computer Science
Volume16091 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference40th International Conference on High Performance Computing, ISC High Performance 2025
Country/TerritoryGermany
CityHamburg
Period06/10/2506/13/25

Keywords

  • Atmospheric Sciences
  • Deep learning
  • Fortran

Fingerprint

Dive into the research topics of 'Automatically Parallelizing Batch Inference on Deep Neural Networks Using Fiats and Fortran 2023 “Do Concurrent”'. Together they form a unique fingerprint.

Cite this