Light-weight parallel Python tools for earth system modeling workflows

Kevin Paul, Sheri Mickelson, John M. Dennis, Haiying Xu, David Brown

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Scopus citations

Abstract

In the last 30 years, earth system modeling has become increasingly data-intensive. The Community Earth System Model (CESM) response to the next Intergovernmental Panel on Climate Change (IPCC) assessment report (AR6) may require close to 1 Billion CPU hours of computation and generate up to 12 PB of raw data for post-processing. Existing post-processing tools are serial-only and impossibly slow with this much data. To improve the post-processing performance, our team has adopted a strategy of targeted replacement of the «bottleneck software» with light-weight parallel Python alternatives. This allows maximum impact with the least disruption to the CESM community and the shortest delivery time. We developed two light-weight parallel Python tools: one to convert model output from time-slice to time-series format, and one to perform fast time-averaging of time-series data. We present the motivation, approach, and results of these two tools, and our plans for future research and development.

Original languageEnglish
Title of host publicationProceedings - 2015 IEEE International Conference on Big Data, Big Data 2015
EditorsHoward Ho, Beng Chin Ooi, Mohammed J. Zaki, Xiaohua Hu, Laura Haas, Vipin Kumar, Sudarsan Rachuri, Shipeng Yu, Morris Hui-I Hsiao, Jian Li, Feng Luo, Saumyadipta Pyne, Kemafor Ogan
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1985-1994
Number of pages10
ISBN (Electronic)9781479999255
DOIs
StatePublished - Dec 22 2015
Event3rd IEEE International Conference on Big Data, Big Data 2015 - Santa Clara, United States
Duration: Oct 29 2015Nov 1 2015

Publication series

NameProceedings - 2015 IEEE International Conference on Big Data, IEEE Big Data 2015

Conference

Conference3rd IEEE International Conference on Big Data, Big Data 2015
Country/TerritoryUnited States
CitySanta Clara
Period10/29/1511/1/15

Keywords

  • Data processing
  • High performance computing
  • Parallel processing
  • Scientific computing

Fingerprint

Dive into the research topics of 'Light-weight parallel Python tools for earth system modeling workflows'. Together they form a unique fingerprint.

Cite this