A methodology for evaluating the impact of data compression on climate simulation data

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

83 Scopus citations

Abstract

High-resolution climate simulations require tremendous computing resources and can generate massive datasets. At present, preserving the data from these simulations consumes vast storage resources at institutions such as the National Center for Atmospheric Research (NCAR). The historical data generation trends are economically unsustainable, and storage resources are already beginning to limit science objectives. To mitigate this problem, we investigate the use of data compression techniques on climate simulation data from the Community Earth System Model. Ultimately, to convince climate scientists to compress their simulation data, we must be able to demonstrate that the reconstructed data reveals the same mean climate as the original data, and this paper is a first step toward that goal. To that end, we develop an approach for verifying the climate data and use it to evaluate several compression algorithms. We find that the diversity of the climate data requires the individual treatment of variables, and, in doing so, the reconstructed data can fall within the natural variability of the system, while achieving compression rates of up to 5:1.

Original languageEnglish
Title of host publicationHPDC 2014 - Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing
PublisherAssociation for Computing Machinery
Pages203-214
Number of pages12
ISBN (Print)9781450327480
DOIs
StatePublished - 2014
Event23rd ACM Symposium on High-Performance Parallel and Distributed Computing, HPDC 2014 - Vancouver, BC, Canada
Duration: Jun 23 2014Jun 27 2014

Publication series

NameHPDC 2014 - Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing

Conference

Conference23rd ACM Symposium on High-Performance Parallel and Distributed Computing, HPDC 2014
Country/TerritoryCanada
CityVancouver, BC
Period06/23/1406/27/14

Keywords

  • Data compression
  • High performance computing

Fingerprint

Dive into the research topics of 'A methodology for evaluating the impact of data compression on climate simulation data'. Together they form a unique fingerprint.

Cite this