A collaborative effort to improve lossy compression methods for climate data

Dorit M. Hammerling, Allison H. Baker, Alexander Pinard, Peter Lindstrom

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Climate model simulations produce large volumes of data, and reducing the storage burden with data compression is increasingly of interest to climate scientists. A key concern to the climate community, though, is ensuring that any data loss due to compression does not in any way affect their scientific analysis. For this reason, the climate community is taking a cautious approach to adopting lossy compression by carefully investigating the potential existence of artifacts due to compression in a wide variety of analysis settings. Spatio-temporal statistical analysis in particular can highlight compression-induced features that would go unnoticed by the standard metrics common to the data compression community. Communicating such findings to the algorithm developers in the context of a collaborative improvement cycle is one - in our view productive - way to foster trust within the climate community and pave the way for eventual adoption of lossy compression. In this work, we report on the initial results of a successful and mutually beneficial collaboration between the two communities that led to improvements in a well regarded compression algorithm and more effective compression of climate simulation data.

Original languageEnglish
Title of host publicationProceedings of DRBSD-5 2019
Subtitle of host publication5th International Workshop on Data Analysis and Reduction for Big Scientific Data - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages16-22
Number of pages7
ISBN (Electronic)9781728160177
DOIs
StatePublished - Nov 2019
Event5th IEEE/ACM International Workshop on Data Analysis and Reduction for Big Scientific Data, DRBSD-5 2019 - Denver, United States
Duration: Nov 17 2019 → …

Publication series

NameProceedings of DRBSD-5 2019: 5th International Workshop on Data Analysis and Reduction for Big Scientific Data - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis

Conference

Conference5th IEEE/ACM International Workshop on Data Analysis and Reduction for Big Scientific Data, DRBSD-5 2019
Country/TerritoryUnited States
CityDenver
Period11/17/19 → …

Fingerprint

Dive into the research topics of 'A collaborative effort to improve lossy compression methods for climate data'. Together they form a unique fingerprint.

Cite this