TY - GEN
T1 - A collaborative effort to improve lossy compression methods for climate data
AU - Hammerling, Dorit M.
AU - Baker, Allison H.
AU - Pinard, Alexander
AU - Lindstrom, Peter
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - Climate model simulations produce large volumes of data, and reducing the storage burden with data compression is increasingly of interest to climate scientists. A key concern to the climate community, though, is ensuring that any data loss due to compression does not in any way affect their scientific analysis. For this reason, the climate community is taking a cautious approach to adopting lossy compression by carefully investigating the potential existence of artifacts due to compression in a wide variety of analysis settings. Spatio-temporal statistical analysis in particular can highlight compression-induced features that would go unnoticed by the standard metrics common to the data compression community. Communicating such findings to the algorithm developers in the context of a collaborative improvement cycle is one - in our view productive - way to foster trust within the climate community and pave the way for eventual adoption of lossy compression. In this work, we report on the initial results of a successful and mutually beneficial collaboration between the two communities that led to improvements in a well regarded compression algorithm and more effective compression of climate simulation data.
AB - Climate model simulations produce large volumes of data, and reducing the storage burden with data compression is increasingly of interest to climate scientists. A key concern to the climate community, though, is ensuring that any data loss due to compression does not in any way affect their scientific analysis. For this reason, the climate community is taking a cautious approach to adopting lossy compression by carefully investigating the potential existence of artifacts due to compression in a wide variety of analysis settings. Spatio-temporal statistical analysis in particular can highlight compression-induced features that would go unnoticed by the standard metrics common to the data compression community. Communicating such findings to the algorithm developers in the context of a collaborative improvement cycle is one - in our view productive - way to foster trust within the climate community and pave the way for eventual adoption of lossy compression. In this work, we report on the initial results of a successful and mutually beneficial collaboration between the two communities that led to improvements in a well regarded compression algorithm and more effective compression of climate simulation data.
UR - https://www.scopus.com/pages/publications/85078884274
U2 - 10.1109/DRBSD-549595.2019.00008
DO - 10.1109/DRBSD-549595.2019.00008
M3 - Conference contribution
AN - SCOPUS:85078884274
T3 - Proceedings of DRBSD-5 2019: 5th International Workshop on Data Analysis and Reduction for Big Scientific Data - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis
SP - 16
EP - 22
BT - Proceedings of DRBSD-5 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th IEEE/ACM International Workshop on Data Analysis and Reduction for Big Scientific Data, DRBSD-5 2019
Y2 - 17 November 2019
ER -