TY - GEN
T1 - IsoSim
T2 - 33rd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL 2025
AU - Li, Zhili
AU - Cheng, Qi
AU - Li, Ruohan
AU - Zhu, Feng
AU - Jia, Xiaowei
AU - Xie, Yiqun
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/12/12
Y1 - 2025/12/12
N2 - Isotopic ratios of hydrogen and oxygen in water serve as powerful tracers of the Earth's hydrological cycle, offering insights into the origins of water vapor, large-scale atmospheric circulation, and moisture transport dynamics. However, integrating water isotopes into fully coupled global climate models (GCMs) is both scientifically and technically challenging due to the complex interactions between water isotopes and the atmosphere, hydrosphere, and cryosphere, as well as the extensive modifications to model physics and dynamics. As a result, most GCMs lack support for isotopes and even the few existing isotope-enabled GCMs still remain highly expensive to run, significantly limiting their usability. Machine learning (ML) offers promising opportunities to emulate the complex process as powerful mathematical approximators. The water isotope fields from the emulators bring potential for applications in isotope-unenabled GCMs. However, the absence of a publicly available ML-ready dataset has hindered the development of robust ML-based emulators. To address this gap, we introduce IsoSim, the first ML-ready benchmark dataset designed to facilitate the development of ML emulators for water isotopes in GCMs. This dataset includes global climate variables and water isotope fields across three spatial dimensions (latitude, longitude, and height) from isotope-enabled GCM simulations, spanning 500 years at a monthly resolution. We also include different climatic scenarios and a diverse set of learning-based emulators to carry out extensive evaluations and build the benchmarks. The dataset and results serve as reference points to compare machine learning models' ability in approximating complex physical relationships.
AB - Isotopic ratios of hydrogen and oxygen in water serve as powerful tracers of the Earth's hydrological cycle, offering insights into the origins of water vapor, large-scale atmospheric circulation, and moisture transport dynamics. However, integrating water isotopes into fully coupled global climate models (GCMs) is both scientifically and technically challenging due to the complex interactions between water isotopes and the atmosphere, hydrosphere, and cryosphere, as well as the extensive modifications to model physics and dynamics. As a result, most GCMs lack support for isotopes and even the few existing isotope-enabled GCMs still remain highly expensive to run, significantly limiting their usability. Machine learning (ML) offers promising opportunities to emulate the complex process as powerful mathematical approximators. The water isotope fields from the emulators bring potential for applications in isotope-unenabled GCMs. However, the absence of a publicly available ML-ready dataset has hindered the development of robust ML-based emulators. To address this gap, we introduce IsoSim, the first ML-ready benchmark dataset designed to facilitate the development of ML emulators for water isotopes in GCMs. This dataset includes global climate variables and water isotope fields across three spatial dimensions (latitude, longitude, and height) from isotope-enabled GCM simulations, spanning 500 years at a monthly resolution. We also include different climatic scenarios and a diverse set of learning-based emulators to carry out extensive evaluations and build the benchmarks. The dataset and results serve as reference points to compare machine learning models' ability in approximating complex physical relationships.
KW - earth system models
KW - emulation
KW - machine learning
KW - water isotopes
UR - https://www.scopus.com/pages/publications/105025537312
U2 - 10.1145/3748636.3762764
DO - 10.1145/3748636.3762764
M3 - Conference contribution
AN - SCOPUS:105025537312
T3 - 33rd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL 2025
SP - 526
EP - 537
BT - 33rd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL 2025
A2 - Mokbel, Mohamed
A2 - Shekar, Shashi
A2 - Zufle, Andreas
A2 - Chiang, Yao-Yi
A2 - Damiani, Maria Luisa
A2 - Youssef, Moustafa
PB - Association for Computing Machinery, Inc
Y2 - 3 November 2025 through 6 November 2025
ER -