TY - JOUR
T1 - On Using AI-Based Large-Sample Emulators for Land/Hydrology Model Calibration and Regionalization
AU - Tang, Guoqiang
AU - Wood, Andrew W.
AU - Swenson, Sean
N1 - Publisher Copyright:
© 2025 The Author(s).
PY - 2025/7
Y1 - 2025/7
N2 - AI-based model emulators have emerged as a pragmatic strategy for calibrating Earth System models or their components (e.g., land, atmosphere, ocean), circumventing the previously insurmountable hurdle of the process-heavy models' computational expense. Such emulators require large, spatially diverse data sets for training, however, which—in the land/hydrology context—contrasts with parameter estimation approaches that have traditionally emphasized optimizing model performance for individual basins, followed by similarity-based transfer schemes for parameter regionalization. Compared to calibrating basins individually, direct land/hydrology process model calibration approaches typically perform worse when trained jointly on large collections of basins. Building on insights from large-sample deep learning hydrologic modeling, this study introduces a Large-Sample Emulator (LSE) approach that unifies and streamlines process model parameter calibration and regionalization. Tested across 627 basins in the continental United States using the Community Terrestrial Systems Model (CTSM), the LSE approach consistently improves runoff predictions in all basins, outperforming the Single-Site Emulator (SSE) in both single-objective and multi-objective calibration tasks. Moreover, LSE-based regionalization in unseen basins, evaluated through spatial cross-validation, achieves better results than the default parameters in most cases. This LSE framework offers a promising strategy for effective large-domain process-based model calibration and regionalization.
AB - AI-based model emulators have emerged as a pragmatic strategy for calibrating Earth System models or their components (e.g., land, atmosphere, ocean), circumventing the previously insurmountable hurdle of the process-heavy models' computational expense. Such emulators require large, spatially diverse data sets for training, however, which—in the land/hydrology context—contrasts with parameter estimation approaches that have traditionally emphasized optimizing model performance for individual basins, followed by similarity-based transfer schemes for parameter regionalization. Compared to calibrating basins individually, direct land/hydrology process model calibration approaches typically perform worse when trained jointly on large collections of basins. Building on insights from large-sample deep learning hydrologic modeling, this study introduces a Large-Sample Emulator (LSE) approach that unifies and streamlines process model parameter calibration and regionalization. Tested across 627 basins in the continental United States using the Community Terrestrial Systems Model (CTSM), the LSE approach consistently improves runoff predictions in all basins, outperforming the Single-Site Emulator (SSE) in both single-objective and multi-objective calibration tasks. Moreover, LSE-based regionalization in unseen basins, evaluated through spatial cross-validation, achieves better results than the default parameters in most cases. This LSE framework offers a promising strategy for effective large-domain process-based model calibration and regionalization.
KW - CTSM
KW - hydrological modeling
KW - large-sample emulator
KW - parameter calibration
KW - regionalization
UR - https://www.scopus.com/pages/publications/105009231422
U2 - 10.1029/2024WR039525
DO - 10.1029/2024WR039525
M3 - Article
AN - SCOPUS:105009231422
SN - 0043-1397
VL - 61
JO - Water Resources Research
JF - Water Resources Research
IS - 7
M1 - e2024WR039525
ER -