Recommendations for Comprehensive and Independent Evaluation of Machine Learning-Based Earth System Models

  • P. A. Ullrich
  • , E. A. Barnes
  • , W. D. Collins
  • , K. Dagon
  • , S. Duan
  • , J. Elms
  • , J. Lee
  • , L. R. Leung
  • , D. Lu
  • , M. J. Molina
  • , T. A. O’Brien
  • , F. O. Rebassoo

Research output: Contribution to journalReview articlepeer-review

9 Scopus citations

Abstract

Machine learning (ML) is a revolutionary technology with demonstrable applications across multiple disciplines. Within the Earth science community, ML has been most visible for weather forecasting, producing forecasts that rival modern physics-based models. Given the importance of deepening our understanding and improving predictions of the Earth system on all time scales, efforts are now underway to develop Earth-system models (ESMs) capable of representing all components of the coupled Earth system (or their aggregated behavior) and their response to external changes over long timescales. Building trust in ESMs is a much more difficult problem than for weather forecast models, not least because the model must represent the alternate (e.g., future or paleoclimatic) coupled states of the system for which there are no direct observations. Given that the physical principles that enable predictions about the response of the Earth system are often not explicitly coded in these ML-based models, demonstrating the credibility of ML-based ESMs thus requires us to build evidence of their consistency with the physical system. To this end, this paper puts forward five recommendations to enhance comprehensive, standardized, and independent evaluation of ML-based ESMs to strengthen their credibility and promote their wider use.

Original languageEnglish
Article numbere2024JH000496
JournalJournal of Geophysical Research: Machine Learning and Computation
Volume2
Issue number1
DOIs
StatePublished - Mar 2025
Externally publishedYes

Keywords

  • Earth system models
  • evaluation
  • idealized tests
  • machine learning
  • model credibility
  • testing

Fingerprint

Dive into the research topics of 'Recommendations for Comprehensive and Independent Evaluation of Machine Learning-Based Earth System Models'. Together they form a unique fingerprint.

Cite this