Systematic Benchmarking of Climate Models: Methodologies, Applications, and New Directions

  • Birgit Hassler
  • Forrest M. Hoffman
  • Rebecca Beadling
  • Ed Blockley
  • Bo Huang
  • Jiwoo Lee
  • Valerio Lembo
  • Jared Lewis
  • Jianhua Lu
  • Luke Madaus
  • Elizaveta Malinina
  • Brian Medeiros
  • Wilfried Pokam
  • Enrico Scoccimarro
  • Ranjini Swaminathan

Research output: Contribution to journal (review article, peer-reviewed)

Abstract

As climate models become increasingly complex, there is a growing need to comprehensively and systematically assess model performance with respect to observations. Given the increasing number and diversity of climate model simulations in use, the community has moved beyond simple model intercomparison and toward developing methods capable of benchmarking a large number of simulations against a suite of climate metrics. Here, we present a detailed review of evaluation and benchmarking methods and approaches developed in the last decade, focusing primarily on scientific implications for Coupled Model Intercomparison Project (CMIP) simulations and CMIP6 results that contributed to the Intergovernmental Panel on Climate Change (IPCC) Sixth Assessment Report (AR6). Based on this review, we explain the resulting contemporary philosophy of model benchmarking and provide clear distinctions between, and definitions of, the terms model verification, process validation, evaluation, and benchmarking. While significant progress has been made in model development based on systematic evaluation and benchmarking efforts, some climate system biases remain. The development of open-source community software packages has played a fundamental role in identifying areas of significant model improvement and bias reduction. We review the key features of several software packages that have been commonly used over the past decade to evaluate and benchmark global and regional climate models. Additionally, we discuss best practices for selecting evaluation and benchmarking metrics and for interpreting the obtained results, as well as the importance of selecting suitable sources of reference data and of accurate uncertainty quantification.

Original language: English
Article number: e2025RG000891
Journal: Reviews of Geophysics
Volume: 64
Issue number: 1
State: Published - Mar 2026
Externally published: Yes

Keywords

  • CMIP
  • climate models
  • model benchmarking
  • model evaluation
