Calibration of machine learning–based probabilistic hail predictions for operational forecasting

Amanda Burke, Nathan Snook, David John Gagne, Sarah McCorkle, Amy McGovern

Research output: Contribution to journalArticlepeer-review

51 Scopus citations

Abstract

In this study, we use machine learning (ML) to improve hail prediction by postprocessing numerical weather prediction (NWP) data from the new High-Resolution Ensemble Forecast system, version 2 (HREFv2). Multiple operational models and ensembles currently predict hail, however ML models are more computationally efficient and do not require the physical assumptions associated with explicit predictions. Calibrating the ML-based predictions toward familiar forecaster output allows for a combination of higher skill associated with ML models and increased forecaster trust in the output. The observational dataset used to train and verify the random forest model is the Maximum Estimated Size of Hail (MESH), a Multi-Radar Multi-Sensor (MRMS) product. To build trust in the predictions, the ML-based hail predictions are calibrated using isotonic regression. The target datasets for isotonic regression include the local storm reports and Storm Prediction Center (SPC) practically perfect data. Verification of the ML predictions indicates that the probability magnitudes output from the calibrated models closely resemble the day-1 SPC outlook and practically perfect data. The ML model calibrated toward the local storm reports exhibited better or similar skill to the uncalibrated predictions, while decreasing model bias. Increases in reliability and skill after calibration may increase forecaster trust in the automated hail predictions.

Original languageEnglish
Pages (from-to)149-168
Number of pages20
JournalWeather and Forecasting
Volume35
Issue number1
DOIs
StatePublished - Feb 2020

Keywords

  • Classification
  • Data science
  • Decision trees
  • Operational forecasting
  • Regression
  • Short-range prediction

Fingerprint

Dive into the research topics of 'Calibration of machine learning–based probabilistic hail predictions for operational forecasting'. Together they form a unique fingerprint.

Cite this