TY - JOUR
T1 - Aviation turbulence forecasting at upper levels with machine learning techniques based on regression trees
AU - Muñoz-Esparza, Domingo
AU - Sharman, Robert D.
AU - Deierling, Wiebke
N1 - Publisher Copyright:
© 2020 American Meteorological Society.
PY - 2020/11
Y1 - 2020/11
N2 - We explore the use of machine learning (ML) techniques, namely, regression trees (RT), for the purpose of aviation turbulence forecasting at upper levels [20–45 kft (~6–14 km) in altitude]. In particular, we develop a series of RT-based algorithms that include random forests (RF) and gradient-boosted regression trees (GBRT) methods. Numerical weather prediction model prognostic variables and derived turbulence diagnostics based on 6-h forecasts from the 3-km High-Resolution Rapid Refresh model are used as features to train these data-driven models. Training and evaluation are based on turbulence estimates of eddy dissipation rate (EDR) obtained from automated in situ aircraft reports. Our baseline RF model, consisting of 100 trees with 30 layers of maximum depth, significantly reduces forecast errors for EDR < 0.1 m2/3 s-1 (which corresponds roughly to null and light turbulence) when compared with a simple regression model, increasing the probability of detection and in turn reducing the number of false alarms. Model complexity reduction via GBRT and feature-relevance analyses is performed, indicating that considerable execution speedups can be achieved while maintaining the model’s predictive skill. Overall, the ML models exhibit enhanced performance in discriminating the EDR forecast among the light, moderate, and severe turbulence categories. In addition, these artificial intelligence techniques significantly simplify the generation of new NWP and grid-spacing specific turbulence forecast products.
AB - We explore the use of machine learning (ML) techniques, namely, regression trees (RT), for the purpose of aviation turbulence forecasting at upper levels [20–45 kft (~6–14 km) in altitude]. In particular, we develop a series of RT-based algorithms that include random forests (RF) and gradient-boosted regression trees (GBRT) methods. Numerical weather prediction model prognostic variables and derived turbulence diagnostics based on 6-h forecasts from the 3-km High-Resolution Rapid Refresh model are used as features to train these data-driven models. Training and evaluation are based on turbulence estimates of eddy dissipation rate (EDR) obtained from automated in situ aircraft reports. Our baseline RF model, consisting of 100 trees with 30 layers of maximum depth, significantly reduces forecast errors for EDR < 0.1 m2/3 s-1 (which corresponds roughly to null and light turbulence) when compared with a simple regression model, increasing the probability of detection and in turn reducing the number of false alarms. Model complexity reduction via GBRT and feature-relevance analyses is performed, indicating that considerable execution speedups can be achieved while maintaining the model’s predictive skill. Overall, the ML models exhibit enhanced performance in discriminating the EDR forecast among the light, moderate, and severe turbulence categories. In addition, these artificial intelligence techniques significantly simplify the generation of new NWP and grid-spacing specific turbulence forecast products.
KW - Atmosphere
KW - Decision trees
KW - Forecasting techniques
KW - Regression
KW - Turbulence
UR - https://www.scopus.com/pages/publications/85096351283
U2 - 10.1175/JAMC-D-20-0116.1
DO - 10.1175/JAMC-D-20-0116.1
M3 - Article
AN - SCOPUS:85096351283
SN - 1558-8424
VL - 59
SP - 1883
EP - 1899
JO - Journal of Applied Meteorology and Climatology
JF - Journal of Applied Meteorology and Climatology
IS - 11
ER -