TY - JOUR
T1 - Exploring NWS Forecasters’ Assessment of AI Guidance Trustworthiness
AU - Cains, Mariana G.
AU - Wirz, Christopher D.
AU - Demuth, Julie L.
AU - Bostrom, Ann
AU - Gagne, David John
AU - McGovern, Amy
AU - Sobash, Ryan A.
AU - Madlambayan, Deianna
N1 - Publisher Copyright:
© 2024 American Meteorological Society.
PY - 2024/8
Y1 - 2024/8
N2 - As artificial intelligence (AI) methods are increasingly used to develop new guidance intended for operational use by forecasters, it is critical to evaluate whether forecasters deem the guidance trustworthy. Past trust-related AI research suggests that certain attributes (e.g., understanding how the AI was trained, interactivity, and performance) con-tribute to users perceiving the AI as trustworthy. However, little research has been done to examine the role of these and other attributes for weather forecasters. In this study, we conducted 16 online interviews with National Weather Service (NWS) forecasters to examine (i) how they make guidance use decisions and (ii) how the AI model technique used, training, input variables, performance, and developers as well as interacting with the model output influenced their assessments of trustworthiness of new guidance. The interviews pertained to either a random forest model predicting the probability of severe hail or a 2D convolutional neural network model predicting the probability of storm mode. When taken as a whole, our findings illustrate how forecasters’ assessment of AI guidance trustworthiness is a process that occurs over time rather than automatically or at first introduction. We recommend developers center end users when creating new AI guidance tools, making end users integral to their thinking and efforts. This approach is essential for the development of useful and used tools. The details of these findings can help AI developers understand how forecasters perceive AI guidance and in-form AI development and refinement efforts.
AB - As artificial intelligence (AI) methods are increasingly used to develop new guidance intended for operational use by forecasters, it is critical to evaluate whether forecasters deem the guidance trustworthy. Past trust-related AI research suggests that certain attributes (e.g., understanding how the AI was trained, interactivity, and performance) con-tribute to users perceiving the AI as trustworthy. However, little research has been done to examine the role of these and other attributes for weather forecasters. In this study, we conducted 16 online interviews with National Weather Service (NWS) forecasters to examine (i) how they make guidance use decisions and (ii) how the AI model technique used, training, input variables, performance, and developers as well as interacting with the model output influenced their assessments of trustworthiness of new guidance. The interviews pertained to either a random forest model predicting the probability of severe hail or a 2D convolutional neural network model predicting the probability of storm mode. When taken as a whole, our findings illustrate how forecasters’ assessment of AI guidance trustworthiness is a process that occurs over time rather than automatically or at first introduction. We recommend developers center end users when creating new AI guidance tools, making end users integral to their thinking and efforts. This approach is essential for the development of useful and used tools. The details of these findings can help AI developers understand how forecasters perceive AI guidance and in-form AI development and refinement efforts.
KW - Artificial intelligence
KW - Decision-making
KW - Forecasting
KW - Machine learning
KW - Model evaluation/performance
KW - Social Science
UR - https://www.scopus.com/pages/publications/85206880659
U2 - 10.1175/WAF-D-23-0180.1
DO - 10.1175/WAF-D-23-0180.1
M3 - Article
AN - SCOPUS:85206880659
SN - 0882-8156
VL - 39
SP - 1219
EP - 1241
JO - Weather and Forecasting
JF - Weather and Forecasting
IS - 8
ER -