Abstract
Reinforcement learning (RL)-based heating, ventilation, and air
conditioning (HVAC) control has emerged as a promising technology for
reducing building energy consumption while maintaining indoor thermal
comfort. However, the efficacy of such strategies is influenced by the
background climate and their implementation may potentially alter both
the indoor climate and local urban climate. This study proposes an
integrated framework combining RL with an urban climate model that
incorporates a building energy model, aiming to evaluate the efficacy of
RL-based HVAC control across different background climates, impacts of
RL strategies on indoor climate and local urban climate, and the
transferability of RL strategies across cities. Our findings reveal that
the reward (defined as a weighted combination of energy consumption and
thermal comfort) and the impacts of RL strategies on indoor climate and
local urban climate exhibit marked variability across cities with
different background climates. The sensitivity of reward weights and the
transferability of RL strategies are also strongly influenced by the
background climate. Cities in hot climates tend to achieve higher
rewards across most reward weight configurations that balance energy
consumption and thermal comfort, and those cities with more varying
atmospheric temperatures demonstrate greater RL strategy
transferability. These findings underscore the importance of thoroughly
evaluating RL-based HVAC control strategies in diverse climatic
contexts. This study also provides a new insight that city-to-city
learning will potentially aid the deployment of RL-based HVAC control.
| Original language | English |
|---|---|
| Journal | arXiv |
| State | E-pub ahead of print - May 1 2025 |
Keywords
- Machine Learning
- Artificial Intelligence
- Atmospheric and Oceanic Physics