RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

Ensuring high indoor air quality (IAQ) while minimizing energy consumption and preserving occupant comfort is a central challenge in building management systems. Traditional rule-based or single-objective controls often neglect dynamic fluctuations in pollution levels and occupant behavior, leading to suboptimal trade-offs. In this paper, we propose a context-aware Meta-Reinforcement Learning (Meta-RL) framework that simultaneously addresses multiple objectives-IAQ, energy efficiency, and comfort-under a variety of building configurations and disturbances (e.g., wildfires, equipment faults, occupancy surges). Our approach integrates a Transformer-based encoder for latent context extraction, a Meta-Pareto hypernetwork that generates diverse policies for user-driven preferences, and safety-constrained adaptation to maintain strict pollutant thresholds. Through extensive simulations using EnergyPlus and real-world calibration data, the proposed framework demonstrates (1) significantly lower IAQ violations compared to standard RL and rule-based baselines, (2) reduced energy usage while maintaining comfortable thermal conditions, and (3) rapid transfer to new buildings via few-shot meta-training. These findings underscore the potential of Meta-RL to deliver robust, flexible HVAC control solutions in complex, real-world indoor environments.

Context-Aware Meta-Reinforcement Learning for Intelligent Diverse Indoor HVAC Control

Liu, Songling;Li, Jing;Buganza, Tommaso

2025-01-01

Abstract

Ensuring high indoor air quality (IAQ) while minimizing energy consumption and preserving occupant comfort is a central challenge in building management systems. Traditional rule-based or single-objective controls often neglect dynamic fluctuations in pollution levels and occupant behavior, leading to suboptimal trade-offs. In this paper, we propose a context-aware Meta-Reinforcement Learning (Meta-RL) framework that simultaneously addresses multiple objectives-IAQ, energy efficiency, and comfort-under a variety of building configurations and disturbances (e.g., wildfires, equipment faults, occupancy surges). Our approach integrates a Transformer-based encoder for latent context extraction, a Meta-Pareto hypernetwork that generates diverse policies for user-driven preferences, and safety-constrained adaptation to maintain strict pollutant thresholds. Through extensive simulations using EnergyPlus and real-world calibration data, the proposed framework demonstrates (1) significantly lower IAQ violations compared to standard RL and rule-based baselines, (2) reduced energy usage while maintaining comfortable thermal conditions, and (3) rapid transfer to new buildings via few-shot meta-training. These findings underscore the potential of Meta-RL to deliver robust, flexible HVAC control solutions in complex, real-world indoor environments.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2025
			
	Titolo del libro
	
				2025 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)
			
	Titolo della collana
	
				PROCEEDINGS OF ... INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS
			
	Parole chiave
	
				Healthy Buildings
HVAC
Indoor Air Quality
Reinforcement Learning
			
	Appare nelle tipologie:
	
				04.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1311036

Citazioni

ND

0

0

social impact