RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

This work presents a novel approach to temperature control in buildings using Meta-Reinforcement Learning (Meta-RL) to address uncertainties in building thermal dynamics. The proposed method utilizes a control-oriented model to train a Meta-RL controller in simulation across a wide range of building dynamics, generated by varying the most influential parameters of the building dynamics within large uncertainty ranges. As a result, the Meta-RL controller learns adaptable control strategies tailored to the dynamic characteristics within the whole uncertainty region. Initially, an RL agent is employed to create a suitable encoding of thermal dynamics parameters. During the deployment, an Adaptation Module substitutes the RL agent inferring the right encoding from real-time data. The paper provides details on the specifically designed modeling and control architecture and its parametrization. Extensive validation demonstrates that the developed Meta-RL architecture is able to effectively and quickly adapt to diverse building dynamics, is as efficient as a standard dedicated MPC control, and finally is also able to quickly restrict the uncertainty ranges during deployment.

A Fast Adaptive Temperature Control Approach for Uncertain Building Models via Meta-RL

Mastrangelo, Bruno Maria;Valentini, Alberto;Ferrarini, Luca

2025-01-01

Abstract

This work presents a novel approach to temperature control in buildings using Meta-Reinforcement Learning (Meta-RL) to address uncertainties in building thermal dynamics. The proposed method utilizes a control-oriented model to train a Meta-RL controller in simulation across a wide range of building dynamics, generated by varying the most influential parameters of the building dynamics within large uncertainty ranges. As a result, the Meta-RL controller learns adaptable control strategies tailored to the dynamic characteristics within the whole uncertainty region. Initially, an RL agent is employed to create a suitable encoding of thermal dynamics parameters. During the deployment, an Adaptation Module substitutes the RL agent inferring the right encoding from real-time data. The paper provides details on the specifically designed modeling and control architecture and its parametrization. Extensive validation demonstrates that the developed Meta-RL architecture is able to effectively and quickly adapt to diverse building dynamics, is as efficient as a standard dedicated MPC control, and finally is also able to quickly restrict the uncertainty ranges during deployment.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2025
			
	Titolo del libro
	
				2025 IEEE Conference on Control Technology and Applications, CCTA 2025
			
	Appare nelle tipologie:
	
				04.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1309599

Citazioni

ND

0

0

social impact