Flexible operation and maintenance optimization of aging cyber-physical energy systems by deep reinforcement learning
Hao Z.;Di Maio F.;Zio E.
2023-01-01
Abstract
Cyber-Physical Energy Systems (CPESs) integrate cyber and hardware components to ensure reliable and safe physical power production and supply. Renewable Energy Sources (RESs) add uncertainty to energy demand, which can be dealt with by flexible operation (e.g., load-following) of the CPES; at the same time, scenarios that could result in severe consequences due to both stochastic component failures and aging of the CPES cyber system (commonly overlooked) must be accounted for in Operation & Maintenance (O&M) planning. In this paper, we use Deep Reinforcement Learning (DRL) to search for the optimal O&M strategy that considers not only the actual health conditions of the system hardware components and their Remaining Useful Life (RUL), but also the possible accident scenarios caused by the failures and the aging of the hardware and cyber components, respectively. The novelty of the work lies in embedding the cyber aging model into the CPES model of production planning and failure processes; this model helps the RL agent, trained with Proximal Policy Optimization (PPO) and Imitation Learning (IL), to find the proper rejuvenation timing for the cyber system while accounting for the uncertainty of the cyber system aging process. An application is provided with regard to the Advanced Lead-cooled Fast Reactor European Demonstrator (ALFRED).
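The O&M setting described in the abstract can be illustrated as a toy sequential decision problem. The sketch below is an assumption for illustration only, not the paper's actual CPES model: state is a hypothetical (hardware RUL, cyber aging indicator) pair, actions are continue / maintain hardware / rejuvenate the cyber software, and the degradation dynamics, costs, and failure probabilities are all made-up placeholder values.

```python
import random


class ToyCPESEnv:
    """Toy O&M environment (illustrative assumption, not the paper's model).

    State: (hardware RUL, cyber aging level in [0, 1]).
    Actions: 0 = continue production, 1 = maintain hardware, 2 = rejuvenate cyber software.
    """

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        self.rul = 20       # hypothetical hardware Remaining Useful Life (steps)
        self.aging = 0.0    # hypothetical cyber aging indicator
        self.done = False
        return (self.rul, self.aging)

    def step(self, action):
        assert not self.done
        reward = 1.0        # revenue from one step of power production
        if action == 1:     # hardware maintenance restores RUL, at a cost
            self.rul = 20
            reward -= 0.5
        elif action == 2:   # software rejuvenation resets cyber aging, at a cost
            self.aging = 0.0
            reward -= 0.3
        # degradation: RUL decreases; cyber aging grows stochastically
        self.rul -= 1
        self.aging = min(1.0, self.aging + self.rng.uniform(0.0, 0.1))
        # failure: hardware worn out, or a cyber-aging-induced accident
        if self.rul <= 0 or self.rng.random() < self.aging ** 4:
            reward -= 10.0
            self.done = True
        return (self.rul, self.aging), reward, self.done


def rollout(env_seed, rejuvenate, horizon=50):
    """Undiscounted return of a hard-coded threshold policy over one episode."""
    env = ToyCPESEnv(seed=env_seed)
    (rul, aging), total = env.reset(), 0.0
    for _ in range(horizon):
        if rul <= 2:
            action = 1                      # always maintain worn hardware
        elif rejuvenate and aging >= 0.5:
            action = 2                      # rejuvenate once aging is high
        else:
            action = 0
        (rul, aging), r, done = env.step(action)
        total += r
        if done:
            break
    return total
```

Averaging `rollout` over many seeds shows the rejuvenating policy outperforming the non-rejuvenating one in this toy setting; a DRL agent such as one trained with PPO (possibly warm-started by IL, as in the paper) would learn the rejuvenation timing from interaction instead of hard-coding a threshold.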
File | Description | Size | Format
---|---|---|---
1-s2.0-S1738573323005521-main.pdf | Publisher's version (open access) | 3.35 MB | Adobe PDF