RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

This paper proposes a novel approach that applies state-of-the-art concepts in reinforcement learning (RL) to the optimal control of human papillomavirus (HPV) infection. The methodology transforms the nonlinear optimal control problem into a constrained nonlinear programming problem, thus allowing effective application of the RL algorithms. This approach combines Hamilton–Jacobi–Bellman (HJB) equations with actor–critic neural networks and control barrier functions to obtain an adaptive strategy for optimal vaccination and screening against HPV infection. A key innovation is the Sophia optimizer with experience replay, addressing the critical need for online data application in infectious disease control. Unlike the traditional methods that rely on the accumulation of extensive data, this approach utilizes experience replay to learn and adapt continuously, hence giving practical solutions for diseases like HPV where waiting for data is not practical or desirable. Experience replay helps to store and reuse past experience, hence improving the learning efficiency and stability of the system. This is an important feature for online applications to make sure that an RL model responds quickly enough to changing epidemiological conditions. Numerical simulations demonstrate the effectiveness of this approach in minimizing HPV prevalence and optimizing resource allocation. This research offers significant insights into the application of advanced control strategies in infectious disease management, highlighting the potential of RL to address complex epidemiological challenges. The ability to apply these techniques to online underscores the importance of adaptive and responsive strategies in public health.

Towards optimal control of HPV model using safe reinforcement learning with actor–critic neural networks

Amirabadi, Roya Khalili;Fard, Omid S.;Farimani, Mohsen Jalaeian

2025-12-01

Abstract

This paper proposes a novel approach that applies state-of-the-art concepts in reinforcement learning (RL) to the optimal control of human papillomavirus (HPV) infection. The methodology transforms the nonlinear optimal control problem into a constrained nonlinear programming problem, thus allowing effective application of the RL algorithms. This approach combines Hamilton–Jacobi–Bellman (HJB) equations with actor–critic neural networks and control barrier functions to obtain an adaptive strategy for optimal vaccination and screening against HPV infection. A key innovation is the Sophia optimizer with experience replay, addressing the critical need for online data application in infectious disease control. Unlike the traditional methods that rely on the accumulation of extensive data, this approach utilizes experience replay to learn and adapt continuously, hence giving practical solutions for diseases like HPV where waiting for data is not practical or desirable. Experience replay helps to store and reuse past experience, hence improving the learning efficiency and stability of the system. This is an important feature for online applications to make sure that an RL model responds quickly enough to changing epidemiological conditions. Numerical simulations demonstrate the effectiveness of this approach in minimizing HPV prevalence and optimizing resource allocation. This research offers significant insights into the application of advanced control strategies in infectious disease management, highlighting the potential of RL to address complex epidemiological challenges. The ability to apply these techniques to online underscores the importance of adaptive and responsive strategies in public health.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				dic-2025
			
	Titolo della rivista
	
				EXPERT SYSTEMS WITH APPLICATIONS
			
	Parole chiave
	
				Actor–critic neural network
Control barrier function
Experience replay
HPV model
Nonlinear systems
Optimal control
Reinforcement learning
Safety
			
	Appare nelle tipologie:
	
				01.1 Articolo in Rivista

File in questo prodotto:

File	Dimensione	Formato
DRL_HPV_1.pdf Accesso riservato Dimensione 7.4 MB Formato Adobe PDF Visualizza/Apri	7.4 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1279336

Citazioni

ND

0

ND

social impact