RE.PUBLIC@POLIMI Research Publications of Politecnico di Milano

Safe optimal control for multiplayer Stackelberg–Nash games of nonlinear time-delay systems via adaptive dynamic programming

Zhao, Junzheng;Karimi, Hamid Reza;Niu, Ben;Zhao, Xudong

2025-01-01

Abstract

This paper investigates a safe optimal control method for time-delay nonlinear systems based on multiplayer Stackelberg-Nash games (SNGs). First, to account for the different roles of the players in a multiplayer SNG, the hierarchical decision-making process is captured by value functions designed for the leader and for each follower. Next, a control barrier function is incorporated into the value function to keep the system states within a safe range, so that safety is guaranteed while the optimization is carried out. New cost functions that include a Lyapunov-Krasovskii (L-K) functional are then constructed to eliminate time-delay effects, and a single critic neural network is used to approximate the optimal controllers of the leader and the N followers. Based on Lyapunov stability theory, it is proven that all signals in the closed-loop system are uniformly ultimately bounded (UUB). Finally, a simulation example demonstrates the effectiveness of the proposed optimal control method.
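The idea of folding a barrier term into the cost can be illustrated with a minimal sketch. This is not the paper's formulation: the scalar state, the symmetric safe range `|x| < limit`, the logarithmic barrier, and the names `barrier`, `running_cost`, `q`, and `r` are all assumptions chosen for illustration only.

```python
import math


def barrier(x, limit):
    """Illustrative log barrier (an assumption, not the paper's function).

    Equals 0 at x = 0 and grows without bound as |x| approaches limit.
    """
    if abs(x) >= limit:
        return math.inf  # outside the assumed safe range
    return -math.log(1.0 - (x / limit) ** 2)


def running_cost(x, u, limit, q=1.0, r=1.0):
    """Quadratic state/control cost plus the barrier penalty."""
    return q * x * x + r * u * u + barrier(x, limit)


if __name__ == "__main__":
    # The penalty rises sharply as the state nears the boundary at 2.0.
    for x in (0.0, 1.0, 1.9, 1.99):
        print(x, running_cost(x, u=0.0, limit=2.0))
```

Because the penalty diverges at the boundary, a controller that minimizes the accumulated cost is discouraged from steering the state toward the edge of the safe range, which is the general intuition behind barrier-augmented value functions.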

Year of publication: 2025

Journal title: NEUROCOMPUTING

Keywords: Adaptive dynamic programming; Safety and optimal robust control; Stackelberg-Nash games; Time delay

Appears in types: 01.1 Journal Article

Files in this record:

No files are associated with this record.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11311/1310793
