This paper investigates a safe optimal control method for time-delay nonlinear systems based on multiplayer Stackelberg-Nash games (SNGs). Initially, considering different roles of players in the multiplayer SNGs, a hierarchical decision-making process is described as the designed value functions for the leader and all followers. Next, this paper incorporates a control barrier function into the value function to ensure that the system states remain within a safe range, thereby guaranteeing safety while achieving the optimization. Meanwhile, new cost functions are constructed which include the Lyapunov-Krasovskii (L-K) function to eliminate time-delay effects, and a single critic neural network is used to approximate the optimal controller for the leader and the N-followers. Based on the Lyapunov stability theory, it is proven that all signals in the closed-loop system are uniformly ultimately bounded (UUB). Finally, a simulation example is provided to demonstrate the effectiveness of the proposed optimal control method.

Safe optimal control for multiplayer Stackelberg–Nash games of nonlinear time-delay systems via adaptive dynamic programming

Karimi, Hamid Reza;
2025-01-01

Abstract

This paper investigates a safe optimal control method for time-delay nonlinear systems based on multiplayer Stackelberg-Nash games (SNGs). Initially, considering different roles of players in the multiplayer SNGs, a hierarchical decision-making process is described as the designed value functions for the leader and all followers. Next, this paper incorporates a control barrier function into the value function to ensure that the system states remain within a safe range, thereby guaranteeing safety while achieving the optimization. Meanwhile, new cost functions are constructed which include the Lyapunov-Krasovskii (L-K) function to eliminate time-delay effects, and a single critic neural network is used to approximate the optimal controller for the leader and the N-followers. Based on the Lyapunov stability theory, it is proven that all signals in the closed-loop system are uniformly ultimately bounded (UUB). Finally, a simulation example is provided to demonstrate the effectiveness of the proposed optimal control method.
2025
Adaptive dynamic programming; Safety and optimal robust control; Stackelberg-Nash games; Time delay;
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1310793
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 30
  • ???jsp.display-item.citation.isi??? 34
social impact