RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

Adaptive dynamic programming (ADP) technique is adopted in this work to investigate the optimal control problem of Markovian jump systems. By utilizing Bellman's optimality principle, a discrete Hamilton Jacobi Bellman (HJB) equation is established to design the optimal controller for the system under consideration. Then, based on value iteration, a new ADP algorithm is proposed for finding the solution of the established HJB equation. It is proven that the iterative solution sequence generated by the developed ADP iterative approach under zero initial values is monotonically convergent. Neural networks are constructed to accomplish the presented value iteration ADP algorithm. At last, simulation researches for two Markovian jump systems demonstrate the effectiveness of the proposed optimal control method.

Optimal control of Markovian jump systems via a neural network-based ADP iterative algorithm

Sun H. -J.;Zhang J. X.;Karimi H. R.

2022-01-01

Abstract

Adaptive dynamic programming (ADP) technique is adopted in this work to investigate the optimal control problem of Markovian jump systems. By utilizing Bellman's optimality principle, a discrete Hamilton Jacobi Bellman (HJB) equation is established to design the optimal controller for the system under consideration. Then, based on value iteration, a new ADP algorithm is proposed for finding the solution of the established HJB equation. It is proven that the iterative solution sequence generated by the developed ADP iterative approach under zero initial values is monotonically convergent. Neural networks are constructed to accomplish the presented value iteration ADP algorithm. At last, simulation researches for two Markovian jump systems demonstrate the effectiveness of the proposed optimal control method.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2022
			
	Titolo della rivista
	
				NEUROCOMPUTING
			
	Parole chiave
	
				ADP
Markovian jump systems
Optimal control
Value iteration
			
	Appare nelle tipologie:
	
				01.1 Articolo in Rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1205300

Citazioni

ND

11

9

social impact