RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

This paper proposes an aperiodic control approach based on hierarchical reinforcement learning for the longitudinal speed tracking control of unmanned sightseeing vehicles in scenic areas. The method develops the MAXQ hierarchical framework to decompose the speed tracking task into a series of subtasks, including learning the control inputs and the triggering instants. This allows each subtask to focus on a specific local issue, independently learning and optimising, making the learning process simpler and more efficient. In the learning of triggering instants, an event-triggered strategy is incorporated, which greatly reduces the controller update rate, thereby reducing communication and resources occupation. In the learning of control inputs, the reward function not only considers tracking performance, but also considers both inter-vehicle distance and acceleration change rate, which ensures tracking accuracy while enhancing driving safety and comfort. Finally, the efficacy of the proposed method is validated through a simulation study.

Aperiodic speed tracking control of unmanned sightseeing vehicles based on hierarchical reinforcement learning

Wan, Haiying;Tian, Yingqiang;Wang, Chenyang;Luan, Xiaoli;Karimi, Hamid Reza;Liu, Fei

2025-01-01

Abstract

This paper proposes an aperiodic control approach based on hierarchical reinforcement learning for the longitudinal speed tracking control of unmanned sightseeing vehicles in scenic areas. The method develops the MAXQ hierarchical framework to decompose the speed tracking task into a series of subtasks, including learning the control inputs and the triggering instants. This allows each subtask to focus on a specific local issue, independently learning and optimising, making the learning process simpler and more efficient. In the learning of triggering instants, an event-triggered strategy is incorporated, which greatly reduces the controller update rate, thereby reducing communication and resources occupation. In the learning of control inputs, the reward function not only considers tracking performance, but also considers both inter-vehicle distance and acceleration change rate, which ensures tracking accuracy while enhancing driving safety and comfort. Finally, the efficacy of the proposed method is validated through a simulation study.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2025
			
	Titolo della rivista
	
				INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE
			
	Parole chiave
	
				event-triggered strategy; hierarchical reinforcement learning; longitudinal speed tracking control; MAXQ hierarchical framework; Unmanned sightseeing vehicles;
			
	Appare nelle tipologie:
	
				01.1 Articolo in Rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1288223

Citazioni

ND

0

0

social impact