Precision UAV formation control via PGPE-enhanced NMPC
Olivieri, Pierriccardo; Sanchini, Andrea; Gatti, Nicola; Formentin, Simone
2025-01-01
Abstract
In formation control for unmanned aerial vehicles (UAVs), a fleet of drones is arranged in a predefined geometric configuration that must be maintained throughout the flight, while avoiding collisions with other drones and obstacles. In real-world applications, the need for quick deployment of UAV fleets often makes controller parameter tuning a significant challenge. In this paper, we introduce an end-to-end formation controller based on Nonlinear Model Predictive Control (NMPC), enhanced by a reinforcement learning algorithm for optimal hyperparameter tuning. Specifically, we adapt the Policy Gradient with Parameter-based Exploration (PGPE) algorithm to the formation control context. This method offers a fast and scalable solution for parameter tuning that does not require a differentiable controller and can be customized to the specific needs of the deployer. To validate our approach, we conduct simulation experiments using a realistic quadrotor model in a three-dimensional environment with static obstacles. Our results demonstrate the effectiveness and advantages of our method in comparison to state-of-the-art algorithms.
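The abstract notes that PGPE tunes NMPC hyperparameters without requiring a differentiable controller: parameters are sampled from a Gaussian, each sample is scored by a full rollout, and the distribution itself is updated. The sketch below shows the basic PGPE update in that black-box setting; it is illustrative only, with `mock_formation_cost` standing in for an actual NMPC rollout of the paper's formation controller, and all names and step sizes are assumptions rather than the authors' implementation.

```python
import numpy as np

def pgpe_tune(reward_fn, mu0, sigma0, iters=200, pop=20,
              lr_mu=0.2, lr_sigma=0.1, seed=0):
    """Basic PGPE: tune black-box controller hyperparameters by sampling
    them from an independent Gaussian N(mu, sigma^2) and following the
    parameter-based policy gradient. reward_fn is treated as a black box,
    so no gradient of the controller is ever needed."""
    rng = np.random.default_rng(seed)
    mu = np.array(mu0, dtype=float)
    sigma = np.array(sigma0, dtype=float)
    for _ in range(iters):
        # Sample a population of parameter vectors around the current mean.
        eps = rng.standard_normal((pop, mu.size)) * sigma
        thetas = mu + eps
        # Score each candidate with one (simulated) rollout.
        rewards = np.array([reward_fn(t) for t in thetas])
        adv = rewards - rewards.mean()          # baseline-subtracted returns
        scale = np.abs(adv).sum() + 1e-12       # normalize the step size
        # Parameter-based gradient estimates for mean and std.
        mu = mu + lr_mu * (adv @ eps) / scale
        sigma = sigma + lr_sigma * (adv @ ((eps**2 - sigma**2) / sigma)) / scale
        sigma = np.maximum(sigma, 1e-6)         # keep exploration noise valid
    return mu, sigma

# Hypothetical stand-in for an NMPC rollout: reward is highest when the
# tuned weights reach (2.0, 0.5); a real deployment would run the
# formation-control simulation here instead.
def mock_formation_cost(theta):
    return -np.sum((theta - np.array([2.0, 0.5]))**2)

mu, sigma = pgpe_tune(mock_formation_cost, mu0=[0.0, 0.0], sigma0=[1.0, 1.0])
```

Because only sampled rewards enter the update, the same loop applies whether the rollout is a cheap surrogate or a full quadrotor simulation with obstacles, which is what makes the approach fast to deploy.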


