We propose a technique for global optimization considering black-box cost function and constraints, which have to be learned from data during the optimization process, arising for example in plant-control co-design of complex systems or controller tuning based on experiments. Assuming Lipschitz continuity of the cost function and constraints, we build a surrogate model and derive tight bounds on such functions based on a Set Membership framework. An exploitation step is designed to improve on the current best feasible candidate solution, searching in regions where all constraints are estimated as fulfilled, thus preserving safety. On the other hand, an exploration routine aims to discover the shape of the cost and constraint functions by picking points with large uncertainty, prioritizing regions where more constraints are predictably satisfied. The proposed algorithm can intuitively trade-off safety, exploration, and exploitation. The performance is evaluated on the problem of model predictive control tuning for a servomechanism with plant uncertainties and task-level constraints.

Trading-off safety, exploration, and exploitation in learning-based optimization: A Set Membership approach

Sabug L.;Ruiz Palacios F.;Fagiano L.
2021-01-01

Abstract

We propose a technique for global optimization considering black-box cost function and constraints, which have to be learned from data during the optimization process, arising for example in plant-control co-design of complex systems or controller tuning based on experiments. Assuming Lipschitz continuity of the cost function and constraints, we build a surrogate model and derive tight bounds on such functions based on a Set Membership framework. An exploitation step is designed to improve on the current best feasible candidate solution, searching in regions where all constraints are estimated as fulfilled, thus preserving safety. On the other hand, an exploration routine aims to discover the shape of the cost and constraint functions by picking points with large uncertainty, prioritizing regions where more constraints are predictably satisfied. The proposed algorithm can intuitively trade-off safety, exploration, and exploitation. The performance is evaluated on the problem of model predictive control tuning for a servomechanism with plant uncertainties and task-level constraints.
2021
Proceedings of the IEEE Conference on Decision and Control
978-1-6654-3659-5
File in questo prodotto:
File Dimensione Formato  
smgo_d_cdc2021.pdf

accesso aperto

: Post-Print (DRAFT o Author’s Accepted Manuscript-AAM)
Dimensione 3.69 MB
Formato Adobe PDF
3.69 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1208002
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 0
social impact