RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

Hybrid electric vehicles (HEVs) encompass diverse powertrain configurations and serve varied purposes. Commonly, energy management strategies (EMSs) have been developed separately for individual vehicle types and powertrain configurations under specific operating scenarios, often lacking generalizability across vehicle models and operating scenarios. To fill this gap, we propose a unified deep reinforcement learning (DRL) EMS based on meta-learning and online hard sample mining. This strategy enables adaptation to diverse vehicle types and powertrain configurations with minimal sample training through online fine-tuning. Firstly, meta-reinforcement learning is employed to simultaneously learn EMS for multiple vehicle types across various operating scenarios, establishing a base-learner capable of achieving satisfactory performance with minor adjustments when confronted with new configurations and operating scenarios. Furthermore, to mitigate the slow convergence associated with training multiple vehicle types and operating scenarios concurrently, hard sample mining method is used to optimize the presentation of random operating scenarios during training. This entails recording poorly performing conditions during training and prioritizing the training of simpler conditions before advancing to more challenging ones, thereby enhancing training efficiency through a scientifically informed approach. Additionally, we validate the proposed EMS on a simulated vehicle emulator. Results demonstrate a significant improvement in convergence efficiency, with respective enhancements of 40% in convergence efficiency while achieving comparable final performance metrics.

A unified deep reinforcement learning energy management strategy for multi-powertrain vehicles based on meta learning and hard sample mining

Chen, Xiaokai;Wu, Zhiming;Karimi, Hamid Reza;Li, Qianhui;Li, Zhengyu

2025-01-01

Abstract

Hybrid electric vehicles (HEVs) encompass diverse powertrain configurations and serve varied purposes. Commonly, energy management strategies (EMSs) have been developed separately for individual vehicle types and powertrain configurations under specific operating scenarios, often lacking generalizability across vehicle models and operating scenarios. To fill this gap, we propose a unified deep reinforcement learning (DRL) EMS based on meta-learning and online hard sample mining. This strategy enables adaptation to diverse vehicle types and powertrain configurations with minimal sample training through online fine-tuning. Firstly, meta-reinforcement learning is employed to simultaneously learn EMS for multiple vehicle types across various operating scenarios, establishing a base-learner capable of achieving satisfactory performance with minor adjustments when confronted with new configurations and operating scenarios. Furthermore, to mitigate the slow convergence associated with training multiple vehicle types and operating scenarios concurrently, hard sample mining method is used to optimize the presentation of random operating scenarios during training. This entails recording poorly performing conditions during training and prioritizing the training of simpler conditions before advancing to more challenging ones, thereby enhancing training efficiency through a scientifically informed approach. Additionally, we validate the proposed EMS on a simulated vehicle emulator. Results demonstrate a significant improvement in convergence efficiency, with respective enhancements of 40% in convergence efficiency while achieving comparable final performance metrics.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2025
			
	Titolo della rivista
	
				CONTROL ENGINEERING PRACTICE
			
	Parole chiave
	
				Deep reinforcement learning; Energy management strategy; Hybrid electric vehicle; Meta learning;
			
	Appare nelle tipologie:
	
				01.1 Articolo in Rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1310782

Citazioni

ND

3

3

ND

social impact