Hybrid electric vehicles (HEVs) encompass diverse powertrain configurations and serve varied purposes. Commonly, energy management strategies (EMSs) have been developed separately for individual vehicle types and powertrain configurations under specific operating scenarios, often lacking generalizability across vehicle models and operating scenarios. To fill this gap, we propose a unified deep reinforcement learning (DRL) EMS based on meta-learning and online hard sample mining. This strategy enables adaptation to diverse vehicle types and powertrain configurations with minimal sample training through online fine-tuning. Firstly, meta-reinforcement learning is employed to simultaneously learn EMS for multiple vehicle types across various operating scenarios, establishing a base-learner capable of achieving satisfactory performance with minor adjustments when confronted with new configurations and operating scenarios. Furthermore, to mitigate the slow convergence associated with training multiple vehicle types and operating scenarios concurrently, hard sample mining method is used to optimize the presentation of random operating scenarios during training. This entails recording poorly performing conditions during training and prioritizing the training of simpler conditions before advancing to more challenging ones, thereby enhancing training efficiency through a scientifically informed approach. Additionally, we validate the proposed EMS on a simulated vehicle emulator. Results demonstrate a significant improvement in convergence efficiency, with respective enhancements of 40% in convergence efficiency while achieving comparable final performance metrics.
A unified deep reinforcement learning energy management strategy for multi-powertrain vehicles based on meta learning and hard sample mining
Karimi, Hamid Reza;
2025-01-01
Abstract
Hybrid electric vehicles (HEVs) encompass diverse powertrain configurations and serve varied purposes. Commonly, energy management strategies (EMSs) have been developed separately for individual vehicle types and powertrain configurations under specific operating scenarios, often lacking generalizability across vehicle models and operating scenarios. To fill this gap, we propose a unified deep reinforcement learning (DRL) EMS based on meta-learning and online hard sample mining. This strategy enables adaptation to diverse vehicle types and powertrain configurations with minimal sample training through online fine-tuning. Firstly, meta-reinforcement learning is employed to simultaneously learn EMS for multiple vehicle types across various operating scenarios, establishing a base-learner capable of achieving satisfactory performance with minor adjustments when confronted with new configurations and operating scenarios. Furthermore, to mitigate the slow convergence associated with training multiple vehicle types and operating scenarios concurrently, hard sample mining method is used to optimize the presentation of random operating scenarios during training. This entails recording poorly performing conditions during training and prioritizing the training of simpler conditions before advancing to more challenging ones, thereby enhancing training efficiency through a scientifically informed approach. Additionally, we validate the proposed EMS on a simulated vehicle emulator. Results demonstrate a significant improvement in convergence efficiency, with respective enhancements of 40% in convergence efficiency while achieving comparable final performance metrics.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


