Accurately estimating the Remaining Useful Life (RUL) of lithium-ion batteries using data-driven models requires large amounts of clean, diverse data. However, publicly available datasets are often collected under varying and limited testing conditions, which poses challenges for model training and generalization. To address this, we present a pipeline for cleaning, transforming, and integrating two distinct datasets covering four battery specifications and a wide range of test scenarios. We apply filtering and normalization techniques to address noise, outliers, and inconsistencies, and organize the cleaned data into a relational database. Building on this integrated dataset, we train a Conv-LSTM neural network and evaluate its performance against models trained on individual battery groups. Our results show that multisource training improves model generalization, particularly for smaller or more heterogeneous datasets, and can help enhance prediction accuracy in the later stages of battery life

A Pipeline for Lithium-Ion Battery Data Integration and Remaining Useful Life Estimation

L. Martiri;A. Moschetti;L. Cristaldi
2026-01-01

Abstract

Accurately estimating the Remaining Useful Life (RUL) of lithium-ion batteries using data-driven models requires large amounts of clean, diverse data. However, publicly available datasets are often collected under varying and limited testing conditions, which poses challenges for model training and generalization. To address this, we present a pipeline for cleaning, transforming, and integrating two distinct datasets covering four battery specifications and a wide range of test scenarios. We apply filtering and normalization techniques to address noise, outliers, and inconsistencies, and organize the cleaned data into a relational database. Building on this integrated dataset, we train a Conv-LSTM neural network and evaluate its performance against models trained on individual battery groups. Our results show that multisource training improves model generalization, particularly for smaller or more heterogeneous datasets, and can help enhance prediction accuracy in the later stages of battery life
2026
2025 IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering (MetroXRAINE)
data pipeline
database
data integration
RUL estimation
lithium-ion batteries
File in questo prodotto:
File Dimensione Formato  
A_Pipeline_for_Lithium-Ion_Battery_Data_Integration_and_Remaining_Useful_Life_Estimation.pdf

Accesso riservato

: Publisher’s version
Dimensione 347.07 kB
Formato Adobe PDF
347.07 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1304933
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact