Accurate forecasting of solar photovoltaic (PV) power is essential for grid stability and efficient energy management. This study investigates the impact of missing data, introduced through both random and block removal at varying percentages, on the performance of hybrid neural network architectures, Convolutional Neural Network-Long Short-Term Memory (CNN-LSTM), LSTM Autoencoder, and standalone LSTM models, for short-term PV power prediction. Using two real-world PV datasets representing semi-arid and desert climates, the focus is placed on non-parametric probabilistic forecasting, leveraging Kernel Density Estimation and quantile regression to generate confidence intervals that capture forecast uncertainty. Experimental results demonstrate that while all models experience performance degradation with increasing levels of missing data, hybrid architectures such as CNN-LSTM and LSTM Autoencoder exhibit greater robustness in capturing temporal patterns and maintaining forecast reliability. Notably, the CNN-LSTM model achieved a 19.35% reduction in quantile loss compared to the standalone LSTM under 30% random data removal. The findings underscore the importance of evaluating both predictive accuracy and uncertainty calibration under data scarcity scenarios to enable dependable solar energy forecasting and resilient energy management.

Quantile-Based Short-Term Probabilistic Forecasting of Solar PV Power Under Data Loss

Dhingra S.;Gruosso G.;Gajani G. S.
2025-01-01

Abstract

Accurate forecasting of solar photovoltaic (PV) power is essential for grid stability and efficient energy management. This study investigates the impact of missing data, introduced through both random and block removal at varying percentages, on the performance of hybrid neural network architectures, Convolutional Neural Network-Long Short-Term Memory (CNN-LSTM), LSTM Autoencoder, and standalone LSTM models, for short-term PV power prediction. Using two real-world PV datasets representing semi-arid and desert climates, the focus is placed on non-parametric probabilistic forecasting, leveraging Kernel Density Estimation and quantile regression to generate confidence intervals that capture forecast uncertainty. Experimental results demonstrate that while all models experience performance degradation with increasing levels of missing data, hybrid architectures such as CNN-LSTM and LSTM Autoencoder exhibit greater robustness in capturing temporal patterns and maintaining forecast reliability. Notably, the CNN-LSTM model achieved a 19.35% reduction in quantile loss compared to the standalone LSTM under 30% random data removal. The findings underscore the importance of evaluating both predictive accuracy and uncertainty calibration under data scarcity scenarios to enable dependable solar energy forecasting and resilient energy management.
2025
IECON Proceedings (Industrial Electronics Conference)
hybrid neural networks
kernel density estimation
missing data analysis
non-parametric probabilistic forecasting
quantile regression
Solar photovoltaic forecasting
File in questo prodotto:
File Dimensione Formato  
2025252303.pdf

Accesso riservato

: Post-Print (DRAFT o Author’s Accepted Manuscript-AAM)
Dimensione 3.37 MB
Formato Adobe PDF
3.37 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1302645
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact