Real-world time series often present missing values due to sensor malfunctions or human errors. Traditionally, missing values are simply omitted or reconstructed through imputation or interpolation methods. Omitting missing values may cause temporal discontinuity. Reconstruction methods, on the other hand, alter in some way the original time series. In this paper, we consider an application in the field of meteorological variables that exploits end-to-end machine learning. The idea is to entrust the task of dealing with missing values to a suitably trained recurrent neural network that completely by-passes the phase of reconstruction of missing values. A difficult case of reproduction of a rainfall field from five rain gauges in Northern Italy is used as an example, and the results are compared to those computed by more traditional methods. The proposed methodology is general-purpose and can be easily applied to every kind of spatial time series prediction problem, quite common in many environmental studies.

Reconstructing Environmental Variables with Missing Field Data via End-to-End Machine Learning

M. Sangiorgio;S. Barindelli;V. Guglieri;G. Venuti;G. Guariso
2020-01-01

Abstract

Real-world time series often present missing values due to sensor malfunctions or human errors. Traditionally, missing values are simply omitted or reconstructed through imputation or interpolation methods. Omitting missing values may cause temporal discontinuity. Reconstruction methods, on the other hand, alter in some way the original time series. In this paper, we consider an application in the field of meteorological variables that exploits end-to-end machine learning. The idea is to entrust the task of dealing with missing values to a suitably trained recurrent neural network that completely by-passes the phase of reconstruction of missing values. A difficult case of reproduction of a rainfall field from five rain gauges in Northern Italy is used as an example, and the results are compared to those computed by more traditional methods. The proposed methodology is general-purpose and can be easily applied to every kind of spatial time series prediction problem, quite common in many environmental studies.
2020
Proceedings of the 21st EANN (Engineering Applications of Neural Networks) 2020 Conference
978-3-030-48791-1
Time Series
Rainfall field
Data Augmentation
Artificial Neural Networks
LSTM Cell
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1146282
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact