Spatio-temporal areal data can be seen as a collection of time series which are spatially correlated according to a specific neighboring structure. Incorporating the temporal and spatial dimension into a statistical model poses challenges regarding the underlying theoretical framework as well as the implementation of efficient computational methods. We propose to include spatio-temporal random effects using a conditional autoregressive prior, where the temporal correlation is modeled through an autoregressive mean decomposition and the spatial correlation by the precision matrix inheriting the neighboring structure. Their joint distribution constitutes a Gaussian Markov random field, whose sparse precision matrix enables the usage of efficient sampling algorithms. We cluster the areal units using a nonparametric prior, thereby learning latent partitions of the areal units. The performance of the model is assessed via an application to study regional unemployment patterns in Italy. When compared to other spatial and spatio-temporal competitors, the proposed model shows more precise estimates and the additional information obtained from the clustering allows for an extended economic interpretation of the unemployment rates of the Italian provinces.
Bayesian modeling and clustering for spatio-temporal areal data: an application to Italian unemployment
A. Cadonna;A. Guglielmi;
2022-01-01
Abstract
Spatio-temporal areal data can be seen as a collection of time series which are spatially correlated according to a specific neighboring structure. Incorporating the temporal and spatial dimension into a statistical model poses challenges regarding the underlying theoretical framework as well as the implementation of efficient computational methods. We propose to include spatio-temporal random effects using a conditional autoregressive prior, where the temporal correlation is modeled through an autoregressive mean decomposition and the spatial correlation by the precision matrix inheriting the neighboring structure. Their joint distribution constitutes a Gaussian Markov random field, whose sparse precision matrix enables the usage of efficient sampling algorithms. We cluster the areal units using a nonparametric prior, thereby learning latent partitions of the areal units. The performance of the model is assessed via an application to study regional unemployment patterns in Italy. When compared to other spatial and spatio-temporal competitors, the proposed model shows more precise estimates and the additional information obtained from the clustering allows for an extended economic interpretation of the unemployment rates of the Italian provinces.File | Dimensione | Formato | |
---|---|---|---|
2206.10509.pdf
accesso aperto
Descrizione: Arxiv paper
:
Pre-Print (o Pre-Refereeing)
Dimensione
4.85 MB
Formato
Adobe PDF
|
4.85 MB | Adobe PDF | Visualizza/Apri |
11311-1228084_Cadonna.pdf
accesso aperto
:
Publisher’s version
Dimensione
5.03 MB
Formato
Adobe PDF
|
5.03 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.