: Biobank-scale imaging provides an unprecedented opportunity to characterise how thousands of organ phenotypes vary in populations. However, deriving specific phenotypes from imaging data requires time-consuming expert annotation, limiting scalability. In this study, we develop a 3D diffusion autoencoder to derive latent phenotypes from temporally resolved cardiac MRI data of 71,017 UK Biobank participants. These phenotypes are reproducible, heritable (h2 = [4-18%]), and significantly associated with cardiometabolic traits. To establish the genetic basis of such traits, we perform a genome-wide association study, identifying 89 significant common variants (P < 2.3 × 10-9) across 42 loci, including seven novel loci. Extensive multi-trait colocalisation analyses (PP.H4 > 0.8) link variants across phenotypic scales, from intermediate cardiac traits to cardiac disease endpoints. In conclusion, this study showcases the use of diffusion autoencoding methods as powerful tools for unsupervised phenotyping, genetic discovery and disease risk prediction using cardiac MRI data.

Hundreds of cardiac MRI traits derived using 3D diffusion autoencoders share a common genetic architecture

Soda, Emanuel M.;Ieva, Francesca;
2026-01-01

Abstract

: Biobank-scale imaging provides an unprecedented opportunity to characterise how thousands of organ phenotypes vary in populations. However, deriving specific phenotypes from imaging data requires time-consuming expert annotation, limiting scalability. In this study, we develop a 3D diffusion autoencoder to derive latent phenotypes from temporally resolved cardiac MRI data of 71,017 UK Biobank participants. These phenotypes are reproducible, heritable (h2 = [4-18%]), and significantly associated with cardiometabolic traits. To establish the genetic basis of such traits, we perform a genome-wide association study, identifying 89 significant common variants (P < 2.3 × 10-9) across 42 loci, including seven novel loci. Extensive multi-trait colocalisation analyses (PP.H4 > 0.8) link variants across phenotypic scales, from intermediate cardiac traits to cardiac disease endpoints. In conclusion, this study showcases the use of diffusion autoencoding methods as powerful tools for unsupervised phenotyping, genetic discovery and disease risk prediction using cardiac MRI data.
2026
File in questo prodotto:
File Dimensione Formato  
s41467-026-74575-y_reference.pdf

accesso aperto

: Post-Print (DRAFT o Author’s Accepted Manuscript-AAM)
Dimensione 59.53 MB
Formato Adobe PDF
59.53 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1319195
Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact