In the oncogenomics domain, there is an increasing interest and need for computational methodologies that harness multidimensional patient characterizations. These methods aim to decipher the growing availability and heterogeneity of omics and annotation data at our disposal. In this study, we propose a two-phase workflow designed for data integration and knowledge extraction, applied to the hot clinical topic of breast cancer subtyping and employing Non-negative Matrix Tri-Factorization (NMTF). This technique decomposes a non-negative data matrix into three lower-rank matrices for exploration and pattern discovery. Our NMTF-based innovative strategy jointly analysed association matrices of a multi-partite network including different omics data of breast cancer patients and their corresponding subtypes, and found a network reconstruction able to maximize correct patient-subtype predictions. Based on this optimized reconstruction, the workflow second stage builds subtype-specific subnetworks and latent representations of all their nodes to establish associations also between subtypes and involved omics data. Our approach can delve into latent patterns and intricate relationships spanning various input data within a multidimensional framework. Thus, it not only sheds fresh light on BRCA subtypes but also offers adaptability for analogous classification efforts on similar clinical issues.

Inferring Breast Cancer Subtype Associations Using an Original Omics Integration Based on Non-negative Matrix Tri-Factorization

Cascianelli, Silvia;Ceddia, Gaia;Masseroli, Marco
2025-01-01

Abstract

In the oncogenomics domain, there is an increasing interest and need for computational methodologies that harness multidimensional patient characterizations. These methods aim to decipher the growing availability and heterogeneity of omics and annotation data at our disposal. In this study, we propose a two-phase workflow designed for data integration and knowledge extraction, applied to the hot clinical topic of breast cancer subtyping and employing Non-negative Matrix Tri-Factorization (NMTF). This technique decomposes a non-negative data matrix into three lower-rank matrices for exploration and pattern discovery. Our NMTF-based innovative strategy jointly analysed association matrices of a multi-partite network including different omics data of breast cancer patients and their corresponding subtypes, and found a network reconstruction able to maximize correct patient-subtype predictions. Based on this optimized reconstruction, the workflow second stage builds subtype-specific subnetworks and latent representations of all their nodes to establish associations also between subtypes and involved omics data. Our approach can delve into latent patterns and intricate relationships spanning various input data within a multidimensional framework. Thus, it not only sheds fresh light on BRCA subtypes but also offers adaptability for analogous classification efforts on similar clinical issues.
2025
COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, CIBB 2023
9783031907135
9783031907142
Cancer subtyping
Latent representations
Non-negative Matrix Tri-Factorization
Omics data integration
File in questo prodotto:
File Dimensione Formato  
C38_CIBB_2023_LNBI_2025_255-272.pdf

Accesso riservato

: Publisher’s version
Dimensione 795.77 kB
Formato Adobe PDF
795.77 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1310285
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact