Physics-Based Region Clustering to Boost Inference on Computational Fluid Dynamics Flow Fields

Margheritti, Riccardo; Semeraro, Onofrio; Quadrio, Maurizio; Boracchi, Giacomo

doi:10.1007/978-3-032-06129-4_1

The high dimensionality and variability of Computational Fluid Dynamics (CFD) data pose a significant challenge for Machine Learning (ML) models. The only solutions in the literature addressing inference from CFD flow fields are based on expert-driven features, which consist of fluid dynamic quantities averaged on specific regions of the entire computational domain. However, using handcrafted features can limit the scalability and portability of existing methods, and result in the loss of critical flow field information that might be essential for capturing non-linear patterns inherent in the CFD data. We propose a method to replace handcrafted features with features defined on regions obtained by clustering. Our approach combines: i) physics-based clustering, to identify meaningful regions within the flow field, ii) cluster-based feature extraction, to capture localized fluid dynamics properties, and iii) set-learning models to process the extracted information. Our solution allows integrating physics-based modeling with ML, and provides a portable and flexible pipeline capable of effectively dealing with the variability and dimensionality of CFD flow fields. We validate our method on publicly available CFD datasets (from the aerospace domain) and apply it to a realistic scenario, that is, the classification of pathologies in real 3D human upper airways extracted from CT scans, acquired in collaboration with a medical hospital. Experimental results demonstrate the accuracy and scalability of our method, and highlight its potential for leveraging CFD data in ML frameworks for other scientific and engineering applications.
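The three stages named in the abstract (clustering, cluster-based feature extraction, set learning) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not specify the clustering algorithm, the features, or the set-learning model, so everything below is an assumption made for illustration. Plain k-means stands in for the physics-based clustering, per-cluster mean and standard deviation stand in for the extracted features, and a mean/max pooling stands in for a permutation-invariant set model. The function names `kmeans`, `cluster_features`, and `pool_set` are hypothetical, and the flow field is synthetic random data.

```python
import numpy as np


def kmeans(X, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means (an assumed stand-in for physics-based clustering).

    Returns one integer label per row of X.
    """
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Squared distance from every point to every center, shape (n, k).
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members):  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return labels


def cluster_features(fields, labels, k):
    """One descriptor per non-empty cluster: mean, std, and share of points."""
    rows = []
    for j in range(k):
        mask = labels == j
        if not mask.any():
            continue
        members = fields[mask]
        rows.append(
            np.concatenate([members.mean(axis=0), members.std(axis=0), [mask.mean()]])
        )
    return np.stack(rows)


def pool_set(features):
    """Permutation-invariant summary of a set of cluster descriptors."""
    return np.concatenate([features.mean(axis=0), features.max(axis=0)])


# Synthetic stand-in for a flow field: 500 cells, 3 physical quantities per cell
# (hypothetically pressure, velocity magnitude, vorticity).
rng = np.random.default_rng(0)
fields = rng.normal(size=(500, 3))

# Standardize so that no single quantity dominates the distance computation.
z = (fields - fields.mean(axis=0)) / fields.std(axis=0)

labels = kmeans(z, k=4)
feats = cluster_features(fields, labels, k=4)
descriptor = pool_set(feats)
```

Because the pooling step ignores the order of the cluster descriptors, the resulting vector has a fixed length regardless of how many cells each flow field contains, which is one way to cope with the variable mesh sizes the abstract refers to. A learned set model would replace `pool_set` in practice.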

2026-01-01

Publication year: 2026
Book title: Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track and Demo Track
Series title: Lecture Notes in Computer Science
ISBN (International Standard Book Number): 9783032061287; 9783032061294
Keywords: Computational Fluid Dynamics; Features Extraction; Machine Learning; Physics-Based Clustering; Set Learning
Publication type: 04.1 Contribution in conference proceedings

Files in this record:

File	Size	Format
MARGR07-25.pdf (open access, publisher's version)	2 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11311/1301495

RE.PUBLIC@POLIMI, research publications of the Politecnico di Milano
