Lung cancer screening using low-dose computed tomography (LDCT) faces challenges in nodule classification and radiologist burden. This study introduces a novel approach leveraging degraded standard-dose CT (SDCT) images and develops an explainable radiomics-based model to improve lung nodule classification. Using data from LIDC-IDRI public dataset, our pipeline extracts and preprocesses 851 radiomic features per nodule, ensuring stability, discriminant characteristics, and non-redundancy. To address dataset imbalance, we applied data augmentation and random undersampling techniques. Varying the number of features selected by gradient-boosted decision tree across multiple classifiers, we identified an optimal combination of 15 features with random forest classifier, achieving balanced accuracy, sensitivity, specificity and AUC-ROC scores of 0.82 ± 0.04, 0.81 ± 0.08, 0.83 ± 0.05, 0.88 ± 0.05, respectively. Explainability analysis using SHAP highlights features influencing classification, including major axis length, sphericity, and wavelet-LHH-ngtdm-Busyness. In conclusion, our study demonstrates the potential of integrating degraded SDCT and radiomics to improve lung nodule classification.

Low-Dose CT-Based Radiomic Model to Identify Malignant Lung Nodules

Liu J.;Corti A.;Corino V.;Mainardi L.
2024-01-01

Abstract

Lung cancer screening using low-dose computed tomography (LDCT) faces challenges in nodule classification and radiologist burden. This study introduces a novel approach leveraging degraded standard-dose CT (SDCT) images and develops an explainable radiomics-based model to improve lung nodule classification. Using data from LIDC-IDRI public dataset, our pipeline extracts and preprocesses 851 radiomic features per nodule, ensuring stability, discriminant characteristics, and non-redundancy. To address dataset imbalance, we applied data augmentation and random undersampling techniques. Varying the number of features selected by gradient-boosted decision tree across multiple classifiers, we identified an optimal combination of 15 features with random forest classifier, achieving balanced accuracy, sensitivity, specificity and AUC-ROC scores of 0.82 ± 0.04, 0.81 ± 0.08, 0.83 ± 0.05, 0.88 ± 0.05, respectively. Explainability analysis using SHAP highlights features influencing classification, including major axis length, sphericity, and wavelet-LHH-ngtdm-Busyness. In conclusion, our study demonstrates the potential of integrating degraded SDCT and radiomics to improve lung nodule classification.
2024
2024 IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering, MetroXRAINE 2024 - Proceedings
low-dose computed tomography (LDCT)
lung cancer
nodule classification
radiomics
File in questo prodotto:
File Dimensione Formato  
2024Liu_MetroXRain.pdf

embargo fino al 25/12/2025

Descrizione: 2024Liu_PostPrint
: Post-Print (DRAFT o Author’s Accepted Manuscript-AAM)
Dimensione 1.09 MB
Formato Adobe PDF
1.09 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1283686
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact