RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

Earth Observation increasingly uses machine learning to evaluate and monitor the environment. However, the potential of deep learning for studying wilderness is an under-explored frontier. This study aims to give insights into using different architectures (ResNet18, ResNet50, U-Net, DeepLabV3, and FCN), batch sizes (small, medium, and large), and spectral setups (RGB, RGB+NIR, full spectrum) for the classification and semantic segmentation of Sentinel-2 images. The focus is on optimising performance over accuracy using limited computational resources and pre-trained networks widely from the AI community. Experiments are performed on the AnthroProtect dataset, which was developed explicitly for this purpose. Results show that when computation resources are a concern, ResNet18 with 64 or 256 batch size is an optimal configuration for image classification. The U-Net is a sub-optimal solution for semantic segmentation, but our experiments did not identify a clear optimality for the batch size. Finally, different spectral setups highlight no significant impact on the data processing, thus raising critical thinking on the usefulness of neural networks in Earth Observation that are pre-trained with generic data like ImageNet, which is widely used in the AI community.

The Potential of Deep Learning for Studying Wilderness with Copernicus Sentinel-2 Data: Some Critical Insights

Vallarino, Gaia;Genzano, Nicola;Gianinetto, Marco

2025-01-01

Abstract

Earth Observation increasingly uses machine learning to evaluate and monitor the environment. However, the potential of deep learning for studying wilderness is an under-explored frontier. This study aims to give insights into using different architectures (ResNet18, ResNet50, U-Net, DeepLabV3, and FCN), batch sizes (small, medium, and large), and spectral setups (RGB, RGB+NIR, full spectrum) for the classification and semantic segmentation of Sentinel-2 images. The focus is on optimising performance over accuracy using limited computational resources and pre-trained networks widely from the AI community. Experiments are performed on the AnthroProtect dataset, which was developed explicitly for this purpose. Results show that when computation resources are a concern, ResNet18 with 64 or 256 batch size is an optimal configuration for image classification. The U-Net is a sub-optimal solution for semantic segmentation, but our experiments did not identify a clear optimality for the batch size. Finally, different spectral setups highlight no significant impact on the data processing, thus raising critical thinking on the usefulness of neural networks in Earth Observation that are pre-trained with generic data like ImageNet, which is widely used in the AI community.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2025
			
	Titolo della rivista
	
				LAND
			
	Parole chiave
	
				satellite images, artificial intelligence, optimisation, image classification, semantic segmentation
			
	Appare nelle tipologie:
	
				01.1 Articolo in Rivista

File in questo prodotto:

File	Dimensione	Formato
land-14-02333.pdf accesso aperto Descrizione: Vallarino_et_al_2025 : Publisher’s version Dimensione 3.91 MB Formato Adobe PDF Visualizza/Apri	3.91 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1301586

Citazioni

ND

ND

ND

social impact