RE.PUBLIC@POLIMI Research publications of Politecnico di Milano

Deep Surrogate Models of Serverless Batch Processing Services

Gao, Yicheng; Sala, Roberto; Ardagna, Danilo; Casale, Giuliano

2025-01-01

Abstract

Serverless computing breaks down applications into workflows of stateless functions, where the output of each function serves as the input for the next. When these workflows are distributed across multiple processing nodes, managing the interactions between nodes is crucial to maintaining overall system performance. However, existing analytical performance models do not cope well with the dependencies involved in distributed workflows when the offloaded jobs arrive in batches. In this paper, we study the problem for two processing resources in tandem and develop a scalable surrogate modeling approach based on neural networks that can be used for serverless resource management purposes. We validate our performance model with both synthetic and real-world AI application traces, demonstrating that our surrogate model achieves a mean absolute percentage error of about 5%.

Year of publication: 2025

Book title: Service-Oriented and Cloud Computing. ESOCC 2025

Series title: Lecture Notes in Computer Science

ISBN (International Standard Book Number): 9783031846168; 9783031846175

Keywords: Batch processing; Function-as-a-service; Performance measures; Stochastic model

Appears in types: 04.1 Contribution in conference proceedings

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11311/1286889
