RE.PUBLIC@POLIMI Research Publications of Politecnico di Milano

Leveraging LLMs to Automate Software Architecture Design from Informal Specifications

Tagliaferro, Alberto; Corbo, Simone; Guindani, Bruno

2025-01-01

Abstract

Designing a software architecture starting from specifications and requirements is a time-consuming and error-prone process that demands domain expertise. Automating this process has become a significant research focus in software engineering. Traditional approaches rely on rule-based mechanisms to translate manually derived, standardized requirements into the desired architecture. However, these methods struggle to identify implicit patterns without expert intervention. Recently, approaches leveraging Large Language Models (LLMs) have gained attention. This study evaluates the performance of LLMs in generating software architecture blueprints, specifically UML component diagrams, from informal natural-language specifications. We develop a formal characterization of component diagrams to derive quantitative metrics for analyzing LLM-generated diagrams, comparing them against expert-drawn ground truths associated with the specifications. Our findings indicate that while LLM-based approaches show promise in addressing the flaws of rule-based methods, they currently lack the accuracy needed for deployment in real-world scenarios.

Year of publication: 2025

Book title: 2025 IEEE 22nd International Conference on Software Architecture Companion (ICSA-C)

Appears in types: 04.1 Contribution in conference proceedings

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11311/1292266
