RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

Effective Cyber Threat Intelligence (CTI) is crucial for proactive defense; however, the high costs and technical complexity of commercial solutions make them inaccessible to the majority of Small and Medium Enterprises (SMEs). Furthermore, public CTI is predominantly available as unstructured text, making manual processing unbearable for resource-constrained organizations. To address this gap, we present a unified, low-cost pipeline that ingests, processes, and presents actionable CTI to Small and Medium Enterprises (SMEs) through a Retrieval-Augmented Generation (RAG) conversational interface. The system automatically crawls public CTI sources and integrates them with up-to-date Common Vulnerability Exposure (CVE) repositories to ensure coverage of structured and unstructured threat data. To enhance scalability compared to existing solutions, we transform raw reports into a knowledge graph that facilitates efficient querying and contextual retrieval. The system finally combines graph-derived intelligence with organization-specific software configurations to generate tailored, accessible threat guidance from a Large Language Model (LLM). We evaluate the pipeline through large-scale ingestion of 49,747 public threat documents and the latest CVE records, alongside a survey of security experts and an unsupervised assessment. Results show a substantial improvement in response completeness and relevance compared to a baseline language model, achieving a median answer relevance score of 93.7\%, while remaining interpretable and actionable for non-specialists. This work demonstrates that structured CTI extraction combined with lightweight generative interfaces can enable cost-effective cybersecurity awareness for SMEs.

Enhancing CTI Awareness of Small and Medium Enterprises with LLMs and Knowledge Graphs

Francesco Panebianco;Nicolò Amigoni;Gabriele Ruini;Tommaso Paladini;Stefano Longari;Stefano Zanero;Michele Carminati

2026-01-01

Abstract

Effective Cyber Threat Intelligence (CTI) is crucial for proactive defense; however, the high costs and technical complexity of commercial solutions make them inaccessible to the majority of Small and Medium Enterprises (SMEs). Furthermore, public CTI is predominantly available as unstructured text, making manual processing unbearable for resource-constrained organizations. To address this gap, we present a unified, low-cost pipeline that ingests, processes, and presents actionable CTI to Small and Medium Enterprises (SMEs) through a Retrieval-Augmented Generation (RAG) conversational interface. The system automatically crawls public CTI sources and integrates them with up-to-date Common Vulnerability Exposure (CVE) repositories to ensure coverage of structured and unstructured threat data. To enhance scalability compared to existing solutions, we transform raw reports into a knowledge graph that facilitates efficient querying and contextual retrieval. The system finally combines graph-derived intelligence with organization-specific software configurations to generate tailored, accessible threat guidance from a Large Language Model (LLM). We evaluate the pipeline through large-scale ingestion of 49,747 public threat documents and the latest CVE records, alongside a survey of security experts and an unsupervised assessment. Results show a substantial improvement in response completeness and relevance compared to a baseline language model, achieving a median answer relevance score of 93.7\%, while remaining interpretable and actionable for non-specialists. This work demonstrates that structured CTI extraction combined with lightweight generative interfaces can enable cost-effective cybersecurity awareness for SMEs.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2026
			
	Titolo del libro
	
				Proceedings of the Joint National Conference on Cybersecurity (ITASEC & SERICS 2026)
			
	Parole chiave
	
				Cyber Threat Intelligence,Small and Medium Enterprises,Large Language Models,Knowledge Graphs,Natural Language,Artificial Intelligence
			
	Appare nelle tipologie:
	
				04.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
_ITASEC__Google_org_CTI_Tool.pdf accesso aperto : Post-Print (DRAFT o Author’s Accepted Manuscript-AAM) Dimensione 571.65 kB Formato Adobe PDF Visualizza/Apri	571.65 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1307365

Citazioni

ND

ND

ND

ND

social impact