Cost estimation in building industy largely relies on manually extracting and classifying textual descriptions, a process susceptible to human error. Although recent advancements in Large Language Models (LLMs) hold promise, their application in this domain requires further investigation. This study proposes a methodology to optimize LLM performance validated through the development of a tool that classifies cost descriptions into a three-level hierarchical taxonomy and extracts relevant information organising the data in a database as output. Results demonstrate a F1 score of 0.96 on both tasks contributing to cost estimation automation, reducing manual processing, and enhancing knowledge management within the domain.
Methodology for classifying and extracting information with LLM: application on cost estimation case
C. Gatto;C. Mirarchi;A. Pavan
In corso di stampa
Abstract
Cost estimation in building industy largely relies on manually extracting and classifying textual descriptions, a process susceptible to human error. Although recent advancements in Large Language Models (LLMs) hold promise, their application in this domain requires further investigation. This study proposes a methodology to optimize LLM performance validated through the development of a tool that classifies cost descriptions into a three-level hierarchical taxonomy and extracts relevant information organising the data in a database as output. Results demonstrate a F1 score of 0.96 on both tasks contributing to cost estimation automation, reducing manual processing, and enhancing knowledge management within the domain.| File | Dimensione | Formato | |
|---|---|---|---|
|
Methodology_for_classifying_and_extracting_information_with_LLM__application_on_cost_estimation_case_.pdf
accesso aperto
:
Post-Print (DRAFT o Author’s Accepted Manuscript-AAM)
Dimensione
1.43 MB
Formato
Adobe PDF
|
1.43 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


