Machine learning-based estimation of buildings' characteristics employing electrical and chilled water consumption data: Pipeline optimization

Najafi B.;Rinaldi F.
2023-01-01

Abstract

Smart meter-driven remote auditing of buildings, as an alternative to labor-intensive on-site visits, permits large-scale and rapid identification of buildings with low energy performance. The existing literature has mainly focused on electricity meter data from rather small sets of buildings and has often made little effort to facilitate the models' physical interpretability. Accordingly, the present work focuses on the implementation and optimization of ML-based pipelines for building characterization (by use type (A), performance class (B), and operation group (C)) employing hourly electrical and chilled-water consumption data. Feature generation, feature selection, and pipeline optimization steps are performed for each pipeline on the Building Data Genome Project II dataset (with data from 1636 buildings). Results demonstrate that performing the latter two steps improves the models' accuracy (by 5.3%, 2.9%, and 3.9% for pipelines A, B, and C, respectively, compared to a benchmark model) while notably reducing the number of utilized features (by 94.7%, 88.3%, and 89.4%), which enhances the models' interpretability. Furthermore, adding features extracted from chilled-water consumption data boosts the accuracy (with respect to baseline) for the second subset by 12.4%, 13.5%, and 7.2%, while decreasing the feature count by 97.2%, 96.4%, and 96.5%, respectively.
Commercial buildings classification, Feature extraction, Feature selection, Machine learning, Pipeline optimization, Smart meter.
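
The feature generation and feature selection steps named in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than the authors' method: the feature names, the 08:00 to 18:00 "daytime" window, the Fisher-score filter used for ranking, and the toy load profiles (which are synthetic and not taken from the Building Data Genome Project II dataset).

```python
from statistics import mean, pvariance


def load_features(hourly, hours_per_day=24):
    """Summarize one building's hourly consumption as a dict of features."""
    if not hourly or len(hourly) % hours_per_day != 0:
        raise ValueError("expected a whole number of days of hourly readings")
    days = [hourly[i:i + hours_per_day]
            for i in range(0, len(hourly), hours_per_day)]
    avg = mean(hourly)
    peak = max(hourly)
    # Hours 8..17 count as "daytime"; this window is an illustrative assumption.
    day_avg = mean(v for d in days for v in d[8:18])
    night_avg = mean(v for d in days for v in d[:8] + d[18:])
    return {
        "mean_load": avg,
        "peak_load": peak,
        "load_factor": avg / peak if peak else 0.0,
        "day_night_ratio": day_avg / night_avg if night_avg else 0.0,
    }


def fisher_score(values, labels):
    """Between-class scatter divided by within-class scatter for one feature."""
    overall = mean(values)
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(label, []).append(value)
    between = sum(len(g) * (mean(g) - overall) ** 2 for g in groups.values())
    within = sum(len(g) * pvariance(g) for g in groups.values())
    if within == 0:
        return float("inf") if between > 0 else 0.0
    return between / within


def select_top_k(rows, labels, k):
    """Rank features by Fisher score and keep the k most discriminative."""
    scores = {name: fisher_score([row[name] for row in rows], labels)
              for name in rows[0]}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Synthetic two-day profiles (hypothetical values, arbitrary energy units).
office_a = ([2.0] * 8 + [10.0] * 10 + [2.0] * 6) * 2
office_b = ([3.0] * 8 + [12.0] * 10 + [3.0] * 6) * 2
lodging_a = ([5.0] * 8 + [6.0] * 10 + [5.0] * 6) * 2
lodging_b = ([4.0] * 8 + [5.0] * 10 + [4.0] * 6) * 2

rows = [load_features(p) for p in (office_a, office_b, lodging_a, lodging_b)]
labels = ["office", "office", "lodging", "lodging"]
selected = select_top_k(rows, labels, k=2)
print(selected)
```

A filter of this kind keeps only the features that separate the classes well, which is one simple way to obtain the smaller, more interpretable feature sets the abstract reports; the actual selection and pipeline optimization procedures are described in the paper itself.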
Files in this record:
2023 4 Machine learning-based estimation of buildings’ characteristics employing electrical and chilled water consumption data Pipeline optimization.pdf
Open access, publisher's version, Adobe PDF, 6.11 MB

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11311/1245380