Smart meter-driven remote auditing of buildings, as an alternative to the labor-intensive on-site visits, permits large-scale and rapid identification of buildings with low energy performance. The existing literature has mainly focused on electricity meters' data from a rather small set of buildings and efforts have often not been made to facilitate the models' physical interpretability. Accordingly, the present work focuses on the implementation and optimization of ML-based pipelines for building characterization (by use type (A), performance class (B), and operation group (C)) employing hourly electrical and chilled-water consumption data. Utilizing the Building Data Genome Project II dataset (with data from 1636 buildings), feature generation, feature selection, and pipeline optimization steps are performed for each pipeline. Results demonstrate that performing the latter two steps improves the model's accuracy (5.3%, 2.9%, and 3.9% for pipelines A, B, and C compared to a benchmark model), while notably reduces the number of utilized features (94.7%, 88.3%, 89.4%), enhancing the models' interpretability. Furthermore, adding features extracted from chilled-water consumption data boosts the accuracy (with respect to baseline) for the second subset by 12.4%, 13.5%, and 7.2%, while decreasing the feature count by 97.2%, 96.4%, and 96.5%, respectively.
Machine learning-based estimation of buildings' characteristics employing electrical and chilled water consumption data: Pipeline optimization
Najafi B.;Rinaldi F.
2023-01-01
Abstract
Smart meter-driven remote auditing of buildings, as an alternative to the labor-intensive on-site visits, permits large-scale and rapid identification of buildings with low energy performance. The existing literature has mainly focused on electricity meters' data from a rather small set of buildings and efforts have often not been made to facilitate the models' physical interpretability. Accordingly, the present work focuses on the implementation and optimization of ML-based pipelines for building characterization (by use type (A), performance class (B), and operation group (C)) employing hourly electrical and chilled-water consumption data. Utilizing the Building Data Genome Project II dataset (with data from 1636 buildings), feature generation, feature selection, and pipeline optimization steps are performed for each pipeline. Results demonstrate that performing the latter two steps improves the model's accuracy (5.3%, 2.9%, and 3.9% for pipelines A, B, and C compared to a benchmark model), while notably reduces the number of utilized features (94.7%, 88.3%, 89.4%), enhancing the models' interpretability. Furthermore, adding features extracted from chilled-water consumption data boosts the accuracy (with respect to baseline) for the second subset by 12.4%, 13.5%, and 7.2%, while decreasing the feature count by 97.2%, 96.4%, and 96.5%, respectively.File | Dimensione | Formato | |
---|---|---|---|
2023 4 Machine learning-based estimation of buildings’ characteristics employing electrical and chilled water consumption data Pipeline optimization.pdf
accesso aperto
:
Publisher’s version
Dimensione
6.11 MB
Formato
Adobe PDF
|
6.11 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.