Advancing industrial building energy measurement and verification (M&V) with deep learning: Evaluating data size and feature selection impact
This study explores the potential of advancing industrial building energy Measurement and Verification (M&V) using Deep Learning techniques, with a focus on the impact of data size and feature selection methods on model performance. Traditional M&V practices in the industry require extensive...
Saved in:
id |
my.utem.eprints.27838 |
---|---|
record_format |
eprints |
institution |
Universiti Teknikal Malaysia Melaka |
building |
UTEM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknikal Malaysia Melaka |
content_source |
UTEM Institutional Repository |
url_provider |
http://eprints.utem.edu.my/ |
language |
English |
description |
This study explores the potential of advancing industrial building energy Measurement and Verification (M&V) using Deep Learning techniques, with a focus on the impact of data size and feature selection methods on model performance. Traditional M&V practices in the industry require extensive datasets, typically spanning 24 months, to ensure accurate energy savings verification post energy conservation measures (ECMs). This research challenges the norm by investigating whether a condensed four-month dataset could suffice for reliable M&V, thereby accelerating the verification process which is crucial for industrial applications. Utilizing a dataset with 30-minute intervals, the study first applies multi-linear regression across twelve feature selection methods to
identify the most effective techniques based on a comprehensive set of performance metrics including Training
Time, RMSE, MAE, R-squared, CVRMSE, NMBE, Mean MSE (via k-fold cross-validation), and the Standard Deviation of MSE (over 10-fold cross-validation). The top three methods—LASSO regression, Sequential Feature Selector, and Recursive Feature Elimination—were further analyzed as input features for a Deep Neural Network (DNN) model. The DNN’s performance was evaluated across varying data sizes (20 %, 40 %, 50 %, 60 %, 80 %, and 100 %) and configurations ranging from one to ten hidden layers. The findings reveal that LASSO regression, when integrated with DNN, consistently outperforms in terms of CVRMSE and NMBE metrics across different data sizes and model complexities. Notably, increasing the dataset size from 20 % to 80 % markedly improves the model’s predictive accuracy, underscoring the significance of larger datasets in enhancing DNN generalization and performance. Additionally, the study highlights a critical trade-off in DNN architecture: while models with fewer hidden layers (1–3) show greater performance variability in smaller datasets, increasing the number of layers does not linearly translate to better performance, illustrating the nuanced balance between model complexity and efficacy. This research contributes to the energy M&V field by demonstrating that shorter duration datasets, previously considered insufficient, can indeed provide accurate energy savings verification when analyzed with advanced deep learning techniques. This advancement not only paves the way for quicker, more efficient M&V processes in industrial settings but also offers valuable insights into the optimization of DNN models for energy data analysis. |
format |
Article |
author |
Sukarti, Suziee Sulaima, Mohamad Fani Abdul Kadir, Aida Fazliana Shamsor, Muhamad Hafizul Siaw, Wei Yao |
spellingShingle |
Sukarti, Suziee Sulaima, Mohamad Fani Abdul Kadir, Aida Fazliana Shamsor, Muhamad Hafizul Siaw, Wei Yao Advancing industrial building energy measurement and verification (M&V) with deep learning: Evaluating data size and feature selection impact |
author_facet |
Sukarti, Suziee Sulaima, Mohamad Fani Abdul Kadir, Aida Fazliana Shamsor, Muhamad Hafizul Siaw, Wei Yao |
author_sort |
Sukarti, Suziee |
title |
Advancing industrial building energy measurement and verification (M&V) with deep learning: Evaluating data size and feature selection impact |
title_short |
Advancing industrial building energy measurement and verification (M&V) with deep learning: Evaluating data size and feature selection impact |
title_full |
Advancing industrial building energy measurement and verification (M&V) with deep learning: Evaluating data size and feature selection impact |
title_fullStr |
Advancing industrial building energy measurement and verification (M&V) with deep learning: Evaluating data size and feature selection impact |
title_full_unstemmed |
Advancing industrial building energy measurement and verification (M&V) with deep learning: Evaluating data size and feature selection impact |
title_sort |
advancing industrial building energy measurement and verification (m&v) with deep learning: evaluating data size and feature selection impact |
publisher |
Elsevier B.V. |
publishDate |
2024 |
url |
http://eprints.utem.edu.my/id/eprint/27838/2/019010708202416140971.pdf http://eprints.utem.edu.my/id/eprint/27838/ https://pdf.sciencedirectassets.com/271089/1-s2.0-S0378778824X00146/1-s2.0-S0378778824005735/main.pdf?X-Amz-Security-Token=IQoJb3JpZ2luX2VjECwaCXVzLWVhc3QtMSJHMEUCIQDcG%2Bb9nYlsdHlRIcRlqKFK%2FoQJdEigo3Bd6GFdujlzQgIgYjMnpIeMft8ac2zdmyi4nZfFCVtBSJJApq9cn9592rQqsgUIVBAFGgwwNTkwMDM1NDY4NjUiDGt%2F%2BEkXN5UO1IiUxiqPBbkCPc1Ep0fZ5nUrqpW5M4s8m1fbSn0qfKu6Pyz3BVg3j%2BgaNoq5fG5mR7pnaRVy86UbgnfHUWezmTwq9nqpUygnzreW%2BvepBil8wEu%2FYSZouZ5jW7YAeP7P0lTrCSusWQnr3myaL0%2BrKI3goAk7II9%2Flw5kXgFcpaOH1TNcNeeLf4EviBuycPv9Gc9zEG0rshph11CrBvjfx6qfKpF7soP2Ms4A0PJWHNTZTNvBfKq8HFyb9CElg8V2J565l0zvnNvSpYqCRGpnlKenPpOT0pEarsVlvMWIwVea9MRoTTLxOqdNE%2BlBG2G7HssSbIxBjDxeeh5txOF%2BSNgEWKVqiFHn%2BZP6F0Zy7nYRIl4F92L185xo6pBnBZUOPJB2yLxlIdDgyzA5Cip58vpX2DjDxC5%2FFD9%2FSio7KPmkzAkeTFFzE%2Bic3gUeUkcCTRsnmyG6l95AEaWj3NCHhGlKtxqNAniznDD5VUsLZUaAq01N%2FqbbI2zvicdYN4ptE8NwZN8JXcIBWDqkz9bfFVrMeTbjYDaiNvE1efV%2BpmE1pUTj%2B4KJd1iAVVTfqwySgu4qJ0iEp3qS975D1udtTyHpE4xIIwJcT%2FaEhsfiUKQFAmOugkWK5GozlAhsLuBkJ9QDdehgHaltD2wSylWHUmoA96O14xekfscIL4hBNqnYEkfsyI3xCik3EjfPGOcufSJicWtBfmazrF0bUKFErJ%2BQdJla1uFzpbKTNuDuLqyXFS6gO8q58ZBqSSksCGPbcdjCuDmp72%2BQ%2F0P7d%2B%2BC9FNAJnHN%2FgiF3B8tO7PnKwMGk0%2Bcs6h8n12h7rOUE7I6WjBT4E%2B4IdC7fe6zl7ql%2F64Dc%2FjQTo3WOpNbF8ZW6sPtxvFdhCMwjMz5tgY6sQGfjuqHk0lo4HyZGOcbNhCKcS2oqTgU0LTgqI4zHpKBghsWhsTEDKqjQoPnVAfX%2BU%2FArNz%2FGfXDzFvqLaTOY8YUFV9ICdCT%2BCyQRXJiPlSu6p%2B8foWvkAyVcOncPBd3%2FD%2BNOmGhrWSEr5%2BwRczoj7BeQvd90WIyEFEuFIVNwP4GuM7VbNsq39fMCq9bghrsOrKkLD796UmvULK%2FfxKTjQ8sEcgJRvgJvaQGQCO1qJD2iOQ%3D&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Date=20240909T034228Z&X-Amz-SignedHeaders=host&X-Amz-Expires=300&X-Amz-Credential=ASIAQ3PHCVTY63ICLMAN%2F20240909%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Signature=9e4b44b97844e9a32126b4920af16ebdf84d44426864ae9347d1a90e2b4049e4&hash=d84d07bea581989162b04b9a97a66964380ff51edf0002271d2eb6d2ead5d2cc&host=68042c943591013ac2b2430a89b270f6af2c76d8dfd086a07176afe7c76c2c61&pii=S0378778824005735&tid=spdf-bb010c5d-fe50-4a92-b875-bbbd668d0018&sid=9f1caed672cc6540619a5ce7501616f7ba80gxrqb&type=client&tsoh=d3d3LnNjaWVuY2VkaXJlY3QuY29t&ua=031d5a03575053075d&rr=8c042b210c639cb7&cc=my https://doi.org/10.1016/j.enbuild.2024.114457 |
_version_ |
1814061433575440384 |
spelling |
my.utem.eprints.278382024-10-09T16:30:28Z http://eprints.utem.edu.my/id/eprint/27838/ Advancing industrial building energy measurement and verification (M&V) with deep learning: Evaluating data size and feature selection impact Sukarti, Suziee Sulaima, Mohamad Fani Abdul Kadir, Aida Fazliana Shamsor, Muhamad Hafizul Siaw, Wei Yao This study explores the potential of advancing industrial building energy Measurement and Verification (M&V) using Deep Learning techniques, with a focus on the impact of data size and feature selection methods on model performance. Traditional M&V practices in the industry require extensive datasets, typically spanning 24 months, to ensure accurate energy savings verification post energy conservation measures (ECMs). This research challenges the norm by investigating whether a condensed four-month dataset could suffice for reliable M&V, thereby accelerating the verification process which is crucial for industrial applications. Utilizing a dataset with 30-minute intervals, the study first applies multi-linear regression across twelve feature selection methods to identify the most effective techniques based on a comprehensive set of performance metrics including Training Time, RMSE, MAE, R-squared, CVRMSE, NMBE, Mean MSE (via k-fold cross-validation), and the Standard Deviation of MSE (over 10-fold cross-validation). The top three methods—LASSO regression, Sequential Feature Selector, and Recursive Feature Elimination—were further analyzed as input features for a Deep Neural Network (DNN) model. The DNN’s performance was evaluated across varying data sizes (20 %, 40 %, 50 %, 60 %, 80 %, and 100 %) and configurations ranging from one to ten hidden layers. The findings reveal that LASSO regression, when integrated with DNN, consistently outperforms in terms of CVRMSE and NMBE metrics across different data sizes and model complexities. Notably, increasing the dataset size from 20 % to 80 % markedly improves the model’s predictive accuracy, underscoring the significance of larger datasets in enhancing DNN generalization and performance. Additionally, the study highlights a critical trade-off in DNN architecture: while models with fewer hidden layers (1–3) show greater performance variability in smaller datasets, increasing the number of layers does not linearly translate to better performance, illustrating the nuanced balance between model complexity and efficacy. This research contributes to the energy M&V field by demonstrating that shorter duration datasets, previously considered insufficient, can indeed provide accurate energy savings verification when analyzed with advanced deep learning techniques. This advancement not only paves the way for quicker, more efficient M&V processes in industrial settings but also offers valuable insights into the optimization of DNN models for energy data analysis. Elsevier B.V. 2024 Article PeerReviewed text en http://eprints.utem.edu.my/id/eprint/27838/2/019010708202416140971.pdf Sukarti, Suziee and Sulaima, Mohamad Fani and Abdul Kadir, Aida Fazliana and Shamsor, Muhamad Hafizul and Siaw, Wei Yao (2024) Advancing industrial building energy measurement and verification (M&V) with deep learning: Evaluating data size and feature selection impact. Energy and Buildings, 319. pp. 1-19. ISSN 1872-6178 https://pdf.sciencedirectassets.com/271089/1-s2.0-S0378778824X00146/1-s2.0-S0378778824005735/main.pdf?X-Amz-Security-Token=IQoJb3JpZ2luX2VjECwaCXVzLWVhc3QtMSJHMEUCIQDcG%2Bb9nYlsdHlRIcRlqKFK%2FoQJdEigo3Bd6GFdujlzQgIgYjMnpIeMft8ac2zdmyi4nZfFCVtBSJJApq9cn9592rQqsgUIVBAFGgwwNTkwMDM1NDY4NjUiDGt%2F%2BEkXN5UO1IiUxiqPBbkCPc1Ep0fZ5nUrqpW5M4s8m1fbSn0qfKu6Pyz3BVg3j%2BgaNoq5fG5mR7pnaRVy86UbgnfHUWezmTwq9nqpUygnzreW%2BvepBil8wEu%2FYSZouZ5jW7YAeP7P0lTrCSusWQnr3myaL0%2BrKI3goAk7II9%2Flw5kXgFcpaOH1TNcNeeLf4EviBuycPv9Gc9zEG0rshph11CrBvjfx6qfKpF7soP2Ms4A0PJWHNTZTNvBfKq8HFyb9CElg8V2J565l0zvnNvSpYqCRGpnlKenPpOT0pEarsVlvMWIwVea9MRoTTLxOqdNE%2BlBG2G7HssSbIxBjDxeeh5txOF%2BSNgEWKVqiFHn%2BZP6F0Zy7nYRIl4F92L185xo6pBnBZUOPJB2yLxlIdDgyzA5Cip58vpX2DjDxC5%2FFD9%2FSio7KPmkzAkeTFFzE%2Bic3gUeUkcCTRsnmyG6l95AEaWj3NCHhGlKtxqNAniznDD5VUsLZUaAq01N%2FqbbI2zvicdYN4ptE8NwZN8JXcIBWDqkz9bfFVrMeTbjYDaiNvE1efV%2BpmE1pUTj%2B4KJd1iAVVTfqwySgu4qJ0iEp3qS975D1udtTyHpE4xIIwJcT%2FaEhsfiUKQFAmOugkWK5GozlAhsLuBkJ9QDdehgHaltD2wSylWHUmoA96O14xekfscIL4hBNqnYEkfsyI3xCik3EjfPGOcufSJicWtBfmazrF0bUKFErJ%2BQdJla1uFzpbKTNuDuLqyXFS6gO8q58ZBqSSksCGPbcdjCuDmp72%2BQ%2F0P7d%2B%2BC9FNAJnHN%2FgiF3B8tO7PnKwMGk0%2Bcs6h8n12h7rOUE7I6WjBT4E%2B4IdC7fe6zl7ql%2F64Dc%2FjQTo3WOpNbF8ZW6sPtxvFdhCMwjMz5tgY6sQGfjuqHk0lo4HyZGOcbNhCKcS2oqTgU0LTgqI4zHpKBghsWhsTEDKqjQoPnVAfX%2BU%2FArNz%2FGfXDzFvqLaTOY8YUFV9ICdCT%2BCyQRXJiPlSu6p%2B8foWvkAyVcOncPBd3%2FD%2BNOmGhrWSEr5%2BwRczoj7BeQvd90WIyEFEuFIVNwP4GuM7VbNsq39fMCq9bghrsOrKkLD796UmvULK%2FfxKTjQ8sEcgJRvgJvaQGQCO1qJD2iOQ%3D&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Date=20240909T034228Z&X-Amz-SignedHeaders=host&X-Amz-Expires=300&X-Amz-Credential=ASIAQ3PHCVTY63ICLMAN%2F20240909%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Signature=9e4b44b97844e9a32126b4920af16ebdf84d44426864ae9347d1a90e2b4049e4&hash=d84d07bea581989162b04b9a97a66964380ff51edf0002271d2eb6d2ead5d2cc&host=68042c943591013ac2b2430a89b270f6af2c76d8dfd086a07176afe7c76c2c61&pii=S0378778824005735&tid=spdf-bb010c5d-fe50-4a92-b875-bbbd668d0018&sid=9f1caed672cc6540619a5ce7501616f7ba80gxrqb&type=client&tsoh=d3d3LnNjaWVuY2VkaXJlY3QuY29t&ua=031d5a03575053075d&rr=8c042b210c639cb7&cc=my https://doi.org/10.1016/j.enbuild.2024.114457 |
score |
13.211869 |