Ozone concentration forecasting utilizing leveraging of regression machine learnings: A case study at Klang Valley, Malaysia
At Klang Valley, ground-level ozone is a significant source of air pollution. Ozone (O3) concentration is affected by meteorological conditions and air pollutants. Linear Regression Models (LRM), Regression Trees (RT), Support Vector Machines (SVM), Ensembles of Trees (ET), Gaussian Process Regressi...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Published: |
Elsevier B.V.
2024
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/44780/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.um.eprints.44780 |
---|---|
record_format |
eprints |
spelling |
my.um.eprints.447802024-07-15T07:51:54Z http://eprints.um.edu.my/44780/ Ozone concentration forecasting utilizing leveraging of regression machine learnings: A case study at Klang Valley, Malaysia Latif, Sarmad Dashti Lai, Vivien Hahzaman, Farah Hazwani Ahmed, Ali Najah Huang, Yuk Feng Birima, Ahmed H. El-Shafie, Ahmed TA Engineering (General). Civil engineering (General) At Klang Valley, ground-level ozone is a significant source of air pollution. Ozone (O3) concentration is affected by meteorological conditions and air pollutants. Linear Regression Models (LRM), Regression Trees (RT), Support Vector Machines (SVM), Ensembles of Trees (ET), Gaussian Process Regression (GPR), and Neural Networks (NN) are utilized in a thorough analysis to determine the accuracy of various machine learning in forecasting the ground level O3 concentration. The primary associated contributions from this research are comparisons of regression statistical model performance based on indicators of root mean squared error (RMSE), coefficient of determination (R2), mean squared error (MSE), mean absolute error (MAE), prediction speed, and training time of regression models. Overall, exponential GPR outperformed other regression models in scenario 1 (S-1), scenario 2 (S-2), scenario (S-3), and scenario 4 (S-4) by incorporating multiple number of lags into respective scenarios and new method of testing “re-substitution” performed more reliable and consistent than applying identical datasets to 20 of model testing. The findings showed that GPR performed accurate results with R2 = 0.98, 0.95, 0.96, and 0.96 for S-1, S-2, S-3 and S-4 respectively. © 2024 The Authors Elsevier B.V. 2024 Article PeerReviewed Latif, Sarmad Dashti and Lai, Vivien and Hahzaman, Farah Hazwani and Ahmed, Ali Najah and Huang, Yuk Feng and Birima, Ahmed H. and El-Shafie, Ahmed (2024) Ozone concentration forecasting utilizing leveraging of regression machine learnings: A case study at Klang Valley, Malaysia. Results in Engineering, 21. ISSN 2590-1230, DOI https://doi.org/10.1016/j.rineng.2024.101872 <https://doi.org/10.1016/j.rineng.2024.101872>. 10.1016/j.rineng.2024.101872 |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Research Repository |
url_provider |
http://eprints.um.edu.my/ |
topic |
TA Engineering (General). Civil engineering (General) |
spellingShingle |
TA Engineering (General). Civil engineering (General) Latif, Sarmad Dashti Lai, Vivien Hahzaman, Farah Hazwani Ahmed, Ali Najah Huang, Yuk Feng Birima, Ahmed H. El-Shafie, Ahmed Ozone concentration forecasting utilizing leveraging of regression machine learnings: A case study at Klang Valley, Malaysia |
description |
At Klang Valley, ground-level ozone is a significant source of air pollution. Ozone (O3) concentration is affected by meteorological conditions and air pollutants. Linear Regression Models (LRM), Regression Trees (RT), Support Vector Machines (SVM), Ensembles of Trees (ET), Gaussian Process Regression (GPR), and Neural Networks (NN) are utilized in a thorough analysis to determine the accuracy of various machine learning in forecasting the ground level O3 concentration. The primary associated contributions from this research are comparisons of regression statistical model performance based on indicators of root mean squared error (RMSE), coefficient of determination (R2), mean squared error (MSE), mean absolute error (MAE), prediction speed, and training time of regression models. Overall, exponential GPR outperformed other regression models in scenario 1 (S-1), scenario 2 (S-2), scenario (S-3), and scenario 4 (S-4) by incorporating multiple number of lags into respective scenarios and new method of testing “re-substitution” performed more reliable and consistent than applying identical datasets to 20 of model testing. The findings showed that GPR performed accurate results with R2 = 0.98, 0.95, 0.96, and 0.96 for S-1, S-2, S-3 and S-4 respectively. © 2024 The Authors |
format |
Article |
author |
Latif, Sarmad Dashti Lai, Vivien Hahzaman, Farah Hazwani Ahmed, Ali Najah Huang, Yuk Feng Birima, Ahmed H. El-Shafie, Ahmed |
author_facet |
Latif, Sarmad Dashti Lai, Vivien Hahzaman, Farah Hazwani Ahmed, Ali Najah Huang, Yuk Feng Birima, Ahmed H. El-Shafie, Ahmed |
author_sort |
Latif, Sarmad Dashti |
title |
Ozone concentration forecasting utilizing leveraging of regression machine learnings: A case study at Klang Valley, Malaysia |
title_short |
Ozone concentration forecasting utilizing leveraging of regression machine learnings: A case study at Klang Valley, Malaysia |
title_full |
Ozone concentration forecasting utilizing leveraging of regression machine learnings: A case study at Klang Valley, Malaysia |
title_fullStr |
Ozone concentration forecasting utilizing leveraging of regression machine learnings: A case study at Klang Valley, Malaysia |
title_full_unstemmed |
Ozone concentration forecasting utilizing leveraging of regression machine learnings: A case study at Klang Valley, Malaysia |
title_sort |
ozone concentration forecasting utilizing leveraging of regression machine learnings: a case study at klang valley, malaysia |
publisher |
Elsevier B.V. |
publishDate |
2024 |
url |
http://eprints.um.edu.my/44780/ |
_version_ |
1805881166479228928 |
score |
13.211869 |