Model Prediction Of Pm2.5 And Pm10 Using Machine Learning Approach

This study developed multi-input-single-output (MISO) and multi-input-multi-output (MIMO) models using an artificial neural network in MATLAB to predict the concentrations of PM2.5 and PM10 from meteorological parameters. For the purpose of this research, the...

Bibliographic Details
Main Author: Hamid, Norfarhanah
Format: Monograph
Language: English
Published: Universiti Sains Malaysia 2021
Subjects: T Technology; TP Chemical Technology
Online Access: http://eprints.usm.my/54691/1/Model%20Prediction%20Of%20Pm2.5%20And%20Pm10%20Using%20Machine%20Learning%20Approach_Norfarhanah%20Hamid_K4_2021_ESAR.pdf
http://eprints.usm.my/54691/
id my.usm.eprints.54691
record_format eprints
spelling my.usm.eprints.54691 http://eprints.usm.my/54691/ Model Prediction Of Pm2.5 And Pm10 Using Machine Learning Approach Hamid, Norfarhanah T Technology TP Chemical Technology Universiti Sains Malaysia 2021-07-01 Monograph NonPeerReviewed application/pdf en http://eprints.usm.my/54691/1/Model%20Prediction%20Of%20Pm2.5%20And%20Pm10%20Using%20Machine%20Learning%20Approach_Norfarhanah%20Hamid_K4_2021_ESAR.pdf Hamid, Norfarhanah (2021) Model Prediction Of Pm2.5 And Pm10 Using Machine Learning Approach. Project Report. Universiti Sains Malaysia, Pusat Pengajian Kejuruteraan Kimia. (Submitted)
institution Universiti Sains Malaysia
building Hamzah Sendut Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Sains Malaysia
content_source USM Institutional Repository
url_provider http://eprints.usm.my/
language English
topic T Technology
TP Chemical Technology
description This study developed multi-input-single-output (MISO) and multi-input-multi-output (MIMO) models using an artificial neural network in MATLAB to predict the concentrations of PM2.5 and PM10 from meteorological parameters. A historical dataset from the Beijing Municipal Environmental Monitoring Centre served as the case study. The model was developed for generic use: during data pre-processing, two separate feature selection methods, the correlation coefficient and variable importance in projection (VIP) scores, were used to select the inputs most significant to the output. Both methods produced similar results, with the gaseous pollutants carbon monoxide (CO), nitrogen dioxide (NO2) and sulfur dioxide (SO2) showing the highest correlation with the output target. Based on the feature selection, models were built with and without input selection using the Nonlinear Autoregressive with Exogenous Input (NARX) neural network, with 10 hidden neurons and 2 delays, trained with the Levenberg-Marquardt algorithm. Model performance was evaluated using the Mean Square Error (MSE), Root Mean Square Error (RMSE), regression value (R) and coefficient of determination (R2). Models developed with and without input selection were compared; MISO Model 1 without input selection obtained the best performance, with testing MSE, RMSE, R and R2 of 0.0594, 0.2437, 0.9704 and 0.9417 respectively. With input selection, the corresponding values were 0.0589, 0.2428, 0.9709 and 0.9427. Removing the irrelevant variables therefore neither increased precision significantly nor reduced performance substantially. Instead, knowing the key parameters most strongly related to PM2.5 and PM10 would support better prediction of the concentrations. Prediction of PM2.5 and PM10 concentration using machine learning was achieved and is useful not only for improving public awareness but also for air quality management in Malaysia and other parts of the world.
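
The evaluation metrics and the 2-delay NARX input layout described above can be sketched in a few lines. This is an illustrative Python sketch, not the author's MATLAB implementation: the function names (`mse`, `rmse`, `pearson_r`, `narx_rows`) and the toy series are assumptions introduced here, and only the four reported testing values come from the abstract.

```python
import math

def mse(y_true, y_pred):
    """Mean square error: average of the squared residuals."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean square error: square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))

def pearson_r(y_true, y_pred):
    """Pearson correlation between targets and predictions (the R value)."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)

def narx_rows(x, y, delays=2):
    """Pair each target y[t] with the previous `delays` exogenous input
    vectors x[t-d] and past outputs y[t-d], as a NARX model would see them."""
    rows = []
    for t in range(delays, len(y)):
        features = []
        for d in range(1, delays + 1):
            features.extend(x[t - d])  # exogenous inputs at lag d
            features.append(y[t - d])  # past output at lag d
        rows.append((features, y[t]))
    return rows

# Hypothetical toy series (NOT data from the study): one exogenous input.
toy_x = [[1.0], [2.0], [3.0], [4.0]]
toy_y = [10.0, 20.0, 30.0, 40.0]
toy_rows = narx_rows(toy_x, toy_y, delays=2)

# Consistency check on the reported MISO Model 1 testing values.
reported = {"MSE": 0.0594, "RMSE": 0.2437, "R": 0.9704, "R2": 0.9417}
rmse_gap = abs(math.sqrt(reported["MSE"]) - reported["RMSE"])
r2_gap = abs(reported["R"] ** 2 - reported["R2"])
```

Both gaps come out below 0.001, so the reported RMSE matches the square root of the reported MSE, and the reported R2 matches the square of R. The second match suggests R2 was computed as R squared, although the abstract does not state this.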
format Monograph
author Hamid, Norfarhanah
title Model Prediction Of Pm2.5 And Pm10 Using Machine Learning Approach
publisher Universiti Sains Malaysia
publishDate 2021
url http://eprints.usm.my/54691/1/Model%20Prediction%20Of%20Pm2.5%20And%20Pm10%20Using%20Machine%20Learning%20Approach_Norfarhanah%20Hamid_K4_2021_ESAR.pdf
http://eprints.usm.my/54691/