Neural network-based correlation and statistical identification of data outliers in H2S-alkanolamine-H2O and CO2-alkanolamine-H2O datasets

Throughout the published literature for phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems, it is common to find some discrepant data, called data outliers. The presence of these erroneous values induces inaccuracies and prediction errors in the models and simulation stu...

Bibliographic Details
Main Authors: Imai, B., Nasir, Q., Maulud, A.S., Nawaz, M., Nasir, R., Suleman, H.
Format: Article
Published: Springer Science and Business Media Deutschland GmbH 2022
Online Access: http://scholars.utp.edu.my/id/eprint/33932/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85139545410&doi=10.1007%2fs00521-022-07904-z&partnerID=40&md5=97469bda6b87592a38118dbb42645fc5
id oai:scholars.utp.edu.my:33932
record_format eprints
spelling oai:scholars.utp.edu.my:33932 2022-12-20T03:50:54Z http://scholars.utp.edu.my/id/eprint/33932/ Springer Science and Business Media Deutschland GmbH 2022 Article NonPeerReviewed Imai, B. and Nasir, Q. and Maulud, A.S. and Nawaz, M. and Nasir, R. and Suleman, H. (2022) Neural network-based correlation and statistical identification of data outliers in H2S-alkanolamine-H2O and CO2-alkanolamine-H2O datasets. Neural Computing and Applications. ISSN 09410643 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85139545410&doi=10.1007%2fs00521-022-07904-z&partnerID=40&md5=97469bda6b87592a38118dbb42645fc5 10.1007/s00521-022-07904-z
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description Throughout the published literature for phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems, it is common to find some discrepant data, called data outliers. The presence of these erroneous values induces inaccuracies and prediction errors in the models and simulation studies developed using such experimental datasets. Hence, it is important that the data outliers are identified and later corrected or removed before developing a model or simulation. This study proposes a modified approach to identifying data outliers present in the phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems using an artificial neural network and data outlier identification methods. Firstly, the suggested approach correlates the experimental phase equilibrium data (2152 data points) of CO2- and H2S-loaded monoethanolamine, diethanolamine, and N-methyldiethanolamine solutions by developing an artificial neural network. Following this, the data outliers are identified by applying a modified IQR method and compared graphically to the 2.5 standard deviation method. The identified data outliers can then be truncated or winsorised for developing reliable and accurate models/simulations. The modified IQR method coupled with a neural network (based on the normalised data values) can robustly identify data outliers within a large experimental dataset. The proposed approach is superior to the previous data outlier identification techniques that used the 2.5 standard deviation method, as it alleviates the need for a human decision in determining the congruence of experimental values. The results also indicate that the developed method can be reliably extended to other/larger non-linear experimental datasets having similar correlative complexity. © 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
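The abstract contrasts an IQR-based rule with the 2.5 standard deviation rule and mentions winsorising the flagged points. The sketch below illustrates those three ideas on a small, made-up list of residuals (model prediction minus experimental value). It is not the authors' method: the record does not say how their IQR method is modified, so the standard Tukey fences with k = 1.5 are used instead, and the function names, the residual values, and the use of residuals as the screened quantity are all assumptions for illustration.

```python
import statistics

def iqr_outliers(residuals, k=1.5):
    """Flag values outside the Tukey fences [Q1 - k*IQR, Q3 + k*IQR].

    Standard (unmodified) IQR rule; k = 1.5 is the conventional choice,
    not a value taken from the paper.
    """
    q1, _, q3 = statistics.quantiles(residuals, n=4, method="inclusive")
    iqr = q3 - q1
    lo, hi = q1 - k * iqr, q3 + k * iqr
    flags = [not (lo <= r <= hi) for r in residuals]
    return flags, (lo, hi)

def sd_outliers(residuals, n_sd=2.5):
    """Flag values more than n_sd sample standard deviations from the mean."""
    mean = statistics.fmean(residuals)
    sd = statistics.stdev(residuals)
    return [abs(r - mean) > n_sd * sd for r in residuals]

def winsorise(residuals, lo, hi):
    """Clip every value into [lo, hi] instead of discarding it."""
    return [min(max(r, lo), hi) for r in residuals]

# Hypothetical residuals: nine well-behaved points and one discrepant point.
residuals = [0.01, -0.02, 0.00, 0.03, -0.01, 0.02, -0.03, 0.01, 0.00, 0.50]

iqr_flags, (lo, hi) = iqr_outliers(residuals)
sd_flags = sd_outliers(residuals)
winsorised = winsorise(residuals, lo, hi)

print("IQR flags:", iqr_flags)
print("2.5 SD flags:", sd_flags)
print("Winsorised:", winsorised)
```

In this toy case both rules flag only the last point. The two rules can diverge on real data because the mean and standard deviation are themselves pulled by extreme values, whereas quartiles are not.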
format Article
author Imai, B.
Nasir, Q.
Maulud, A.S.
Nawaz, M.
Nasir, R.
Suleman, H.
title Neural network-based correlation and statistical identification of data outliers in H2S-alkanolamine-H2O and CO2-alkanolamine-H2O datasets
publisher Springer Science and Business Media Deutschland GmbH
publishDate 2022
url http://scholars.utp.edu.my/id/eprint/33932/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85139545410&doi=10.1007%2fs00521-022-07904-z&partnerID=40&md5=97469bda6b87592a38118dbb42645fc5