The Effectiveness Of A Probabilistic Principal Component Analysis Model And Expectation Maximisation Algorithm In Treating Missing Daily Rainfall Data

The reliability and accuracy of a risk assessment of extreme hydro-meteorological events are highly dependent on the quality of the historical rainfall time series data. However, missing data in a time series such as this could result in lower quality data. Therefore, this paper proposes a multiple-...

Full description

Saved in:
Bibliographic Details
Main Authors: Zun Liang, Chuan, Fam, Soo Fen, Mohd Deni, Sayang, Ismail, Noriszura
Format: Article
Language:en
Published: Korean Meteorological Society 2020
Online Access:http://eprints.utem.edu.my/id/eprint/24149/2/1%20CHUANTHE%20EFFECTIVENESS%20OF%20PROBABILISTIC%20PCA%20RAIN%20FALLDATA.PDF
http://eprints.utem.edu.my/id/eprint/24149/
https://link.springer.com/article/10.1007/s13143-019-00135-8
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The reliability and accuracy of a risk assessment of extreme hydro-meteorological events are highly dependent on the quality of the historical rainfall time series data. However, missing data in a time series such as this could result in lower quality data. Therefore, this paper proposes a multiple-imputation algorithm for treating missing data without requiring information from adjoining monitoring stations. The proposed imputation algorithms are based on the M-component probabilistic principal component analysis model and an expectation maximisation algorithm (MPPCA-EM). In order to evaluate the effectiveness of the MPPCA-EM imputation algorithm, six distinct historical daily rainfall time series data were recorded from six monitoring stations. These stations were located at the coastal and inland regions of the East-Coast Economic Region (ECER) Malaysia. The results of analysis show that, when it comes to treating missing historical daily rainfall time series data recorded from coastal monitoring stations, the 2-component probabilistic principal component analysis model and expectation-maximisation algorithm (2PPCA-EM) were found to be superior to the single- and multiple-imputation algorithms proposed in previous studies. On the contrary, the single-imputation algorithms as proposed in previous studies were superior to the MPPCA-EM imputation algorithms when treating missing historical daily rainfall time series data recorded from inland monitoring stations