Random Forest and Extreme Gradient Boosting with Bayesian Hyperparameter Optimization for Landslide Susceptibility Mapping in Penang Island, Malaysia

Bibliographic Details
Main Authors: Dorothy, Martin Atok; Soo See, Chai; Kok Luong, Goh; Neha, Gautam; Kim On, Chin
Format: Article
Language: English
Published: Science Publications 2025
Online Access: http://ir.unimas.my/id/eprint/51082/1/jcssp.2025.2273.2291.pdf
http://ir.unimas.my/id/eprint/51082/
https://thescipub.com/abstract/jcssp.2025.2273.2291
https://doi.org/10.3844/jcssp.2025.2273.2291
Description
Summary: Landslide susceptibility models often face challenges of overfitting and overestimation. This research focuses on improving the predictive capabilities of the Extreme Gradient Boosting (XGBoost) and Random Forest (RF) algorithms by applying Bayesian Hyperparameter Optimization (BayesOpt). Penang Island, a region in Malaysia prone to frequent landslides, was chosen as the study area. Ten Landslide Conditioning Factors (LCFs), including elevation, slope angle, NDVI, and proximity to streams and roads, were derived using Geographic Information Systems (GIS). The 886 landslide and non-landslide data points were split 70:30 for training and testing, respectively. BayesOpt-RF was the top-performing model among those assessed, with an AUC of 99.50% for the Success Rate (SR) and 95.80% for the Prediction Rate (PR). RF (SR: 100.00%, PR: 95.60%), XGBoost (SR: 100.00%, PR: 95.20%), and BayesOpt-XGBoost (SR: 96.70%, PR: 93.00%) followed. While BayesOpt did not consistently improve prediction performance, it effectively minimized overfitting and ensured optimal model operation. The generated landslide susceptibility maps are significant for effective site selection, infrastructure planning, and disaster mitigation.
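The evaluation quantities named in the summary can be illustrated with a minimal, standard-library Python sketch. It is not the authors' implementation. The helper names (`split_70_30`, `auc`, `overfitting_gaps`) are hypothetical, the shuffle seed is arbitrary, and treating the difference between SR and PR as a rough indicator of overfitting is an assumption made here for illustration. The only data used are the AUC percentages reported in the summary.

```python
import random

# AUC values (%) as reported in the summary: (success rate, prediction rate).
REPORTED_AUC = {
    "BayesOpt-RF": (99.50, 95.80),
    "RF": (100.00, 95.60),
    "XGBoost": (100.00, 95.20),
    "BayesOpt-XGBoost": (96.70, 93.00),
}


def split_70_30(n_samples, seed=0):
    """Shuffle indices 0..n_samples-1 and cut them 70:30 (hypothetical helper)."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    cut = round(0.7 * n_samples)
    return indices[:cut], indices[cut:]


def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    Computed on training data this corresponds to a success rate; on
    held-out test data it corresponds to a prediction rate.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def overfitting_gaps(reported):
    """SR minus PR in percentage points (assumed rough overfitting indicator)."""
    return {name: round(sr - pr, 2) for name, (sr, pr) in reported.items()}


# 886 points is the total stated in the summary; the split itself is illustrative.
train_idx, test_idx = split_70_30(886)
gaps = overfitting_gaps(REPORTED_AUC)
```

Under that reading, the reported figures give SR-PR gaps of 3.70 (BayesOpt-RF), 4.40 (RF), 4.80 (XGBoost), and 3.70 (BayesOpt-XGBoost) percentage points, which is consistent with the summary's statement that BayesOpt reduced overfitting even where it did not raise the prediction rate.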