Addressing overfitting and overestimation challenges in landslide susceptibility modeling : a case study of Penang Island, Malaysia
In the realm of landslide susceptibility prediction, the challenge of overftting and overestimation has persisted despite various modeling attempts. This study aims to elevate the predictive capabilities of the Extreme Gradient Boosting (XGBoost) and Random Forest (RF) models for landslide susceptib...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | en |
| Published: |
Springer Nature
2025
|
| Subjects: | |
| Online Access: | http://ir.unimas.my/id/eprint/48475/1/s11069-025-07329-6.pdf http://ir.unimas.my/id/eprint/48475/ https://link.springer.com/article/10.1007/s11069-025-07329-6 https://doi.org/10.1007/s11069-025-07329-6 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | In the realm of landslide susceptibility prediction, the challenge of overftting and overestimation has persisted despite various modeling attempts. This study aims to elevate the predictive capabilities of the Extreme Gradient Boosting (XGBoost) and Random Forest (RF) models for landslide susceptibility assessment through the innovative application of Bayesian Optimization (BO). Using data from Penang Island in Malaysia, we comprehensively incorporated topographical, hydrological, human, and environmental factors infuencing landslides. Leveraging Geographic Information System (GIS) tools, we meticulously constructed spatial databases encompassing all pertinent landslide conditioning elements. Our fndings unveil the remarkable performance of the optimized XGBoost model, achieving an astounding 100.0% Success Rate (SR) and an impressive 97.1% Prediction Rate (PR). In comparison, the optimized RF model achieved an SR of 99.7% and a PR of 96.3%, while the stacked models followed closely with an SR of 96.8% and a PR
of 95.6%. These conclusive results underscore the transformative potential of addressing overftting and overestimation challenges through the strategic combination of stacking and hyperparameter optimization. The improved accuracy of these algorithms bears immense signifcance, extending to applications in site selection, engineering structure health monitoring, and disaster mitigation, thus elevating the importance of Landslide Susceptibility
Maps (LSMs) in safeguarding communities and infrastructure. |
|---|
