Addressing overfitting and overestimation challenges in landslide susceptibility modeling : a case study of Penang Island, Malaysia

Dorothy, Martin Atok and Chai, Soo See (2025) Addressing overfitting and overestimation challenges in landslide susceptibility modeling : a case study of Penang Island, Malaysia. Natural Hazards. pp. 1-28. ISSN 1573-0840

[img] PDF
s11069-025-07329-6.pdf

Download (2MB)
Official URL: https://link.springer.com/article/10.1007/s11069-0...

Abstract

In the realm of landslide susceptibility prediction, the challenge of overftting and overestimation has persisted despite various modeling attempts. This study aims to elevate the predictive capabilities of the Extreme Gradient Boosting (XGBoost) and Random Forest (RF) models for landslide susceptibility assessment through the innovative application of Bayesian Optimization (BO). Using data from Penang Island in Malaysia, we comprehensively incorporated topographical, hydrological, human, and environmental factors infuencing landslides. Leveraging Geographic Information System (GIS) tools, we meticulously constructed spatial databases encompassing all pertinent landslide conditioning elements. Our fndings unveil the remarkable performance of the optimized XGBoost model, achieving an astounding 100.0% Success Rate (SR) and an impressive 97.1% Prediction Rate (PR). In comparison, the optimized RF model achieved an SR of 99.7% and a PR of 96.3%, while the stacked models followed closely with an SR of 96.8% and a PR of 95.6%. These conclusive results underscore the transformative potential of addressing overftting and overestimation challenges through the strategic combination of stacking and hyperparameter optimization. The improved accuracy of these algorithms bears immense signifcance, extending to applications in site selection, engineering structure health monitoring, and disaster mitigation, thus elevating the importance of Landslide Susceptibility Maps (LSMs) in safeguarding communities and infrastructure.

Item Type: Article
Uncontrolled Keywords: Extreme gradient boosting · Geographic information system · Hybrid · Landslide susceptibility · Random forest.
Subjects: Q Science > QA Mathematics > QA76 Computer software
T Technology > T Technology (General)
Divisions: Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Faculties, Institutes, Centres > Faculty of Computer Science and Information Technology
Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Depositing User: Gani
Date Deposited: 18 Jun 2025 04:05
Last Modified: 19 Jun 2025 07:13
URI: http://ir.unimas.my/id/eprint/48475

Actions (For repository members only: login required)

View Item View Item