Optimizing Tuberculosis Treatment Predictions: A Comparative Study of XGBoost with Hyperparameter in Penang, Malaysia (Mengoptimumkan Peramalan Rawatan Tuberkulosis: Suatu Kajian Perbandingan XGBoost dengan Hiperparameter di Penang, Malaysia)

Yaniza Shaira, Zakaria and Nur Afiqah, Ariffin and Azizul, Ahmad and Ruslan, Rainis and Aidy, M. Muslim and Wan Mohd Muhiyuddin, Wan Ibrahim (2025) Optimizing Tuberculosis Treatment Predictions: A Comparative Study of XGBoost with Hyperparameter in Penang, Malaysia (Mengoptimumkan Peramalan Rawatan Tuberkulosis: Suatu Kajian Perbandingan XGBoost dengan Hiperparameter di Penang, Malaysia). Sains Malaysiana, 54 (1). pp. 3741-3752. ISSN 0126-6039

[img] PDF
OPTIMI~2.PDF - Published Version

Download (597kB)
Official URL: https://www.ukm.my/jsm/pdf_files/SM-PDF-54-1-2025/...

Abstract

The bacterium Mycobacterium tuberculosis causes a viral infection affecting the lungs and liver. Tuberculosis (TB) is a significant public health concern in developing countries, where it is often associated with poverty, poor living conditions, and limited access to healthcare services. According to the World Health Organization (2023), Tuberculosis continues to pose a substantial risk to public health on a global scale, with millions of people affected each year and around 1.5 million deaths in 2020. Healthcare providers often encounter significant challenges in addressing TB, leading to uncertain treatment outcomes. This study introduces a novel method for enhancing TB treatment using sophisticated machine learning techniques, particularly emphasizing the application of XGBoost and various predictive models in Penang State, Malaysia, to predict individual treatment outcomes based on clinical data. The models were trained using 2017 Penang data. Comparing predicted accuracy helps establish the optimum method. Clinical data was anonymized and analyzed. Decision tree accuracy is 63.7% using 2017 data. Logistic Regression is 63.3% accurate, while XGBoost is 66.3%. Hyperparameter-tuned XGBoost performs best at 68.1%. Comparing observed and expected results determines accuracy. TB result predictions are accurate using supervised learning. Calibrated ensemble models like XGBoost makes reliable predictions. Additional clinical characteristics may improve forecasts. The primary objective was to develop a reliable, clinically validated instrument that enhances TB treatments while optimizing resource efficiency across diverse healthcare environments.

Item Type: Article
Additional Information: This study was approved by the National Medical Research Register of Malaysia (NMRR) under registration numbers NMRR-14-998-21596 and NMRR-19-447-447-447. The study is also associated with the Long-term Grant Scheme (LRGS): Evaluation of the National TB Prevention and Control Programme towards Achieving the Millennium Development Goals (MDG) (203.PSK.6722006). Their dedication to information sharing and the advancement of knowledge is genuinely admirable.
Uncontrolled Keywords: Classification; hyperparameter; logistic regression; prediction; random forest; tuberculosis.
Subjects: G Geography. Anthropology. Recreation > G Geography (General)
G Geography. Anthropology. Recreation > GA Mathematical geography. Cartography
H Social Sciences > H Social Sciences (General)
H Social Sciences > HA Statistics
Divisions: Academic Faculties, Institutes and Centres > Faculty of Social Sciences & Humanities
Faculties, Institutes, Centres > Faculty of Social Sciences & Humanities
Depositing User: AZIZUL BIN AHMAD
Date Deposited: 31 Jan 2025 07:52
Last Modified: 31 Jan 2025 07:52
URI: http://ir.unimas.my/id/eprint/47430

Actions (For repository members only: login required)

View Item View Item