Predicting Hospital Readmission for Diabetes Patients

Authors

  • Hajar Hussein AL Qahtani, Abdulmohsen Algarni

Keywords:

Diabetes Readmission Prediction, Machine Learning (ML), Deep Learning (DL), Classification

Abstract

Predicting hospital readmission among diabetes patients is essential for improving patient outcomes, reducing healthcare costs, and optimizing the use of medical resources. However, this task is complex due to the intricate nature of healthcare data, high feature dimensionality, class imbalance issues, and the necessity of integrating both demographic and clinical variables. To address these challenges, a variety of machine learning models were developed and assessed, including traditional classifiers such as Decision Trees, Logistic Regression, and Random Forests, as well as more advanced approaches like XGBoost and Deep Neural Networks. To enhance model performance, we applied preprocessing techniques such as feature transformation, data balancing, and categorical encoding. Experiments were conducted on clinical datasets to predict patient readmission within 30 days, after 30 days, or not at all. Performance metrics included classification accuracy and the AUC-ROC score. Results showed that the Random Forest model achieved the highest performance in binary classification, with an accuracy of 94% and an AUC-ROC of 0.97, while a proposed Multi-Stage Classifier excelled in the multi-class task with 80% accuracy and an AUC-ROC of 0.89. Overall, the study highlights the potential of machine learning, particularly when coupled with effective preprocessing, to accurately predict hospital readmissions in diabetes care, thereby aiding clinical decisions and improving healthcare efficiency.

Downloads

Download data is not yet available.

References

C. J. Ejiyi, Z. Qin, J. Amos, M. B. Ejiyi, A. Nnani, T. U. Ejiyi, V. K. Agbesi, C. Diokpo and C. Okpara, "A robust predictive diagnosis model for diabetes mellitus using Shapley-incorporated machine learning algorithms," Healthcare Analytics, vol. 3, p. 100166, 2023.

B. Sly, A. W. Russell and C. Sullivan, "Digital interventions to improve safety and quality of inpatient diabetes management: A systematic review," International Journal of Medical Informatics, vol. 157, p. 104596, 2022 .

S. Davis, J. Zhang, I. Lee, M. Rezaei, R. Greiner, F. A. McAlister and R. Padwal, "Effective hospital readmission prediction models using machine-learned features," BMC health services research, vol. 22, no. 1, p. 1415, 2022.

Y. Xue, D. Klabjan and Y. Luo, "Predicting ICU readmission using grouped physiological and medication trends," Artificial Intelligence in Medicine, vol. 95, pp. 27-37, 2019.

M. Zeinalnezhad and S. Shishehchi, "An integrated data mining algorithms and meta-heuristic technique to predict the readmission risk of diabetic patients," Healthcare Analytics, vol. 5, p. 100292, 2024.

M. Mahmoud, M. Bader and J. McNicholas, "Short-Term and Long-Term Readmission Prediction in Uncontrolled Diabetic Patients using Machine Learning Techniques," in 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023), 2023.

V. B. Liu, L. Y. Sue and Y. Wu, "Comparison of machine learning models for predicting 30-day readmission rates for patients with diabetes," Medical Artificial Intelligence, vol. 7, p. 23, 2024.

Y. Shang, K. Jiang, L. Wang, Z. Zhang, S. Zhou, Y. Liu, J. Dong and H. Wu, "The 30-days hospital readmission risk in diabetic patients: predictive modeling with machine learning classifiers," BMC Medical Informatics and Decision Making, vol. 21, no. 2, p. 57, 2021.

D. Sathyavathi and A. Mary Sowjanya, "Predicting Hospital Readmission for Diabetes Patients Using Machine Learning," Journal of Emerging Technologies and Innovative Research (JETIR), vol. 7, no. 11, pp. 1006-1012, 2020.

H. He and E. A. Garcia, "Learning from Imbalanced Data," IEEE Transactions on Knowledge and Data Engineering, vol. 21, no. 9, pp. 1263-1284, 2009.

P. Branco, L. Torgo and R. Ribeiro, "A Survey of Predictive Modelling under Imbalanced Distributions.," arXiv, 2015.

T. Elhassan, A. M, A.-M. F and M. Shoukri, "Classification of Imbalance Data using Tomek Link (T-Link) Combined with Random Under-sampling (RUS) as a Data Reduction Method," Global Journal of Technology and Optimization, vol. 1, 2016.

A. P. Ratnasari, "Performance of Random Oversampling, Random Undersampling, and SMOTE-NC Methods in Handling Imbalanced Class in Classification Models," International Journal of Scientific Research and Management (IJSRM), vol. 12, no. 4, pp. 494-501, 2024.

I. D. Mienye and N. Jere, "A Survey of Decision Trees: Concepts, Algorithms, and Applications," IEEE Access PP():1-1, vol. 12, pp. 86716 - 86727, 2024.

Y. Izza, A. Ignatiev and J. Marques-Silva, "On Explaining Decision Trees," arXiv, 2020.

M. Fratello and R. Tagliaferri, "Decision Trees and Random Forests," in Reference Module in Life Sciences, 2018.

X. Zhou, P. Lu, Z. Zheng, D. Tolliver and A. Keramati, "Accident Prediction Accuracy Assessment for Highway-Rail Grade Crossings Using Random Forest Algorithm Compared with Decision Tree," Reliability Engineering & System Safety, vol. 200, p. 106931, 2020.

C. Bentéjac, A. Csörgő and G. Martínez-Muñoz, "A Comparative Analysis of XGBoost," arXiv, 2019.

T. Chen and C. Guestrin, "XGBoost: A Scalable Tree Boosting System," arXiv, 2016.

M. Maalouf, "Logistic regression in data analysis: An overview," International Journal of Data Analysis Techniques and Strategies, vol. 3, no. 3, pp. 281-299, 2011.

M. K. Chung, "Introduction to logistic regression," arXiv, 2020.

E. Jawad, "THE DEEP NEURAL NETWORK-A REVIEW," IJRDO -JOURNAL OF MATHEMATICS, vol. 9, no. 9, pp. 1-5, 2023.

W. Samek, G. Montavon, S. Lapuschkin, C. J. Anders and K.-R. Müller, "Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications," arXiv, 2020.

A. F. Agarap, "Deep Learning using Rectified Linear Units (ReLU)," arXiv, 2018 .

B. Strack, J. P. DeShazo, K. J. Cios and J. N. Clore, "Diabetes 130-US Hospitals for Years 1999-2008," UCI Machine Learning Repository, 2014. [Online]. Available: https://archive.ics.uci.edu/dataset/296/diabetes+130-us+hospitals+for+years+1999-2008. [Accessed March 2025].

B. Strack, J. P. DeShazo, C. Gennings, J. L. Olmo, S. Ventura, K. J. Cios and J. N. Clore, "Impact of HbA1c Measurement on Hospital Readmission Rates: Analysis of 70,000 Clinical Database Patient Records," BioMed Research International, vol. 2014, 2014.

Downloads

Published

19.04.2025

How to Cite

Hajar Hussein AL Qahtani. (2025). Predicting Hospital Readmission for Diabetes Patients. International Journal of Intelligent Systems and Applications in Engineering, 13(1), 469 –. Retrieved from https://www.ijisae.org/index.php/IJISAE/article/view/7836

Issue

Section

Research Article