The loan prediction models are ever-changing due to changes in technology, whereby financial institutions are adopting various technologies to automate the loan process. The surge of loan applicants with diverse attributes has accelerated the need for machine learning models which can incorporate different applicant attributes and improve accuracy in determining loan eligibility. While the use of individual machine learning models was more robust and accurate, these models have some limitations that may hinder the achievement of optimal results when establishing loan eligibility. Thus, the need to combine two or more individual models and leverage their strengths to improve accuracy and robustness. This study aimed to develop an eXtreme Gradient Boosting (XGBoost)-Support Vector Machine (SVM) hybrid model to determine loan eligibility in the banking sector. This work utilized secondary financial dataset from Google Kaggle. The dataset was preprocessed, transformed and used to train and test the models. Evaluation metrics, namely accuracy, precision, F1-score, recall and Receiver Operating Characteristics-Area Under Curve (ROC-AUC), were used to evaluate the performance and reliability of the XGBoost, SVM and the XGBoost-SVM hybrid model. The XGBoost-SVM hybrid model posted strong performance metrics results with accuracy of 0.78, precision of 0.27, a balanced recall of 0.49, F1-Score of 0.34 and AUC-ROC curve of 0.74. It was therefore evident that the hybrid model was able to leverage the standalone model's strength for better performance in determining loan eligibility.
| Published in | International Journal of Data Science and Analysis (Volume 12, Issue 3) |
| DOI | 10.11648/j.ijdsa.20261203.11 |
| Page(s) | 37-51 |
| Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
| Copyright |
Copyright © The Author(s), 2026. Published by Science Publishing Group |
Loan Eligibility, Machine Learning, XGBoost, Support Vector Machine, Hybrid Model
| [1] | Nazim Uddin et al. "An ensemble machine learning based bank loan approval predictions system with a smart application". In: International Journal of Cognitive Computing in Engineering 4 (2023), pp. 327-339. |
| [2] | CL Perera and SC Premaratne. "An ensemble machine learning approach for forecasting credit risk of loan applications". In: WSEAS Transactions on Systems 23 (2024), pp. 31-46. |
| [3] | Sinap, Vahid (2024). Vahid Sinap. "A comparative study of loan approval prediction using machine learning methods". In: Gazi Universitesi Fen Bilimleri Dergisi Part C: Tasarim ve Teknoloji 12.2 (2024), pp. 644-663. |
| [4] | Haque FM Ahosanul and Mahedi Hassan. "Bank Loan Prediction Using Machine Learning Techniques". In: arXiv e-prints (2024), arXiv-2410. |
| [5] | Cheuk Lam Lai. "A Novel Machine Learning-based Ensemble Model for Loan Prediction". In: 2025 3rd International Conference on Image, Algorithms, and Artificial Intelligence (ICIAAI 2025). Atlantis Press. 2025, pp. 263-273. |
| [6] | Debabrata Dansana et al. "Analyzing the impact of loan features on bank loan prediction using R andom F orest algorithm". In: Engineering Reports 6.2 (2024), e12707. |
| [7] | Davinder Paul Singh et al. "Predictive Modeling for Bank Loan Approval: From Data to Decisions". In: Procedia Computer Science 259 (2025), pp. 1426-1431. |
| [8] | Eslam Hussein Sayed et al. "Machine learning and deep learning for loan prediction in banking: Exploring ensemble methods and data balancing". In: IEEE Access 12 (2024), pp. 193997-194019. |
| [9] | Golda T Kisutsa. "Loan Default Prediction Using Machine Learning: a Case of Mobile Based Lending". Doctoral dissertation. University of Nairobi, 2021. |
| [10] | Zheng Zhi Kang et al. "Loan Default Prediction Using Machine Learning Algorithms". In: Journal of Informatics and Web Engineering 4.3 (2025), pp. 232-244. |
| [11] | Xinyu Zhang et al. "Data-driven loan default prediction: A machine learning approach for enhancing business process management". In: Systems 13.7 (2025), p. 581. |
| [12] | Keke Yu et al. "Loan approval prediction improved by XGBoost model based on fourvector optimization algorithm". In: Applied and Computational Engineering (2024). |
| [13] | Ruoyu Qi. "Loan default prediction and feature importance analysis based on the XGBoost model". In: European Journal of Business, Economics & Management 1.2 (2025), pp. 141-149. |
| [14] | Xu Zhu et al. "Explainable prediction of loan default based on machine learning models". In: Data Science and Management 6.3 (2023), pp. |
| [15] | Malti Bansal, Apoorva Goyal, and Apoorva Choudhary. "A comparative analysis of K-nearest neighbor, genetic, support vector machine, decision tree, and long short term memory algorithms in machine learning". In: Decision analytics journal 3 (2022), p. 100071. |
| [16] | Vikas Kumar et al. "AI-based hybrid models for predicting loan risk in the banking sector". In: Big Data Mining and Analytics 6.4 (2023), pp. 478-490. |
| [17] | Dhritiman Saha and Annamalai Manickavasagan. "Machine learning techniques for analysis of hyperspectral images to determine quality of food products: A review". In: Current Research in Food Science 4 (2021), pp. 28-44. |
| [18] | Obare Dm and Muraya Mm. "Comparison of accuracy of support vector machine model and logistic regression model in predicting individual loan defaults". In: American Journal of Applied Mathematics and Statistics 6.6 (2018), pp. 266-271. |
| [19] | Mehmet Furkan Akca and Onur Sevli. "Predicting acceptance of the bank loan offers by using support vector machines". In: International Advanced Researches and Engineering Journal 6.2 (2022), pp. 142-147. |
| [20] | Novita Angraini, Kelly Rosalina, and Andini Kosasih. "Optimizing Loan Approval Processes with Support Vector Machines (SVM)". In: ITEJ (Information Technology Engineering Journals) 9.2 (2024), pp. 53-61. |
| [21] | Purvi Prabhakar Shetty. "A Hybrid Feature Selection and Hybrid Prediction Model for Credit Risk Prediction". Doctoral dissertation. Dublin, National College of Ireland, 2023. |
| [22] | Shivam Krishna and Arun Solanki. "Hybrid Machine Learning Models for Credit Risk Prediction: An Explainable AI Approach". In: Proceedings of Data Analytics and Management. Cham: Springer Nature Switzerland, 2025, pp. 483-492. |
| [23] | Reena Singh, Vedika Bengani, and Khushi Saini. Hybrid learning systems: Integrating traditional machine learning with deep learning techniques. 2024. |
| [24] | Jorge Paredes et al. "A hybrid machine learning algorithm approach to predictive maintenance tasks: a comparison with machine learning algorithms". In: Results in Engineering (2025), p. 105137. |
APA Style
Muhoro, J. G., Gitonga, H. M., Ngure, J. N. (2026). Determining Loan Eligibility in the Banking Sector Using a Hybrid Model of Support Vector Machine and Extreme Gradient Boosting. International Journal of Data Science and Analysis, 12(3), 37-51. https://doi.org/10.11648/j.ijdsa.20261203.11
ACS Style
Muhoro, J. G.; Gitonga, H. M.; Ngure, J. N. Determining Loan Eligibility in the Banking Sector Using a Hybrid Model of Support Vector Machine and Extreme Gradient Boosting. Int. J. Data Sci. Anal. 2026, 12(3), 37-51. doi: 10.11648/j.ijdsa.20261203.11
@article{10.11648/j.ijdsa.20261203.11,
author = {James Githaiga Muhoro and Harun Mwangi Gitonga and Josephine Njeri Ngure},
title = {Determining Loan Eligibility in the Banking Sector Using a Hybrid Model of Support Vector Machine and Extreme Gradient Boosting},
journal = {International Journal of Data Science and Analysis},
volume = {12},
number = {3},
pages = {37-51},
doi = {10.11648/j.ijdsa.20261203.11},
url = {https://doi.org/10.11648/j.ijdsa.20261203.11},
eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijdsa.20261203.11},
abstract = {The loan prediction models are ever-changing due to changes in technology, whereby financial institutions are adopting various technologies to automate the loan process. The surge of loan applicants with diverse attributes has accelerated the need for machine learning models which can incorporate different applicant attributes and improve accuracy in determining loan eligibility. While the use of individual machine learning models was more robust and accurate, these models have some limitations that may hinder the achievement of optimal results when establishing loan eligibility. Thus, the need to combine two or more individual models and leverage their strengths to improve accuracy and robustness. This study aimed to develop an eXtreme Gradient Boosting (XGBoost)-Support Vector Machine (SVM) hybrid model to determine loan eligibility in the banking sector. This work utilized secondary financial dataset from Google Kaggle. The dataset was preprocessed, transformed and used to train and test the models. Evaluation metrics, namely accuracy, precision, F1-score, recall and Receiver Operating Characteristics-Area Under Curve (ROC-AUC), were used to evaluate the performance and reliability of the XGBoost, SVM and the XGBoost-SVM hybrid model. The XGBoost-SVM hybrid model posted strong performance metrics results with accuracy of 0.78, precision of 0.27, a balanced recall of 0.49, F1-Score of 0.34 and AUC-ROC curve of 0.74. It was therefore evident that the hybrid model was able to leverage the standalone model's strength for better performance in determining loan eligibility.},
year = {2026}
}
TY - JOUR T1 - Determining Loan Eligibility in the Banking Sector Using a Hybrid Model of Support Vector Machine and Extreme Gradient Boosting AU - James Githaiga Muhoro AU - Harun Mwangi Gitonga AU - Josephine Njeri Ngure Y1 - 2026/07/03 PY - 2026 N1 - https://doi.org/10.11648/j.ijdsa.20261203.11 DO - 10.11648/j.ijdsa.20261203.11 T2 - International Journal of Data Science and Analysis JF - International Journal of Data Science and Analysis JO - International Journal of Data Science and Analysis SP - 37 EP - 51 PB - Science Publishing Group SN - 2575-1891 UR - https://doi.org/10.11648/j.ijdsa.20261203.11 AB - The loan prediction models are ever-changing due to changes in technology, whereby financial institutions are adopting various technologies to automate the loan process. The surge of loan applicants with diverse attributes has accelerated the need for machine learning models which can incorporate different applicant attributes and improve accuracy in determining loan eligibility. While the use of individual machine learning models was more robust and accurate, these models have some limitations that may hinder the achievement of optimal results when establishing loan eligibility. Thus, the need to combine two or more individual models and leverage their strengths to improve accuracy and robustness. This study aimed to develop an eXtreme Gradient Boosting (XGBoost)-Support Vector Machine (SVM) hybrid model to determine loan eligibility in the banking sector. This work utilized secondary financial dataset from Google Kaggle. The dataset was preprocessed, transformed and used to train and test the models. Evaluation metrics, namely accuracy, precision, F1-score, recall and Receiver Operating Characteristics-Area Under Curve (ROC-AUC), were used to evaluate the performance and reliability of the XGBoost, SVM and the XGBoost-SVM hybrid model. The XGBoost-SVM hybrid model posted strong performance metrics results with accuracy of 0.78, precision of 0.27, a balanced recall of 0.49, F1-Score of 0.34 and AUC-ROC curve of 0.74. It was therefore evident that the hybrid model was able to leverage the standalone model's strength for better performance in determining loan eligibility. VL - 12 IS - 3 ER -