Integrating Ensemble Learning and Model-Agnostic Explain ability for Reliable Credit Card Fraud Prediction
Keywords:
XGBoost, ROC–AUC, SHAP and LIMEAbstract
Because of highly imbalanced datasets and rapidly changing fraud patterns, there is still a lack of effective methods for credit card fraud detection in the context of modern financial systems. We propose a novel + explain ability→ machine-learning framework that integrates both Random Forest and XGBoost classifiers with explain ability techniques to enhance not just predictive accuracy, but critical interpretability for audience⇄ the consumers of insights→. This study employs the publicly available credit card fraud dataset (284,807 transactions), with only 0.17% of them being fraudulent samples. Following pre-processing and standardization, models were trained and evaluated on the basis of 70–30 train–test split. Random Forest classifier scored an overall accuracy of 99.99% whilef0.93, recall f0.82, and f1-score f0.87 among the fraud class. From the confusion matrix, we see that there are 111 true fraud detected but only 25 missed cases, and the ROC–AUC of 0.986 indicates, we have great discriminatory power. Finally, with the application of XGBoost, we were able to reach a precision of 0.94, a recall of 0.82, and an F1-score of 0.87 for the minority class, thus also confirming very good generalization and robustness between the metrics. For the sake of interpretability, SHAP and LIME were performed to catch the main contributing features. SHAP bee swarm and bar plots indicated that V14, V17, V12, and V10 were the most important variables for predicting fraud, while LIME gave case-level explanations of the feature-values pointing a transaction to being fraud or non-fraud. XGBoost achieved a robust performance in low-prevalence conditions represented by the Precision–Recall curve, where it retains a high precision over a large range of recall. In summary, these results indicate that ensemble classifiers and model-agnostic explain ability tools make a natural combination that leads to a powerful and transparent fraud detection approach. This provides the framework for the financial institutions to make fast yet reliable and interpretable decisions on a continuous basis. The possible extensions for the visual representation will cover the deep learning architectures, anomaly detection hybrids, and cost-sensitive learning to maximize the performance and minimize the false negative rate.
Downloads
References
11787.https://doi.org/10.3390/app152111787
A. Dal Pozzolo, G. Boracchi, O. Caelen, C. Alippi and G. Bontempi, "Credit Card Fraud Detection: A Realistic Modeling and a Novel Learning Strategy," in IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 8, pp. 3784-3797, Aug. 2018, https://doi.org/10.1109/TNNLS.2017.2736643
Rehman, Shoaib. "Credit Card Fraud Detection: Practical Applications and Challenges."
Ranjan, Nihar& Mate, Gitanjali & Jadhav, Archana & Patil, D. &Banubakode, A.. (2024). Credit Card Fraud Detection by Using Ensemble Method of Machine Learning. https://doi.org/10.1007/978-981-99-9521-9_34
Pundkar, Sumedh N., and MohdZubei. "Credit card fraud detection methods: A review." E3S Web of Conferences. Vol. 453. EDP Sciences, 2023.https://doi.org/10.1051/e3sconf/202345301015
Bin Sulaiman, Rejwan, Vitaly Schetinin, and Paul Sant. "Review of machine learning approach on credit card fraud detection." Human-Centric Intelligent Systems 2.1 (2022): 55-68.https://doi.org/10.1007/s44230-022-00004-0
Hajiabdollah, Niloofar, and Mehdi Sadeghzadeh. "A Review of Hybrid Deep Learning Approaches for Credit Card Fraud Detection." Available at SSRN 5129198.https://ssrn.com/abstract=5129198
Aslam, Farhan. "Advancing Credit Card Fraud Detection: A Review of Machine Learning Algorithms and the Power of Light Gradient Boosting." Am. J. Comput. Sci. Technol 7 (2024): 9-12.https://doi.org/10.11648/ajcst.20240701.12
Praveen Gugulothu , Shekhar Katukoori , Swapna Manuparthi "Deep Learning based techniques for Covid-19 diagnosis based on Various Pattern features detection from early stages of diseases”, Network Computation in Neural Systems.(Accepted May 2024, Indexing: SCIE, IF: 9, Publisher: Taylor & Francis).
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.


