Integrating Ensemble Learning and Model-Agnostic Explain ability for Reliable Credit Card Fraud Prediction

Authors

  • Pradeep Venuthurumilli, Nalli Vinaya Kumari, Srinivasa Rao, D. Mahitha, Vaishnavi Teja, D. Devender

Keywords:

XGBoost, ROC–AUC, SHAP and LIME

Abstract

Because of highly imbalanced datasets and rapidly changing fraud patterns, there is still a lack of effective methods for credit card fraud detection in the context of modern financial systems. We propose a novel + explain ability→ machine-learning framework that integrates both Random Forest and XGBoost classifiers with explain ability techniques to enhance not just predictive accuracy, but critical interpretability for audience⇄ the consumers of insights→. This study employs the publicly available credit card fraud dataset (284,807 transactions), with only 0.17% of them being fraudulent samples. Following pre-processing and standardization, models were trained and evaluated on the basis of 70–30 train–test split. Random Forest classifier scored an overall accuracy of 99.99% whilef0.93, recall f0.82, and f1-score f0.87 among the fraud class. From the confusion matrix, we see that there are 111 true fraud detected but only 25 missed cases, and the ROC–AUC of 0.986 indicates, we have great discriminatory power. Finally, with the application of XGBoost, we were able to reach a precision of 0.94, a recall of 0.82, and an F1-score of 0.87 for the minority class, thus also confirming very good generalization and robustness between the metrics. For the sake of interpretability, SHAP and LIME were performed to catch the main contributing features. SHAP bee swarm and bar plots indicated that V14, V17, V12, and V10 were the most important variables for predicting fraud, while LIME gave case-level explanations of the feature-values pointing a transaction to being fraud or non-fraud. XGBoost achieved a robust performance in low-prevalence conditions represented by the Precision–Recall curve, where it retains a high precision over a large range of recall. In summary, these results indicate that ensemble classifiers and model-agnostic explain ability tools make a natural combination that leads to a powerful and transparent fraud detection approach. This provides the framework for the financial institutions to make fast yet reliable and interpretable decisions on a continuous basis. The possible extensions for the visual representation will cover the deep learning architectures, anomaly detection hybrids, and cost-sensitive learning to maximize the performance and minimize the false negative rate.

Downloads

Download data is not yet available.

References

11787.https://doi.org/10.3390/app152111787

A. Dal Pozzolo, G. Boracchi, O. Caelen, C. Alippi and G. Bontempi, "Credit Card Fraud Detection: A Realistic Modeling and a Novel Learning Strategy," in IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 8, pp. 3784-3797, Aug. 2018, https://doi.org/10.1109/TNNLS.2017.2736643

Rehman, Shoaib. "Credit Card Fraud Detection: Practical Applications and Challenges."

Ranjan, Nihar& Mate, Gitanjali & Jadhav, Archana & Patil, D. &Banubakode, A.. (2024). Credit Card Fraud Detection by Using Ensemble Method of Machine Learning. https://doi.org/10.1007/978-981-99-9521-9_34

Pundkar, Sumedh N., and MohdZubei. "Credit card fraud detection methods: A review." E3S Web of Conferences. Vol. 453. EDP Sciences, 2023.https://doi.org/10.1051/e3sconf/202345301015

Bin Sulaiman, Rejwan, Vitaly Schetinin, and Paul Sant. "Review of machine learning approach on credit card fraud detection." Human-Centric Intelligent Systems 2.1 (2022): 55-68.https://doi.org/10.1007/s44230-022-00004-0

Hajiabdollah, Niloofar, and Mehdi Sadeghzadeh. "A Review of Hybrid Deep Learning Approaches for Credit Card Fraud Detection." Available at SSRN 5129198.https://ssrn.com/abstract=5129198

Aslam, Farhan. "Advancing Credit Card Fraud Detection: A Review of Machine Learning Algorithms and the Power of Light Gradient Boosting." Am. J. Comput. Sci. Technol 7 (2024): 9-12.https://doi.org/10.11648/ajcst.20240701.12

Praveen Gugulothu , Shekhar Katukoori , Swapna Manuparthi "Deep Learning based techniques for Covid-19 diagnosis based on Various Pattern features detection from early stages of diseases”, Network Computation in Neural Systems.(Accepted May 2024, Indexing: SCIE, IF: 9, Publisher: Taylor & Francis).

Downloads

Published

30.11.2024

How to Cite

Pradeep Venuthurumilli. (2024). Integrating Ensemble Learning and Model-Agnostic Explain ability for Reliable Credit Card Fraud Prediction. International Journal of Intelligent Systems and Applications in Engineering, 12(23s), 4259–4266. Retrieved from https://www.ijisae.org/index.php/IJISAE/article/view/8248

Issue

Section

Research Article