An Ensemble Learning Approach to Enhance Customer Churn Prediction in Telecom Industry
Keywords:
Ensemble learning, customer churn prediction, telecom industry, classification algorithms, feature engineering, performance evaluationAbstract
The phenomenon of customer churn, which occurs when a customer leaves a service provider for another, is a major challenge for businesses in the telecommunications industry. Accurate predictions of this issue can help them improve their profitability and reduce their customer attrition. This study aims to use the data collected by Orange to improve the prediction accuracy of this issue. The study begins by reviewing the Telecom Churn dataset, which contains details about the customers such as their demographics and usage patterns. It also has a label that indicates whether or not the customer has churned. Through exploratory data gathering, we can identify correlations, patterns, and possible predictive elements that can be utilized in the prediction of churn. The goal of this study is to develop an ensemble learning framework that combines multiple classifiers. The framework is composed of various classification algorithms- Stochastic Gradient Boost(SGD), Random Forests (RF), Gradient Boosting (GB) and AdaBoost. We also test the performance of these through cross-validation techniques. The goal of this research is to improve the accuracy of its predictions by capturing more accurate information about the behavior of customers. The evaluation of the ensembles involves using different performance metrics, such as accuracy, recall, and F1 score. According to our experiments, the ensemble learning GB outperforms the other classifiers when it comes to predicting the likelihood of a customer leaving a service provider. By incorporating the base classifiers' predictions, the ensembles were able to achieve a robust and accurate prediction. This method can help businesses identify potential churners and implement effective retention strategies. The findings of this study demonstrate the utility of ensembles in improving the accuracy of churn prediction models for telecommunications companies. These findings can be utilized to develop more reliable and accurate churn prediction models, which can help improve the customer retention rate and enhance the business performance of service providers.
Downloads
References
A. K. Ahmad, A. Jafar, and K. Aljoumaa, “Customer churn prediction in telecom using machine learning in big data platform,” J. Big Data, vol. 6, no. 1, 2019, doi: 10.1186/s40537-019-0191-6.
A. A. Q. Ahmed and D. Maheswari, “Churn prediction on huge telecom data using hybrid firefly based classification Churn prediction on huge telecom data,” Egypt. Informatics J., vol. 18, no. 3, pp. 215–220, 2017, doi: 10.1016/j.eij.2017.02.002.
J. Vijaya and E. Sivasankar, “An efficient system for customer churn prediction through particle swarm optimization based feature selection model with simulated annealing,” Cluster Comput., vol. 22, no. s5, pp. 10757–10768, 2019, doi: 10.1007/s10586-017-1172-1.
H. Jain, A. Khunteta, and S. Srivastava, “Churn Prediction in Telecommunication using Logistic Regression and Logit Boost,” Procedia Comput. Sci., vol. 167, no. 2019, pp. 101–112, 2020, doi: 10.1016/j.procs.2020.03.187.
D. Das Adhikary and D. Gupta, “Applying over 100 classifiers for churn prediction in telecom companies,” Multimed. Tools Appl., vol. 80, no. 28–29, pp. 35123–35144, 2021, doi: 10.1007/s11042-020-09658-z.
A. Idris, M. Rizwan, and A. Khan, “Churn prediction in telecom using Random Forest and PSO based data balancing in combination with various feature selection strategies,” Comput. Electr. Eng., vol. 38, no. 6, pp. 1808–1819, 2012, doi: 10.1016/j.compeleceng.2012.09.001.
L. Zhao, Q. Gao, X. J. Dong, A. Dong, and X. Dong, “K- local maximum margin feature extraction algorithm for churn prediction in telecom,” Cluster Comput., vol. 20, no. 2, pp. 1401–1409, 2017, doi: 10.1007/s10586-017-0843-2.
D. L. García, À. Nebot, and A. Vellido, “Intelligent data analysis approaches to churn as a business problem: a survey,” Knowl. Inf. Syst., vol. 51, no. 3, pp. 719–774, 2017, doi: 10.1007/s10115-016-0995-z.
A. Idris, A. Iftikhar, and Z. ur Rehman, “Intelligent churn prediction for telecom using GP-AdaBoost learning and PSO undersampling,” Cluster Comput., vol. 22, no. s3, pp. 7241–7255, 2019, doi: 10.1007/s10586-017-1154-3.
K. Lu, X. Zhao, and B. Wang, “A Study on Mobile Customer Churn Based on Learning from Soft Label Proportions,” Procedia Comput. Sci., vol. 162, no. Itqm 2019, pp. 413–420, 2019, doi: 10.1016/j.procs.2019.12.005.
W. N. Wassouf, R. Alkhatib, K. Salloum, and S. Balloul, “Predictive analytics using big data for increased customer loyalty: Syriatel Telecom Company case study,” J. Big Data, vol. 7, no. 1, 2020, doi: 10.1186/s40537-020-00290-0.
P. Sulikowski and T. Zdziebko, “Churn factors identification from real-world data in the telecommunications industry: Case study,” Procedia Comput. Sci., vol. 192, pp. 4800–4809, 2021, doi: 10.1016/j.procs.2021.09.258.
T. W. Cenggoro, R. A. Wirastari, E. Rudianto, M. I. Mohadi, D. Ratj, and B. Pardamean, “Deep Learning as a Vector Embedding Model for Customer Churn,” Procedia Comput. Sci., vol. 179, no. 2019, pp. 624–631, 2021, doi: 10.1016/j.procs.2021.01.048.
Y. Liu, J. Fan, J. Zhang, X. Yin, and Z. Song, “Research on telecom customer churn prediction based on ensemble learning,” J. Intell. Inf. Syst., no. 0123456789, 2022, doi: 10.1007/s10844-022-00739-z.
M. T. Quasim, A. Sulaiman, A. Shaikh, and M. Younus, “Blockchain in churn prediction based telecommunication system on climatic weather application,” Sustain. Comput. Informatics Syst., vol. 35, no. December 2021, p. 100705, 2022, doi: 10.1016/j.suscom.2022.100705.
S. M. Shrestha and A. Shakya, “A Customer Churn Prediction Model using XGBoost for the Telecommunication Industry in Nepal,” Procedia Comput. Sci., vol. 215, pp. 652–661, 2022, doi: 10.1016/j.procs.2022.12.067.
A. Amin, A. Adnan, and S. Anwar, “An adaptive learning approach for customer churn prediction in the telecommunication industry using evolutionary computation and Naïve Bayes,” Appl. Soft Comput., vol. 137, p. 110103, 2023, doi: 10.1016/j.asoc.2023.110103.
Orange, “Telecom Churn Dataset | Kaggle.” [Online]. Available: https://www.kaggle.com/mnassrib/telecom-churn-datasets.
Purnima, T., & Rao, C. K. . (2023). CROD: Context Aware Role based Offensive Detection using NLP/ DL Approaches. International Journal on Recent and Innovation Trends in Computing and Communication, 11(1), 01–11. https://doi.org/10.17762/ijritcc.v11i1.5981
Brown, R., Brown, J., Rodriguez, C., Garcia, J., & Herrera, J. Predictive Analytics for Effective Resource Allocation in Engineering Education. Kuwait Journal of Machine Learning, 1(1). Retrieved from http://kuwaitjournals.com/index.php/kjml/article/view/91
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.