A Customer Churn Prediction Model using XGBoost for the Telecommunication Industry in Nepal
S. Shrestha, and A. Shakya. Procedia Computer Science. Proceedings of the 4th International Conference on Innovative Data Communication Technology and Application (ICIDCA 2022), Coimbatore, India, 215, page 652--661. Elsevier, (2022)
DOI: 10.1016/j.procs.2022.12.067
Abstract
Telecommunication industry is one of the major sectors which is at higher risk of losing revenue due to customer churn. Thus, when churn management is done effectively, it provides a competitive advantage to the telecom company over its competitors by increasing customer retention rate. Although many machine learning algorithms exist today, few algorithms are effective to consider the imbalanced nature of the telecommunication's dataset. The real telecommunication data also varies differently from the publicly available dataset and hence the effectiveness of machine learning algorithms may vary differently. Therefore, this research has tried to bridge this gap by undertaking native dataset of one of the major Telecommunications Industry of Nepal and applying XGBoost on this dataset which contains 52332 records of customers, out of which 46204 are non-churned and 6128 are churned customers. The accuracy and f1-score obtained on the native dataset are 97% and 88% respectively. This research work has also undertaken publicly available dataset that contains 3333 subscribers for the purpose of comparison with previous researches and obtained an improved accuracy and f1-score of 96.25% and 86.34% respectively.
Procedia Computer Science. Proceedings of the 4th International Conference on Innovative Data Communication Technology and Application (ICIDCA 2022), Coimbatore, India
year
2022
pages
652--661
publisher
Elsevier
volume
215
venue
Coimbatore, India
eventdate
3-4, November 2022
eventtitle
4th International Conference on Innovative Data Communication Technologies and Application (ICIDCA 2022)
%0 Conference Paper
%1 shrestha2022customer
%A Shrestha, Sagar Maan
%A Shakya, Aman
%B Procedia Computer Science. Proceedings of the 4th International Conference on Innovative Data Communication Technology and Application (ICIDCA 2022), Coimbatore, India
%D 2022
%I Elsevier
%K imported myown
%P 652--661
%R 10.1016/j.procs.2022.12.067
%T A Customer Churn Prediction Model using XGBoost for the Telecommunication Industry in Nepal
%U https://doi.org/10.1016/j.procs.2022.12.067
%V 215
%X Telecommunication industry is one of the major sectors which is at higher risk of losing revenue due to customer churn. Thus, when churn management is done effectively, it provides a competitive advantage to the telecom company over its competitors by increasing customer retention rate. Although many machine learning algorithms exist today, few algorithms are effective to consider the imbalanced nature of the telecommunication's dataset. The real telecommunication data also varies differently from the publicly available dataset and hence the effectiveness of machine learning algorithms may vary differently. Therefore, this research has tried to bridge this gap by undertaking native dataset of one of the major Telecommunications Industry of Nepal and applying XGBoost on this dataset which contains 52332 records of customers, out of which 46204 are non-churned and 6128 are churned customers. The accuracy and f1-score obtained on the native dataset are 97% and 88% respectively. This research work has also undertaken publicly available dataset that contains 3333 subscribers for the purpose of comparison with previous researches and obtained an improved accuracy and f1-score of 96.25% and 86.34% respectively.
@inproceedings{shrestha2022customer,
abstract = {Telecommunication industry is one of the major sectors which is at higher risk of losing revenue due to customer churn. Thus, when churn management is done effectively, it provides a competitive advantage to the telecom company over its competitors by increasing customer retention rate. Although many machine learning algorithms exist today, few algorithms are effective to consider the imbalanced nature of the telecommunication's dataset. The real telecommunication data also varies differently from the publicly available dataset and hence the effectiveness of machine learning algorithms may vary differently. Therefore, this research has tried to bridge this gap by undertaking native dataset of one of the major Telecommunications Industry of Nepal and applying XGBoost on this dataset which contains 52332 records of customers, out of which 46204 are non-churned and 6128 are churned customers. The accuracy and f1-score obtained on the native dataset are 97% and 88% respectively. This research work has also undertaken publicly available dataset that contains 3333 subscribers for the purpose of comparison with previous researches and obtained an improved accuracy and f1-score of 96.25% and 86.34% respectively.},
added-at = {2023-03-07T12:13:40.000+0100},
author = {Shrestha, Sagar Maan and Shakya, Aman},
biburl = {https://www.bibsonomy.org/bibtex/27302ad460668466f404ef021e45abaab/amanshakya},
booktitle = {Procedia Computer Science. Proceedings of the 4th International Conference on Innovative Data Communication Technology and Application (ICIDCA 2022), Coimbatore, India},
doi = {10.1016/j.procs.2022.12.067},
eventdate = {3-4, November 2022},
eventtitle = {4th International Conference on Innovative Data Communication Technologies and Application (ICIDCA 2022)},
interhash = {4867b35902518e31dc2b8b71bed99ef1},
intrahash = {7302ad460668466f404ef021e45abaab},
issn = {1877-0509},
keywords = {imported myown},
pages = {652--661},
publisher = {Elsevier},
timestamp = {2023-04-06T07:48:47.000+0200},
title = {A Customer Churn Prediction Model using XGBoost for the Telecommunication Industry in Nepal},
url = {https://doi.org/10.1016/j.procs.2022.12.067},
venue = {Coimbatore, India},
volume = 215,
year = 2022
}