Improving prediction of customer behavior in nonstationary environments


Abstract:

Customer churn, switching from one service provider to another, costs the wireless telecommunications industry $4 billion each year in North America and Europe. To proactively build lasting relationships with customers, it is thus crucial to predict customer behavior. Machine learning has been applied to churn prediction, using historical data such as usage, billing, customer service, and demographics. However, because customer behavior is often nonstationary, training a model based on data extracted from a window of time in the past yields poor performance on the present. We propose two distinct approaches, using more historical data or new, unlabeled data, to improve the results for this real-world, large-scale, nonstationary problem. A new ensemble classification method, with combination weights learned from both labeled and unlabeled data, is also proposed, and it outperforms bagging and mixture of experts.
Date of Conference: 15-19 July 2001
Date Added to IEEE Xplore: 07 August 2002
Print ISBN: 0-7803-7044-9
Print ISSN: 1098-7576
Conference Location: Washington, DC, USA

1 Introduction

Customer churn, the switching of customers from one service provider to another, destroys profits and decreases shareholder value. In the wireless telecommunications industry, the annual churn rate reaches 25% in Europe and 30% in the United States [1]. Signing up a new subscriber costs around five times as much as retaining an existing one; in most developed markets, acquiring a new customer costs an average of $300 to $400. In the mature markets of North America and Europe, churn is estimated to cost wireless service providers a combined total of more than $4 billion each year [1]. It is thus crucial to predict customer behavior, such as churn, in advance. Accurate prediction may allow a provider to forestall churn by proactively building lasting relationships with customers.

References

[1] "Battling Churn to Increase Shareholder Value: Wireless Challenge for the Future", Andersen Consulting Research Report, 2000.
[2] C. M. Bishop, "Neural Networks for Pattern Recognition", Oxford University Press, 1998.
[3] L. Breiman, "Bagging Predictors", Machine Learning, vol. 24, pp. 123-140, 1996.
[4] A. P. Dempster, N. M. Laird and D. B. Rubin, "Maximum-likelihood from Incomplete Data via the EM algorithm", Journal of the Royal Statistical Society, Series B, vol. 39, pp. 1-38, 1977.
[5] R. A. Jacobs, M. I. Jordan, S. J. Nowlan and G. E. Hinton, "Adaptive Mixture of Local Experts", Neural Computation, vol. 3, pp. 79-87, 1991.
[6] D. J. Miller and H. S. Uyar, "Combined Learning and Use for a Mixture Model Equivalent to the RBF Classifier", Neural Computation, vol. 10, pp. 281-293, 1998.
[7] D. J. Miller and H. S. Uyar, "A Mixture of Experts Classifier with Learning Based on Both Labelled and Unlabelled Data", Advances in Neural Information Processing Systems, vol. 9, pp. 571-577, 1996.
[8] J. Moody and C. J. Darken, "Fast Learning in Locally-tuned Processing Units", Neural Computation, vol. 1, pp. 281-294, 1989.
[9] M. C. Mozer, R. Wolniewicz, D. B. Grimes, E. Johnson and H. Kaushansky, "Predicting Subscriber Dissatisfaction and Improving Retention in the Wireless Telecommunications Industry", IEEE Transactions on Neural Networks, vol. 11, pp. 690-696, 2000.
[10] F. Provost, T. Fawcett and R. Kohavi, "The Case Against Accuracy Estimation for Comparing Induction Algorithms", Proceedings of the 15th International Conference on Machine Learning, 1998.
[11] B. D. Ripley, "Pattern Recognition and Neural Networks", Cambridge University Press, 1996.
[12] B. M. Shahshahani and D. A. Landgrebe, "The Effect of Unlabeled Samples in Reducing the Small Sample Size Problem and Mitigating the Hughes Phenomenon", IEEE Transactions on Geoscience and Remote Sensing, vol. 32, pp. 1087-1095, 1994.
[13] J. A. Swets and R. M. Pickett, "Evaluation of Diagnostic Systems: Methods from Signal Detection Theory", New York: Academic Press, 1982.