Skip to main content

Classification of Customer Call Data in the Presence of Concept Drift and Noise

  • Conference paper
  • First Online:
Soft-Ware 2002: Computing in an Imperfect World (Soft-Ware 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2311))

Abstract

Many of today’s real world domains require online classification tasks in very demanding situations. This work presents the results of applying the CD3 algorithm to telecommunications call data. CD3 enables the detection of concept drift in the presence of noise within real time data. The application detects the drift using a TSAR methodology and applies a purging mechanism as a corrective action. The main focus of this work is to identify from customer files and call records if the profile of customers registering for a ‘friends and family’ service is changing over a period of time. We will begin with a review of the CD3 application and the presentation of the data. This will conclude with experimental results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Black, M., Hickey, R.J.: Maintaining the Performance of a Learned Classifier under Concept Drift. Intelligent Data Analysis 3 (1999) 453–474

    Article  Google Scholar 

  2. Hickey, R.J., Black, M.,: Refined Time Stamps for Concept Drift Detection during Mining for Classification Rules. Spatio-Temporal Data Mining-TSDM2000, published in LNAI 2007, Springer-Verlag.

    Google Scholar 

  3. Hickey, R.J., 1996, Noise Modelling and Evaluating Learning from Examples. Artificial Intelligence, 82, pp157–179.

    Google Scholar 

  4. Kelly, M.G., Hand, D.J., Adams, N.M.: The Impact of Changing Populations on Classifier Performance. In: Chaudhuri, S., Madigan, D. (eds.): Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, New York (1999) 367–371

    Google Scholar 

  5. Klenner, M., Hahn, U.: (1994). Concept Versioning: A Methodology for Tracking Evolutionary Concept Drift in Dynamic Concept Systems. In: Proceedings of Eleventh European Conference on Artificial Intelligence, Wiley, Chichester, England, 473–477

    Google Scholar 

  6. Schlimmer, J.C., Granger, R.H.: Incremental Learning from Noisy Data. Machine Learning 1 (1986) 317–354

    Google Scholar 

  7. Hembold, D.P., Long, P.M.: Tracking Drifting Concepts by Minimising Disagreements. Machine Learning 14 (1994) 27–45

    Google Scholar 

  8. Hulten, G., Spencer, L., Domingos, P., 2001, Mining Time-Changing Data Streams, Proceedings of the Seventh International Conference on Knowledge Discovery and Data Mining.

    Google Scholar 

  9. Widmer, G.: Tracking Changes through Meta-Learning. Machine Learning 27 (1997) 259–286

    Article  Google Scholar 

  10. Widmer, G., Kubat, M.: Learning in the Presence of Concept Drift and Hidden Contexts. Machine Learning 23 (1996) 69–101

    Google Scholar 

  11. Chakrabarti, S., Sarawagi, S., Dom, B.: Mining Surprising Patterns using Temporal Description Length. In: Gupta, A., Shmueli, O., Widom, J. (eds.): Proceedings of the Twenty-Fourth International Conference on Very Large databases. Morgan Kaufmann, San Mateo, California (1998) 606–61

    Google Scholar 

  12. Chen, X., Petrounias, I.: Mining Temporal Features in Association Rules. In: Zytkow, J., Rauch, J. (eds,): Proceedings. of the Third European Conference on Principles and Practice of Knowledge Discovery in Databases. Lecture Notes in Artificial Intelligence, Vol. 1704. Springer-Verlag, Berlin Heidelberg New York (1999) 295–300

    Google Scholar 

  13. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, California (1993)

    Google Scholar 

  14. Utgoff, P.E., 1997, Decision Tree Induction Based on Efficient Tree Restructuring, Machine Learning 29(1): 5–44.

    Google Scholar 

  15. Clark, P. and Boswell, R. 1991. Rule Induction with CN2: Some Recent Improvements, in Proceedings of the European Workshop on Learning (EWSL-91). 151–163. Berlin: Springer-Verlag.

    Google Scholar 

  16. Bratko, I. 1990. Prolog programming for artificial intelligence. Wokingham: Addison-Wesley.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Black, M., Hickey, R. (2002). Classification of Customer Call Data in the Presence of Concept Drift and Noise. In: Bustard, D., Liu, W., Sterritt, R. (eds) Soft-Ware 2002: Computing in an Imperfect World. Soft-Ware 2002. Lecture Notes in Computer Science, vol 2311. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46019-5_6

Download citation

  • DOI: https://doi.org/10.1007/3-540-46019-5_6

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43481-8

  • Online ISBN: 978-3-540-46019-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics