Skip to main content

Adaptive Ant Colony Decision Forest in Automatic Categorization of Emails

  • Conference paper
  • First Online:
Intelligent Information and Database Systems (ACIIDS 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9011))

Included in the following conference series:

Abstract

In this article an approach to the automatic classification of email messages in mailboxes has been proposed. The aim of this paper is to devise methods to build decision tables from the collection of email messages on which it is possible to build Ant Colony Optimization-based ensemble classifiers, whose application allows to use the collection of emails without cleaning, at the same time improving the accuracy of the email folders classification. The proposed method has been tested by the selected algorithms on the Enron Email Dataset. The results confirm that the proposed solutions allows to improve the accuracy of classification of new emails to folders.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bekkerman, R., McCallum, A., Huang, G.: Automatic categorization of email into folders: Benchmark experiments on enron and sri corpora. Center for Intelligent Information Retrieval, Technical report IR (2004)

    Google Scholar 

  2. Boryczka, U., Kozak, J.: Ant colony decision trees – a new method for constructing decision trees based on ant colony optimization. In: Pan, J.-S., Chen, S.-M., Nguyen, N.T. (eds.) ICCCI 2010, Part I. LNCS, vol. 6421, pp. 373–382. Springer, Heidelberg (2010)

    Google Scholar 

  3. Boryczka, U., Probierz, B., Kozak, J.: An ant colony optimization algorithm for an automatic categorization of emails. In: Hwang, D., Jung, J.J., Nguyen, N.-T. (eds.) ICCCI 2014. LNCS, vol. 8733, pp. 583–592. Springer, Heidelberg (2014)

    Google Scholar 

  4. Boryczka, U., Kozak, J.: Ant colony decision forest meta-ensemble. In: Nguyen, N.-T., Hoang, K., Jȩdrzejowicz, P. (eds.) ICCCI 2012, Part II. LNCS, vol. 7654, pp. 473–482. Springer, Heidelberg (2012)

    Google Scholar 

  5. Boryczka, U., Kozak, J.: On-the-Go adaptability in the new ant colony decision forest approach. In: Nguyen, N.T., Attachoo, B., Trawiński, B., Somboonviwat, K. (eds.) ACIIDS 2014, Part II. LNCS, vol. 8398, pp. 157–166. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  6. Breiman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996)

    MATH  MathSciNet  Google Scholar 

  7. Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)

    Article  MATH  Google Scholar 

  8. Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Chapman and Hall, New York (1984)

    MATH  Google Scholar 

  9. Bühlmann, P., Hothorn, T.: Boosting algorithms: Regularization, prediction and model fitting. Statistical Science 22(4), 477–505 (2007)

    Article  MATH  MathSciNet  Google Scholar 

  10. Doerner, K.F., Merkle, D., Stützle, T.: Special issue on ant colony optimization. Swarm Intelligence 3(1), 1–2 (2009)

    Article  Google Scholar 

  11. Dorigo, M., Caro, G.D., Gambardella, L.: Ant algorithms for distributed discrete optimization. Artif. Life 5(2), 137–172 (1999)

    Article  Google Scholar 

  12. Dorigo, M., Stützle, T.: Ant Colony Optimization. MIT Press, Cambridge (2004)

    Book  MATH  Google Scholar 

  13. Efron, B.: Bootstrap methods: Another look at the jackknife. The Annals of Statistics 7(1), 1–26 (1979)

    Article  MATH  MathSciNet  Google Scholar 

  14. Freund, Y., Schapire, R.E.: Experiments with a new boosting algorithm. In: International Conference on Machine Learning, pp. 148–156 (1996)

    Google Scholar 

  15. Grasse, P.P.: Termitologia, vol. II. Masson, Paris (1984)

    Google Scholar 

  16. Kiritchenko, S., Matwin, S.: Email classification with co-training. University of Ottawa, Technical report (2002)

    Google Scholar 

  17. Lewis, D.D.: Representation and Learning in Information Retrieval. Ph.D. thesis, Department of Computer Science, University of Massachusetts (1992)

    Google Scholar 

  18. Rudin, C., Schapire, R.E.: Margin-based ranking and an equivalence between AdaBoost and RankBoost. J. Mach. Learn. Res. 10, 2193–2232 (2009)

    MATH  MathSciNet  Google Scholar 

  19. Schapire, R.E.: The strength of weak learnability. Machine Learning 5, 197–227 (1990)

    Google Scholar 

  20. Wang, M., He, Y., Jiang, M.: Text categorization of enron email corpus based on information bottleneck and maximal entropy (2010)

    Google Scholar 

  21. Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques, 3rd edn. Morgan Kaufmann Publishers Inc. (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Barbara Probierz .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Boryczka, U., Probierz, B., Kozak, J. (2015). Adaptive Ant Colony Decision Forest in Automatic Categorization of Emails. In: Nguyen, N., Trawiński, B., Kosala, R. (eds) Intelligent Information and Database Systems. ACIIDS 2015. Lecture Notes in Computer Science(), vol 9011. Springer, Cham. https://doi.org/10.1007/978-3-319-15702-3_44

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-15702-3_44

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-15701-6

  • Online ISBN: 978-3-319-15702-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics