Skip to main content

Parameter Optimization of Local-Concentration Model for Spam Detection by Using Fireworks Algorithm

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7928))

Abstract

This paper proposes a new framework that optimizes anti-spam model with heuristic swarm intelligence optimization algorithms, and this framework could integrate various classifiers and feature extraction methods. In this framework, a swarm intelligence algorithm is utilized to optimize a parameter vector, which is composed of parameters of a feature extraction method and parameters of a classifier, considering the spam detection problem as an optimization process which aims to achieve the lowest error rate. Also, 2 experimental strategies were designed to objectively reflect the performance of the framework. Then, experiments were conducted, using the Fireworks Algorithm (FWA) as the swarm intelligence algorithm, the Local Concentration (LC) approach as the feature extraction method, and SVM as the classifier. Experimental results demonstrate that the framework improves the performance on the corpora PU1, PU2, PU3 and PUA, while the computational efficiency is applicable in real world.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Commtouch,: Internet threats trend report-February 2013. Tech. rep. (2013)

    Google Scholar 

  2. Cost of spam, http://www.ferris.com/2009/01/28/cost-of-spam-is-flattening-our-2009-predictions/

  3. Drucker, H., Wu, D., Vapnik, V.N.: Support vector machines for spam categorization. IEEE Transactions on Neural Network 10, 1048–1054 (1999)

    Article  Google Scholar 

  4. Ruan, G., Tan, Y.: Intelligent detection approaches for spam. In: Proceedings of International Conference on Natural Computation, pp. 1–7 (2007)

    Google Scholar 

  5. Bickel, S., Scheffer, T.: Dirichlet-enhanced spam filtering based on biased samples. Adv. Neural Inf. Process. Syst. 19, 161–168 (2007)

    Google Scholar 

  6. Kanaris, I., Kanaris, K., Houvardas, I., Stamatatos, E.: Words versus character N-grams for anti-spam filtering. Int. J. Artif. Intell. T. 16(6), 1047–1067 (2007)

    Article  Google Scholar 

  7. Zhu, Y.C., Tan, Y.: A local-concentration-based feature extraction approach for spam filtering. IEEE Transactions on Information Forensics and Security 6(2), 1–12 (2011)

    Article  Google Scholar 

  8. Information gain, http://en.wikipedia.org/wiki/Information_gain

  9. Koprinska, I., Poon, J., Clark, J., Chan, J.: Learning to classify e-mail. Inform. Sci. 177, 2167–2187 (2007)

    Article  Google Scholar 

  10. Tan, Y., Zhu, Y.: Fireworks algorithm for optimization. In: Tan, Y., Shi, Y., Tan, K.C. (eds.) ICSI 2010, Part I. LNCS, vol. 6145, pp. 355–364. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  11. Dasgupta, D.: Advances in artificial immune systems. IEEE Computational Intelligence Magazine, 40–49 (2006)

    Google Scholar 

  12. Guzella, T.S., Caminhas, M.: A review of machine learning approaches to spam filtering. Expert Syst. Appl. 36, 10206–10222 (2009)

    Article  Google Scholar 

  13. Blanzieri, E., Bryl, A.: A Survey of Learning-Based Techniques of e-mail Spam Filtering. Tech. Rep. 1 DIT-06-065 (2008)

    Google Scholar 

  14. Timmis, J.: Artificial immune systems—today and tomorrow. Nat. Comput., 1–18 (2007)

    Google Scholar 

  15. Oda, T., White, T.: Developing an immunity to spam. In: Cantú-Paz, E., et al. (eds.) GECCO 2003. LNCS, vol. 2723, pp. 231–242. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  16. Tan, Y., Deng, C., Ruan, G.: Concentration based feature construction approach for spam detection. In: Proceedings of International Joint Conference on Neural Networks, pp. 3088–3093 (2009)

    Google Scholar 

  17. Ruan, G., Tan, Y.: Intelligent detection approaches for spam. In: Proceedings of International Conference on Natural Computation, pp. 1–7 (2007)

    Google Scholar 

  18. Tan, Y.: Multiple-point bit mutation method of detector generation for SNSD model . In: Wang, J., Yi, Z., Żurada, J.M., Lu, B.-L., Yin, H. (eds.) ISNN 2006. LNCS, vol. 3973, pp. 340–345. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  19. Tan, Y., Xiao, Z.: Clonal particle swarm optimization and its applications. In: Proceedings of IEEE Congress on Evolutionary Computation, pp. 2303–2309 (2007)

    Google Scholar 

  20. Tan, Y., Wang, J.: A support vector network with hybrid kernel and minimal vapnik-chervonenkis dimension. IEEE Trans. Knowl. Data Eng. 26, 385–395 (2004)

    Google Scholar 

  21. Stuart, I., Cha, S.-H., Tappert, C.C.: A neural network classifier for junk E-mail. In: Marinai, S., Dengel, A.R. (eds.) DAS 2004. LNCS, vol. 3163, pp. 442–450. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  22. Zhu, Y., Tan, Y.: A danger theory inspired learning model and its application to spam detection. In: Tan, Y., Shi, Y., Chai, Y., Wang, G. (eds.) ICSI 2011, Part I. LNCS, vol. 6728, pp. 382–389. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  23. Ruan, G., Tan, Y.: A three-layer back-propagation neural network for spam detection using artificial immune concentration. Soft Comput. 14, 139–150 (2010)

    Article  Google Scholar 

  24. Zhu, Y., Tan, Y.: Extracting discriminative information from E-mail for spam detection inspired by immune system. In: Proceedings of IEEE Congress on Evolutionary Computation, pp. 2491–2497 (2010)

    Google Scholar 

  25. Wu, C.-H.: Behavior-based spam detection using a hybrid method of rule-based techniques and neural networks. Expert Syst. Appl. 36, 4321–4330 (2009)

    Article  Google Scholar 

  26. Siefkes, C., Assis, F., Chhabra, S., Yerazunis, W.S.: Combining winnow and orthogonal sparse bigrams for incremental spam filtering. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 410–421. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

He, W., Mi, G., Tan, Y. (2013). Parameter Optimization of Local-Concentration Model for Spam Detection by Using Fireworks Algorithm. In: Tan, Y., Shi, Y., Mo, H. (eds) Advances in Swarm Intelligence. ICSI 2013. Lecture Notes in Computer Science, vol 7928. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38703-6_52

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38703-6_52

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38702-9

  • Online ISBN: 978-3-642-38703-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics