Parameter Optimization of Local-Concentration Model for Spam Detection by Using Fireworks Algorithm

He, Wenrui; Mi, Guyue; Tan, Ying

doi:10.1007/978-3-642-38703-6_52

Parameter Optimization of Local-Concentration Model for Spam Detection by Using Fireworks Algorithm

Wenrui He^19,20,
Guyue Mi^19,20 &
Ying Tan^19,20

Conference paper

2880 Accesses
44 Citations
3 Altmetric

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7928))

Abstract

This paper proposes a new framework that optimizes anti-spam model with heuristic swarm intelligence optimization algorithms, and this framework could integrate various classifiers and feature extraction methods. In this framework, a swarm intelligence algorithm is utilized to optimize a parameter vector, which is composed of parameters of a feature extraction method and parameters of a classifier, considering the spam detection problem as an optimization process which aims to achieve the lowest error rate. Also, 2 experimental strategies were designed to objectively reflect the performance of the framework. Then, experiments were conducted, using the Fireworks Algorithm (FWA) as the swarm intelligence algorithm, the Local Concentration (LC) approach as the feature extraction method, and SVM as the classifier. Experimental results demonstrate that the framework improves the performance on the corpora PU1, PU2, PU3 and PUA, while the computational efficiency is applicable in real world.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Commtouch,: Internet threats trend report-February 2013. Tech. rep. (2013)
Google Scholar
Cost of spam, http://www.ferris.com/2009/01/28/cost-of-spam-is-flattening-our-2009-predictions/
Drucker, H., Wu, D., Vapnik, V.N.: Support vector machines for spam categorization. IEEE Transactions on Neural Network 10, 1048–1054 (1999)
Article Google Scholar
Ruan, G., Tan, Y.: Intelligent detection approaches for spam. In: Proceedings of International Conference on Natural Computation, pp. 1–7 (2007)
Google Scholar
Bickel, S., Scheffer, T.: Dirichlet-enhanced spam filtering based on biased samples. Adv. Neural Inf. Process. Syst. 19, 161–168 (2007)
Google Scholar
Kanaris, I., Kanaris, K., Houvardas, I., Stamatatos, E.: Words versus character N-grams for anti-spam filtering. Int. J. Artif. Intell. T. 16(6), 1047–1067 (2007)
Article Google Scholar
Zhu, Y.C., Tan, Y.: A local-concentration-based feature extraction approach for spam filtering. IEEE Transactions on Information Forensics and Security 6(2), 1–12 (2011)
Article Google Scholar
Information gain, http://en.wikipedia.org/wiki/Information_gain
Koprinska, I., Poon, J., Clark, J., Chan, J.: Learning to classify e-mail. Inform. Sci. 177, 2167–2187 (2007)
Article Google Scholar
Tan, Y., Zhu, Y.: Fireworks algorithm for optimization. In: Tan, Y., Shi, Y., Tan, K.C. (eds.) ICSI 2010, Part I. LNCS, vol. 6145, pp. 355–364. Springer, Heidelberg (2010)
Chapter Google Scholar
Dasgupta, D.: Advances in artificial immune systems. IEEE Computational Intelligence Magazine, 40–49 (2006)
Google Scholar
Guzella, T.S., Caminhas, M.: A review of machine learning approaches to spam filtering. Expert Syst. Appl. 36, 10206–10222 (2009)
Article Google Scholar
Blanzieri, E., Bryl, A.: A Survey of Learning-Based Techniques of e-mail Spam Filtering. Tech. Rep. 1 DIT-06-065 (2008)
Google Scholar
Timmis, J.: Artificial immune systems—today and tomorrow. Nat. Comput., 1–18 (2007)
Google Scholar
Oda, T., White, T.: Developing an immunity to spam. In: Cantú-Paz, E., et al. (eds.) GECCO 2003. LNCS, vol. 2723, pp. 231–242. Springer, Heidelberg (2003)
Chapter Google Scholar
Tan, Y., Deng, C., Ruan, G.: Concentration based feature construction approach for spam detection. In: Proceedings of International Joint Conference on Neural Networks, pp. 3088–3093 (2009)
Google Scholar
Ruan, G., Tan, Y.: Intelligent detection approaches for spam. In: Proceedings of International Conference on Natural Computation, pp. 1–7 (2007)
Google Scholar
Tan, Y.: Multiple-point bit mutation method of detector generation for SNSD model . In: Wang, J., Yi, Z., Żurada, J.M., Lu, B.-L., Yin, H. (eds.) ISNN 2006. LNCS, vol. 3973, pp. 340–345. Springer, Heidelberg (2006)
Chapter Google Scholar
Tan, Y., Xiao, Z.: Clonal particle swarm optimization and its applications. In: Proceedings of IEEE Congress on Evolutionary Computation, pp. 2303–2309 (2007)
Google Scholar
Tan, Y., Wang, J.: A support vector network with hybrid kernel and minimal vapnik-chervonenkis dimension. IEEE Trans. Knowl. Data Eng. 26, 385–395 (2004)
Google Scholar
Stuart, I., Cha, S.-H., Tappert, C.C.: A neural network classifier for junk E-mail. In: Marinai, S., Dengel, A.R. (eds.) DAS 2004. LNCS, vol. 3163, pp. 442–450. Springer, Heidelberg (2004)
Chapter Google Scholar
Zhu, Y., Tan, Y.: A danger theory inspired learning model and its application to spam detection. In: Tan, Y., Shi, Y., Chai, Y., Wang, G. (eds.) ICSI 2011, Part I. LNCS, vol. 6728, pp. 382–389. Springer, Heidelberg (2011)
Chapter Google Scholar
Ruan, G., Tan, Y.: A three-layer back-propagation neural network for spam detection using artificial immune concentration. Soft Comput. 14, 139–150 (2010)
Article Google Scholar
Zhu, Y., Tan, Y.: Extracting discriminative information from E-mail for spam detection inspired by immune system. In: Proceedings of IEEE Congress on Evolutionary Computation, pp. 2491–2497 (2010)
Google Scholar
Wu, C.-H.: Behavior-based spam detection using a hybrid method of rule-based techniques and neural networks. Expert Syst. Appl. 36, 4321–4330 (2009)
Article Google Scholar
Siefkes, C., Assis, F., Chhabra, S., Yerazunis, W.S.: Combining winnow and orthogonal sparse bigrams for incremental spam filtering. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 410–421. Springer, Heidelberg (2004)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Key Laboratory of Machine Perception, Ministry of Education, China
Wenrui He, Guyue Mi & Ying Tan
Department of Machine Intelligence, School of Electronics Engineering and Computer Science, Peking University, Beijing, 100871, P.R. China
Wenrui He, Guyue Mi & Ying Tan

Authors

Wenrui He
View author publications
You can also search for this author in PubMed Google Scholar
Guyue Mi
View author publications
You can also search for this author in PubMed Google Scholar
Ying Tan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Key Laboratory of Machine Perception (MOE), School of lectronics Engineering and Computer Science, Department of Machine Intelligence, Peking University, 100871, Beijing, China
Ying Tan
Department of Electrical and Electronic Engineering, Xi’an Jiaotong-Liverpool University, 215123, Suzhou, China
Yuhui Shi
Automation College, Harbin Engineering University, 150001, Harbin, China
Hongwei Mo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

He, W., Mi, G., Tan, Y. (2013). Parameter Optimization of Local-Concentration Model for Spam Detection by Using Fireworks Algorithm. In: Tan, Y., Shi, Y., Mo, H. (eds) Advances in Swarm Intelligence. ICSI 2013. Lecture Notes in Computer Science, vol 7928. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38703-6_52

Download citation

DOI: https://doi.org/10.1007/978-3-642-38703-6_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38702-9
Online ISBN: 978-3-642-38703-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics