Abstract
In recent years, e-mail technology is prospering, bringing efficiency to people from all over the world. It is not limited to time and space, making the transmission of information more convenient. However, the emergence of spam has also brought people a lot of trouble. Thus, spam filtering research is necessary. Traditional spam filtering is mainly based on black and white list technology. Over the past decade, with the development of machine learning, Bayesian classifier has also come into use. However, support for Chinese mail has always been unsatisfactory. This paper proposes a Chinese spam filtering method based on suffix tree, which solves the problem of Chinese character processing and compares it with traditional methods from the aspects of time and space complexity and accuracy.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Clark, J., Koprinska, I., Poon, J.: A neural network based approach to automated e-mail classification. In: Proceedings of the IEEE/WIC International Conference on Web Intelligence, WI 2003, pp. 702–705. IEEE (2003)
Jianlong, T., Ji, Z., Li, G.: Method of spam filtering based on general suffix tree model. Comput. Eng. 33(9), 100–102 (2007)
Kim, J., Chung, K., Choi, K.: Spam filtering with dynamically updated URL statistics. IEEE Secur. Priv. 5(4), 33–39 (2007)
Schneider, K.M.: A comparison of event models for Naive Bayes anti-spam e-mail filtering. In: Proceedings of the Tenth Conference on European Chapter of the Association for Computational Linguistics, vol. 1, pp. 307–314. Association for Computational Linguistics (2003)
Takemura, T., Ebara, H.: Spam mail reduces economic effects. In: 2008 Second International Conference on the Digital Society, pp. 20–24. IEEE (2008)
Pampapathi, R., Mirkin, B., Levene, M.: A suffix tree approach to anti-spam email filtering. Mach. Learn. 65(1), 309–338 (2006)
Firte, L., Lemnaru, C., Potolea, R.: Spam detection filter using KNN algorithm and resampling. In: 2010 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP), pp. 27–33. IEEE (2010)
McCreight, E.M.A.: Space-economical suffix tree construction algorithm. J. ACM (JACM) 23(2), 262–272 (1976)
Ukkonen, E.: On-line construction of suffix trees. Algorithmica 14(3), 249–260 (1995)
http://spamassassin.apache.org/. Apache SpamAssasin
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Hu, R., Yang, Y. (2018). Spam Mail Filtering Method Based on Suffix Tree. In: Barolli, L., Zhang, M., Wang, X. (eds) Advances in Internetworking, Data & Web Technologies. EIDWT 2017. Lecture Notes on Data Engineering and Communications Technologies, vol 6. Springer, Cham. https://doi.org/10.1007/978-3-319-59463-7_43
Download citation
DOI: https://doi.org/10.1007/978-3-319-59463-7_43
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59462-0
Online ISBN: 978-3-319-59463-7
eBook Packages: EngineeringEngineering (R0)