Abstract
Spam mail classification and filtering is a commonly investigated problem, yet there has been little research into the application of nearest neighbour classifiers in this field. This paper examines the possibility of using a nearest neighbour algorithm for simple, word based spam mail classification. This approach is compared to a neural network, and decision-tree along with results published in another conference paper on the subject.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Massey, B., Thomure, M., Budrevich, R., Long, S.: Learning spam: Simple techniques for freely-available software. Usenix 2003, Freenix Track (2003)
Mount, D., Arya, S.: Ann: A library for approximate nearest neighbor searching (1997)
Arya, S., Mount, D.M., Netanyahu, N.S., Silverman, R., Wu, A.Y.: An optimal algorithm for approximate nearest neighbor searching in fixed dimensions. Journal of the ACM 45, 891–923 (1998)
Justin, M.: Spamassassin public corpus (2003) (accessed 27/02/04)
Sakkis, G., Androutsopoulos, I., Paliouras, G., Karkaletsis, V., Spyropoulos, C., Stamatopoulos, P.: Ling-spam - from a memory-based approach to anti-spam filtering for mailing lists. Information Retrieval 6, 49–73 (2003)
Group, P.: The portland spam automatic mail-filtering project (2003) (accessed 02/03/04)
Jones, R.W.M.: Annexia great spam archive (2003) (accessed 27/02/04)
Quinlan, J.R.: C4.5, Programs For Machine Learning. Morgan Kaufmann, California (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Trudgian, D.C. (2004). Spam Classification Using Nearest Neighbour Techniques. In: Yang, Z.R., Yin, H., Everson, R.M. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2004. IDEAL 2004. Lecture Notes in Computer Science, vol 3177. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28651-6_85
Download citation
DOI: https://doi.org/10.1007/978-3-540-28651-6_85
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22881-3
Online ISBN: 978-3-540-28651-6
eBook Packages: Springer Book Archive