Abstract
Spam messes up user’s inbox, consumes network resources and spread worms and viruses. Spam is flooding of unsolicited,unwanted e mail.Spam in blogs is called blog spam or comment spam.It is done by posting comments or flooding spams to the services such as blogs, forums,news,email archives and guestbooks. Blog spams generally appears on guestbooks or comment pages where spammers fill a comment box with spam words. In addition to wasting user’s time with unwanted comments, spam also consumes a lot of bandwith. In this paper,we propose a software tool to prevent such blog spams by using Bayesian Algorithm based technique. It is derived from Bayes’ Theorem.It gives an output which has a probability that any comment is spam, given that it has certain words in it. With using our past entries and a comment entry , this value is obtained and compared with a threshold value to find if it exceeds the threshold value or not. By using this cocept, we developed a software tool to block comment spam. The experimetal results shows that the bayesian based tool is working well. This paper has the major findings and their significance of blog spam filter.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Nagamalai, D., Dhinakaran, B.C., Lee, J.K.: An In-depth analysis of spam and spammers. IJSA 2(2), 9–22 (2008)
Mishne, G., Carmel, D., Lempel, R.: Blocking blog spam with language Model disagreement. In: AIRWEB 2005 (2005)
Thimason, A.: Blog spam- A Review. In: CEAS 2007 (2007)
Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A Bayesian Approach to Filtering Junk E-mail. AAAI Technical Report WS-98-05 (1998)
Androutsopoulos, I., et al.: An experimental comparison of naive Bayesian and keyword-based anti-spam filtering with personal e-mail messages. ACM SIGIR CRDIR, 160–167 (2000)
Apache2triad, http://apache2triad.net/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dhinakaran, B.C., Nagamalai, D., Lee, JK. (2009). Bayesian Approach Based Comment Spam Defending Tool. In: Park, J.H., Chen, HH., Atiquzzaman, M., Lee, C., Kim, Th., Yeo, SS. (eds) Advances in Information Security and Assurance. ISA 2009. Lecture Notes in Computer Science, vol 5576. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02617-1_59
Download citation
DOI: https://doi.org/10.1007/978-3-642-02617-1_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02616-4
Online ISBN: 978-3-642-02617-1
eBook Packages: Computer ScienceComputer Science (R0)