HoneySpam 2.0: Profiling Web Spambot Behaviour

Hayati, Pedram; Chai, Kevin; Potdar, Vidyasagar; Talevski, Alex

doi:10.1007/978-3-642-11161-7_23

Pedram Hayati²⁴,
Kevin Chai²⁴,
Vidyasagar Potdar²⁴ &
…
Alex Talevski²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5925))

Included in the following conference series:

International Conference on Principles and Practice of Multi-Agent Systems

1208 Accesses
20 Citations

Abstract

Internet bots have been widely used for various beneficial and malicious activities on the web. In this paper we provide new insights into a new kind of bot termed as web spambot which is primarily used for spreading spam content on the web. To gain insights into web spambots, we developed a tool (HoneySpam 2.0) to track their behaviour. This paper presents two main contributions, firstly it describes the design of HoneySpam 2.0 and secondly we outline the experimental results that characterise web spambot behaviour. By profiling web spambots, we provide the foundation for identifying such bots and preventing and filtering web spam content.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Gyongyi, Z., Garcia-Molina, H.: Web spam taxonomy. In: Proceedings of the 1st International Workshop on Adversarial Information Retrieval on the Web, Chiba, Japan (2005)
Google Scholar
Hayati, P., Potdar, V.: Toward Spam 2.0: An Evaluation of Web 2.0 Anti-Spam Methods. In: 7th IEEE International Conference on Industrial Informatics Cardiff, Wales (2009)
Google Scholar
Zeitgeist, L.S.: Comment Spam. In: Akismet, ed. (2009), http://akismet.com/stats/
Cobb, S.: The Economics of Spam. EPrivacyGroup (2003), http://www.eprivacygroup.com
Workathome, Work from home online ad placing work pay per posting (2009), http://www.workathomeforum.in/online-adplacing-homejob.htm , http://www.workathomeforum.in/online-adplacing-homejob.htm
Tan, P.-N., Kumar, V.: Discovery of Web Robot Sessions Based on their Navigational Patterns. Data Mining and Knowledge Discovery 6, 9–35 (2002)
Article MathSciNet Google Scholar
Park, K., Pai, V.S., Lee, K.-W., Calo, S.: Securing Web Service by Automatic Robot Detection. In: USENIX 2006 Annual Technical Conference Refereed Paper (2006)
Google Scholar
Chellapilla, K., Simard, P.: Using Machine Learning to Break Visual Human Interaction Proofs (HIPs). In: NIPS (2004)
Google Scholar
Abram, H., Michael, W.G., Richard, C.H.: Reverse Engineering CAPTCHAs. In: Proceedings of the 2008 15th Working Conference on Reverse Engineering. IEEE Computer Society, Los Alamitos (2008)
Google Scholar
Mori, G., Malik, J.: Recognizing objects in adversarial clutter: breaking a visual CAPTCHA. In: Proceedings. 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, p I-134-I-141 (2003)
Google Scholar
Baird, H.S., Bentley, J.L.: Implicit CAPTCHAs. In: Proceedings SPIE/IS&T Conference on Document Recognition and Retrieval XII (DR&R2005), San Jose, CA (2005)
Google Scholar
Ogbuji, U.: Real Web 2.0: Battling Web spam (2008), http://www.ibm.com/developerworks/web/library/wa-realweb10/
Mertz, D.: Charming Python: Beat spam using hashcash (2004), http://www.ibm.com/developerworks/linux/library/l-hashcash.html
Cooley, R., Mobasher, B., Srivastava, J.: Web mining: information and pattern discovery on the World Wide Web. In: Proceedings of Ninth IEEE International Conference on Tools with Artificial Intelligence 1997, pp. 558–567 (1997)
Google Scholar
Webb, S., Caverlee, J., Pu, C.: Social Honeypots: Making Friends with a Spammer Near You. In: Proceedings of the Fifth Conference on Email and Anti-Spam (CEAS 2008), Mountain View, CA (2008)
Google Scholar
Andreolini, M., Bulgarelli, A., Colajanni, M., Mazzoni, F.: HoneySpam: honeypots fighting spam at the source. In: Proceedings of the Steps to Reducing Unwanted Traffic on the Internet on Steps to Reducing Unwanted Traffic on the Internet Workshop (2005), Cambridge, MA, p. 11 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Digital Ecosystem and Business Intelligence Institute, Curtin University, Perth, Western Australia
Pedram Hayati, Kevin Chai, Vidyasagar Potdar & Alex Talevski

Authors

Pedram Hayati
View author publications
You can also search for this author in PubMed Google Scholar
Kevin Chai
View author publications
You can also search for this author in PubMed Google Scholar
Vidyasagar Potdar
View author publications
You can also search for this author in PubMed Google Scholar
Alex Talevski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science and Information Engineering, The Catholic University of Korea, Bucheon, South Korea
Jung-Jin Yang
Faculty of Information Science and Electrical Engineering, Department of Informatics, Kyushu University, 744 Motooka, Nishi-ku, 819-0395, Fukuoka, Japan
Makoto Yokoo
School of Techno-Business Administration, Dept. of Computer Science, Nagoya Institute of Technology, Gokiso, Showa-ku, 466-8555, Nagoya, Japan
Takayuki Ito
School of Electronic Engineering and Computer Science, Peking University, No. 5 Yiheyuan Road, 100871, Beijing, China
Zhi Jin
Robotics Institute, Carnegie Mellon University, 5000 Forbes Avenue, 15213, Pittsburgh, PA, USA
Paul Scerri

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hayati, P., Chai, K., Potdar, V., Talevski, A. (2009). HoneySpam 2.0: Profiling Web Spambot Behaviour. In: Yang, JJ., Yokoo, M., Ito, T., Jin, Z., Scerri, P. (eds) Principles of Practice in Multi-Agent Systems. PRIMA 2009. Lecture Notes in Computer Science(), vol 5925. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11161-7_23

Download citation

DOI: https://doi.org/10.1007/978-3-642-11161-7_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11160-0
Online ISBN: 978-3-642-11161-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics