skip to main content
10.1145/1367497.1367645acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
poster

User behavior oriented web spam detection

Published: 21 April 2008 Publication History

Abstract

Combating Web spam has become one of the top challenges for Web search engines. State-of-the-art spam detection techniques are usually designed for specific known types of Web spam and are incapable and inefficient for recently-appeared spam. With user behavior analyses into Web access logs, we propose a spam page detection algorithm based on Bayes learning. Preliminary experiments on Web access data collected by a commercial Web site (containing over 2.74 billion user clicks in 2 months) show the effectiveness of the proposed detection framework and algorithm.

References

[1]
Silverstein, C., Marais, H., Henzinger, M., and Moricz, M. 1999. Analysis of a very large web search engine query log. SIGIR Forum 33, 1 (Sep. 1999), 6--12.
[2]
Zoltán Gyöngyi and Hector Garcia-Molina. Web spam taxonomy. In First International Workshop on Adversarial Information Retrieval on the Web, 2005.
[3]
Ntoulas, A., Najork, M., Manasse, M., and Fetterly, D. Detecting spam Web pages through content analysis. In proceedings of the 15th WWW conference. 83--92.
[4]
Gyongyi, Zoltan; Garcia-Molina, Hector; Pedersen, Jan. Combating Web Spam with TrustRank, Proceedings of the 30th International Conference on Very Large Data Bases.

Cited By

View all
  • (2015)Web spam detection using trust and distrust-based ant colony optimization learningInternational Journal of Web Information Systems10.1108/IJWIS-12-2014-004711:2(142-161)Online publication date: 15-Jun-2015
  • (2014)SPADE: a social-spam analytics and detection frameworkSocial Network Analysis and Mining10.1007/s13278-014-0189-14:1Online publication date: 10-Apr-2014
  • (2014)Adaptive Learning Ant Colony Optimization for Web Spam DetectionProceedings of the 14th International Conference on Computational Science and Its Applications — ICCSA 2014 - Volume 858410.1007/978-3-319-09153-2_48(642-653)Online publication date: 30-Jun-2014
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '08: Proceedings of the 17th international conference on World Wide Web
April 2008
1326 pages
ISBN:9781605580852
DOI:10.1145/1367497
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 April 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. spam detection
  2. user behavior analysis
  3. web search engine

Qualifiers

  • Poster

Conference

WWW '08
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2015)Web spam detection using trust and distrust-based ant colony optimization learningInternational Journal of Web Information Systems10.1108/IJWIS-12-2014-004711:2(142-161)Online publication date: 15-Jun-2015
  • (2014)SPADE: a social-spam analytics and detection frameworkSocial Network Analysis and Mining10.1007/s13278-014-0189-14:1Online publication date: 10-Apr-2014
  • (2014)Adaptive Learning Ant Colony Optimization for Web Spam DetectionProceedings of the 14th International Conference on Computational Science and Its Applications — ICCSA 2014 - Volume 858410.1007/978-3-319-09153-2_48(642-653)Online publication date: 30-Jun-2014
  • (2013)Russian web spam evolutionProceedings of the 22nd International Conference on World Wide Web10.1145/2487788.2488135(1137-1140)Online publication date: 13-May-2013
  • (2013)Mining Spam Accounts with User InfluenceProceedings of the 2013 International Conference on Information Science and Cloud Computing Companion10.1109/ISCC-C.2013.85(167-173)Online publication date: 7-Dec-2013
  • (2013)Unveil the Spams in WeiboProceedings of the 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing10.1109/GreenCom-iThings-CPSCom.2013.159(916-922)Online publication date: 20-Aug-2013
  • (2012)Survey on web spam detectionACM SIGKDD Explorations Newsletter10.1145/2207243.220725213:2(50-64)Online publication date: 1-May-2012
  • (2012)Identifying Web Spam with the Wisdom of the CrowdsACM Transactions on the Web10.1145/2109205.21092076:1(1-30)Online publication date: 1-Mar-2012
  • (2010)Intelligent user search behaviour knowledge discoveryInternational Conference on Fuzzy Systems10.1109/FUZZY.2010.5584867(1-8)Online publication date: Jul-2010
  • (2009)A structural, content‐similarity measure for detecting spam documents on the webInternational Journal of Web Information Systems10.1108/174400809110062075:4(431-464)Online publication date: 20-Nov-2009

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media