research-article

Combating spam in tagging systems: An evaluation

Authors:
Georgia Koutrika

Stanford University, Stanford, CA

Stanford University, Stanford, CA
View Profile

,
Frans Adjie Effendi

Stanford University, Stanford, CA

Stanford University, Stanford, CA
View Profile

,
Zolt´n Gyöngyi

Stanford University, Stanford, CA

Stanford University, Stanford, CA
View Profile

,
Paul Heymann

Stanford University, Stanford, CA

Stanford University, Stanford, CA
View Profile

,
Hector Garcia-Molina

Stanford University, Stanford, CA

Stanford University, Stanford, CA
View Profile

Authors Info & Claims

ACM Transactions on the Web Volume 2 Issue 4Article No.: 22pp 1–34https://doi.org/10.1145/1409220.1409225

Published:27 October 2008Publication History

ACM Transactions on the Web

Abstract

Tagging systems allow users to interactively annotate a pool of shared resources using descriptive strings called tags. Tags are used to guide users to interesting resources and help them build communities that share their expertise and resources. As tagging systems are gaining in popularity, they become more susceptible to tag spam: misleading tags that are generated in order to increase the visibility of some resources or simply to confuse users. Our goal is to understand this problem better. In particular, we are interested in answers to questions such as: How many malicious users can a tagging system tolerate before results significantly degrade? What types of tagging systems are more vulnerable to malicious attacks? What would be the effort and the impact of employing a trusted moderator to find bad postings? Can a system automatically protect itself from spam, for instance, by exploiting user tag patterns? In a quest for answers to these questions, we introduce a framework for modeling tagging systems and user tagging behavior. We also describe a method for ranking documents matching a tag based on taggers' reliability. Using our framework, we study the behavior of existing approaches under malicious attacks and the impact of a moderator and our ranking method.

References

3spots. http://3spots.blogspot.com/2006/01/all-social-that-can-bookmark.html.Google Scholar
Adlam, T. 2006. Tag and ping phenomenon. http://www.optiniche.com/blog/174/tag-and-ping/.Google Scholar
Brooks, C. and Montanez, N. 2006. Improved annotation of the blogosphere via autotagging and hierarchical clustering. In Proceedings of the 15th International Conference on the World Wide Web. Google ScholarDigital Library
CiteULike. http://www.citeulike.org/.Google Scholar
Control, N. http://asp.net/ajax/control-toolkit/live/NoBot/NoBot.aspx.Google Scholar
del.icio.us. http://del.icio.us/.Google Scholar
Diigo. http://www.diigo.com/.Google Scholar
EbiquityBlogger. 2007 http://ebiquity.umbc.edu/blogger/2007/01/24/tag-spam-on-the-rise.Google Scholar
Farrell, S. and Lau, T. 2006. Fringe contacts: people tagging for the enterprise. In Proceedings of the Collaborative Web Tagging Workshop in conjunction with the 15th International Conference on the World Wide Web.Google Scholar
Flickr. url: http://www.flickr.com/.Google Scholar
Golder, S. and Huberman, B. A. 2006. Usage patterns of collaborative tagging systems. J. Inform. Sci. 32, 2, 198--208. Google ScholarDigital Library
Guha, R., Kumar, R., Raghavan, P., and Tomkins, A. 2004. Propagation of trust and distrust. In Proceedings of the 13th International Conference on the World Wide Web. 403--412. Google ScholarDigital Library
Gyöngyi, Z., Berkhin, P., Garcia-Molina, H., and Pedersen, J. 2006. Link spam detection with mass estimation. In Proceedings of the 32nd International Conference on Very Large Databases. 439--450. Google ScholarDigital Library
Gyöngyi, Z. and Garcia-Molina, H. 2005. Web spam taxonomy. In Proceedings of the 1st International Workshop on Adversarial Information Retrieval on the Web. 39--47.Google Scholar
Gyöngyi, Z., Garcia-Molina, H., and Pedersen, J. 2004. Combating spam with TrustRank. In Proceedings of the 30th International Conference on Very Large Databases. 576--587. Google ScholarDigital Library
Henzinger, M. 2000. Link analysis in web information retrieval. IEEE Data Eng. Bull. 23, 3, 3--8.Google Scholar
John, A. and Seligmann, D. 2006. Collaborative tagging and expertise in the enterprise. In Proceedings of the Collaborative Web Tagging Workshop in conjunction with the 15th International Conference on the World Wide Web.Google Scholar
Jots. http://www.jots.com/.Google Scholar
Koutrika, G., Effendi, F., Gyöngyi, Z., Heymann, P., and Garcia-Molina, H. 2007. Combating spam in tagging systems. In Proceedings of the 3rd International Workshop on Adversarial Information Retrieval on the Web. Google ScholarDigital Library
Kumar, R., Novak, J., and Tomkins, A. 2006. Structure and evolution of online social networks. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 611--617. Google ScholarDigital Library
Marlow, C., Naaman, M., Boyd, D., and Davis, M. 2006. Position paper, tagging, taxonomy, flickr, article, toread. In Proceedings of the Hypertext Conference. 31--40. Google ScholarDigital Library
Mathes, A. 2004. Folksonomies—cooperative classification and communication through shared metadata. Computer Mediated Communication, LIS590CMC (Doctoral Seminar), Graduate School of Library and Information Science, University of Illinois Urbana-Champaign.Google Scholar
Merholz, P. 2004. Metadata for the masses. http://www.adaptivepath.com/ideas/essays/archives/000361.php.Google Scholar
Mishne, G. 2006. Autotag: collaborative approach to automated tag assignment for weblog posts. In Proceedings of the 15th International Conference on the World Wide Web. Google ScholarDigital Library
MyWeb. http://myweb2.search.yahoo.com/.Google Scholar
Ohkura, T., Kiyota, Y., and Nakagawa, H. 2006. Browsing system for weblog articles based on automated folksonomy. In Proceedings of the 15th International Conference on the World Wide Web.Google Scholar
Rawsugar. http://rawsugar.com/.Google Scholar
RealTravel, X. http://realtravel.com/.Google Scholar
Schmitz, P. 2006. Inducing ontology from flickr tags. In Proceedings of the Collaborative Web Tagging Workshop in conjunction with the 15th International Conference on the World Wide Web.Google Scholar
Sen, S., Lam, S., Rashid, A., Cosley, D., Frankowski, D., Osterhouse, J., Harper, F. M., and Riedl, J. 2006. Tagging, communities, vocabulary, evolution. In Proceedings of the 10th International Conference on Computer Supported Cooperative Work in Design. Google ScholarDigital Library
Slideshare. http://slideshare.net/.Google Scholar
Technorati. http://www.technorati.com/.Google Scholar
Wasserman, S. and Faust, K. 1994. Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge, UK.Google Scholar
Wu, B., Goel, V., and Davison, B. 2006. Topical trustrank: using topicality to combact web spam. In Proceedings of the 15th International Conference on the World Wide Web. 63--72. Google ScholarDigital Library
Xu, Z., Fu, Y., Mao, J., and Su, D. 2006. Towards the semantic web: collaborative tag suggestions. In Proceedings of the Collaborative Web Tagging Workshop in 15th International Conference on the World Wide Web.Google Scholar
YouTube. http://www.youtube.com/.Google Scholar

Index Terms

Recommendations

Combating spam in tagging systems
AIRWeb '07: Proceedings of the 3rd international workshop on Adversarial information retrieval on the web

Tagging systems allow users to interactively annotate a pool of shared resources using descriptive tags. As tagging systems are gaining in popularity, they become more susceptible to tag spam: misleading tags that are generated in order to increase the ...
Read More
Detecting tag spam in social tagging systems with collaborative knowledge
FSKD'09: Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7

Social tagging systems allow collaborative users to annotate shared resources with tags. Since they rely on user-contributed content, social tagging systems are vulnerable to spam annotations, which are generated by malicious users to mislead or confuse ...
Read More
Detecting Tag Spam in Social Tagging Systems with Collaborative Knowledge
FSKD '09: Proceedings of the 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery - Volume 07

Social tagging systems allow collaborative users to annotate shared resources with tags. Since they rely on user contributed content, social tagging systems are vulnerable to spam annotations, which are generated by malicious users to mislead or confuse ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on the Web Volume 2, Issue 4
October 2008
118 pages
ISSN:1559-1131
EISSN:1559-114X
DOI:10.1145/1409220
Issue’s Table of Contents

Copyright © 2008 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 27 October 2008
- Accepted: 1 June 2008
- Revised: 1 October 2007
- Received: 1 March 2007
Published in tweb Volume 2, Issue 4

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Tagging
bookmarking systems
tag spam
tagging models
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 40
  Total Citations
  View Citations
- 1,120
  Total Downloads
- Downloads (Last 12 months)6
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Combating spam in tagging systems: An evaluation

ACM Transactions on the Web

Abstract

References

Cited By

Index Terms

Recommendations

Combating spam in tagging systems

Detecting tag spam in social tagging systems with collaborative knowledge

Detecting Tag Spam in Social Tagging Systems with Collaborative Knowledge

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Combating spam in tagging systems: An evaluation

ACM Transactions on the Web

Abstract

References

Cited By

Index Terms

Recommendations

Combating spam in tagging systems

Detecting tag spam in social tagging systems with collaborative knowledge

Detecting Tag Spam in Social Tagging Systems with Collaborative Knowledge

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media