Abstract
Tagging systems allow users to interactively annotate a pool of shared resources using descriptive strings called tags. Tags guide users to interesting resources and help them build communities that share their expertise and resources. As tagging systems gain popularity, they become more susceptible to tag spam: misleading tags generated to increase the visibility of some resources or simply to confuse users. Our goal is to understand this problem better. In particular, we are interested in answers to questions such as: How many malicious users can a tagging system tolerate before results significantly degrade? What types of tagging systems are more vulnerable to malicious attacks? What would be the effort and the impact of employing a trusted moderator to find bad postings? Can a system automatically protect itself from spam, for instance, by exploiting user tag patterns? To answer these questions, we introduce a framework for modeling tagging systems and user tagging behavior. We also describe a method for ranking the documents that match a tag based on the reliability of the users who tagged them. Using our framework, we study how existing approaches behave under malicious attacks, and we assess the impact of a moderator and of our ranking method.
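The abstract does not say how tagger reliability is computed, so the following is only a minimal sketch of the general idea under a stated assumption: a user counts as more reliable the more often other users assigned the same tag to the same document. The function names, the scoring rule, and the sample postings are all hypothetical and serve only as an illustration.

```python
from collections import defaultdict

# Hypothetical postings: (user, document, tag) triples.
postings = [
    ("alice", "d1", "python"),
    ("bob", "d1", "python"),
    ("carol", "d1", "python"),
    ("alice", "d2", "python"),
    ("mallory", "d3", "python"),  # nobody else agrees with these postings
    ("mallory", "d4", "python"),
]


def reliability_scores(postings):
    """Assumed reliability measure: for each user, count how many other
    users assigned the same tag to the same document."""
    taggers = defaultdict(set)
    for user, doc, tag in postings:
        taggers[(doc, tag)].add(user)

    scores = defaultdict(int)
    for users in taggers.values():
        for user in users:
            scores[user] += len(users) - 1  # agreements with other users
    return dict(scores)


def rank_documents(postings, tag):
    """Rank the documents carrying `tag` by the summed reliability of
    the users who posted that tag on them (highest first)."""
    scores = reliability_scores(postings)
    doc_score = defaultdict(int)
    seen = set()
    for user, doc, t in postings:
        # Count each user at most once per document.
        if t == tag and (user, doc) not in seen:
            seen.add((user, doc))
            doc_score[doc] += scores.get(user, 0)
    # Sort by descending score, breaking ties by document id.
    return sorted(doc_score.items(), key=lambda kv: (-kv[1], kv[0]))


print(rank_documents(postings, "python"))
```

In this toy data, a ranking by raw posting counts would place d2, d3, and d4 on equal footing with one posting each. Weighting by the assumed reliability score instead pushes the documents tagged only by the isolated user below d2.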