research-article

Social tag prediction

Published: 20 July 2008

Abstract

In this paper, we look at the "social tag prediction" problem. Given a set of objects and a set of tags applied to those objects by users, can we predict whether a given tag could or should be applied to a particular object? We investigated this question using one of the largest crawls of the social bookmarking system del.icio.us gathered to date. For URLs in del.icio.us, we predicted tags based on page text, anchor text, surrounding hosts, and other tags applied to the URL. We found an entropy-based metric that captures the generality of a particular tag and informs an analysis of how well that tag can be predicted. We also found that tag-based association rules can produce very high-precision predictions and give deeper insight into the relationships between tags. Our results have implications both for the study of tagging systems as potential information retrieval tools and for the design of such systems.
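
To make the idea of tag-based association rules concrete, here is a minimal sketch in Python. It is not the authors' implementation: the function name `mine_tag_rules`, the thresholds, and the toy data are all assumptions made for illustration. It restricts itself to single-tag rules of the form "if a URL has tag A, it likely has tag B" and scores each rule with the standard support and confidence measures from association rule mining.

```python
from collections import Counter
from itertools import permutations


def mine_tag_rules(url_tags, min_support=2, min_confidence=0.8):
    """Return single-tag rules A -> B as (A, B, support, confidence).

    support    = number of URLs carrying both A and B
    confidence = support / number of URLs carrying A
    (Hypothetical helper; thresholds are illustrative, not from the paper.)
    """
    tag_counts = Counter()
    pair_counts = Counter()
    for tags in url_tags.values():
        unique = set(tags)
        tag_counts.update(unique)
        # Ordered pairs, so A -> B and B -> A are counted separately.
        pair_counts.update(permutations(sorted(unique), 2))

    rules = []
    for (a, b), n_ab in pair_counts.items():
        confidence = n_ab / tag_counts[a]
        if n_ab >= min_support and confidence >= min_confidence:
            rules.append((a, b, n_ab, confidence))
    # Highest confidence first, then highest support, then alphabetical.
    return sorted(rules, key=lambda r: (-r[3], -r[2], r[0], r[1]))


# Toy data (invented): URL identifier -> set of tags applied by users.
TOY_URL_TAGS = {
    "u1": {"python", "programming", "tutorial"},
    "u2": {"python", "programming"},
    "u3": {"programming", "java"},
    "u4": {"python", "programming", "reference"},
    "u5": {"java", "programming"},
}

rules = mine_tag_rules(TOY_URL_TAGS, min_support=2, min_confidence=0.9)
for antecedent, consequent, support, confidence in rules:
    print(f"{antecedent} -> {consequent} (support={support}, confidence={confidence:.2f})")
```

On this toy data the specific tags "python" and "java" each imply the general tag "programming", while the reverse rules fall below the confidence threshold. That asymmetry is one simple way such rules can hint at relationships between tags.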


Published In

SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
July 2008
934 pages
ISBN: 9781605581644
DOI: 10.1145/1390334

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. association rules
  2. collaborative tagging
  3. social bookmarking
  4. text classification

Qualifiers

  • Research-article

Conference

SIGIR '08
Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Article Metrics

  • Downloads (last 12 months): 14
  • Downloads (last 6 weeks): 3
Reflects downloads up to 02 Mar 2025

Cited By

  • (2024) Tagging Items with Emerging Tags: A Neural Topic Model Based Few-Shot Learning Approach. ACM Transactions on Information Systems 42:4 (1-37). DOI: 10.1145/3641859. Online publication date: 23-Jan-2024
  • (2022) Automated Hashtag Hierarchy Generation Using Community Detection and the Shannon Diversity Index, with Applications to Twitter and Parler. International Journal of Semantic Computing 16:04 (473-496). DOI: 10.1142/S1793351X22500052. Online publication date: 27-Aug-2022
  • (2022) From Fundamentals to Recent Advances: A Tutorial on Keyphrasification. Advances in Information Retrieval (582-588). DOI: 10.1007/978-3-030-99739-7_73. Online publication date: 5-Apr-2022
  • (2020) Tagging and Tag Recommendation. Cyberspace. DOI: 10.5772/intechopen.82242. Online publication date: 17-Jun-2020
  • (2020) Large-Scale Question Tagging via Joint Question-Topic Embedding Learning. ACM Transactions on Information Systems 38:2 (1-23). DOI: 10.1145/3380954. Online publication date: 28-Feb-2020
  • (2020) Learning Semantic Representations from Directed Social Links to Tag Microblog Users at Scale. ACM Transactions on Information Systems 38:2 (1-30). DOI: 10.1145/3377550. Online publication date: 7-Mar-2020
  • (2020) Similarity Measure for Product Attribute Estimation. IEEE Access 8 (179073-179082). DOI: 10.1109/ACCESS.2020.3027023. Online publication date: 2020
  • (2020) Sentiment Enhanced Multi-Modal Hashtag Recommendation for Micro-Videos. IEEE Access 8 (78252-78264). DOI: 10.1109/ACCESS.2020.2989473. Online publication date: 2020
  • (2020) Graph-based tag recommendations using clusters of patients in clinical decision support system. Concurrency and Computation: Practice and Experience 33:1. DOI: 10.1002/cpe.5624. Online publication date: 6-Jan-2020
  • (2019) Co-attention Memory Network for Multimodal Microblog's Hashtag Recommendation. IEEE Transactions on Knowledge and Data Engineering (1-1). DOI: 10.1109/TKDE.2019.2932406. Online publication date: 2019
