skip to main content
10.1145/2063576.2063986acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
poster

Folksonomy-based term extraction for word cloud generation

Published: 24 October 2011 Publication History

Abstract

In this work we study the task of term extraction for word cloud generation. We present a folksonomy-based term extraction method, called tag-boost, which boosts terms that are frequently used by the public to tag content. Our experiments with tag-boost-based term extraction over different domains demonstrate tremendous improvement in word cloud quality, as reflected by the agreement between extracted terms and manually assigned tags of the testing items. Additionally, we show that tag-boost can be effectively applied even in non-tagged domains, by using an external rich folksonomy borrowed from a well-tagged domain.

References

[1]
E. Amitay, D. Carmel, N. Har'El, S. Ofek-Koifman, A. Soffer, S. Yogev, and N. Golbandi. Social search and discovery using a unified approach. In Proceedings of HyperText, pages 199--208, 2009.
[2]
A. Berger and J. Lafferty. Information retrieval as statistical translation. In Proceedings of SIGIR, pages 222--229, 1999.
[3]
D. Carmel, H. Roitman, and N. Zwerdling. Enhancing cluster labeling using wikipedia. In Proceedings of SIGIR, pages 139--146, 2009.
[4]
P. Heymann, G. Koutrika, and H. Garcia-Molina. Can social bookmarking improve web search? In Proceedings of WSDM, pages 195--206, 2008.
[5]
P. Heymann, D. Ramage, and H. Garcia-Molina. Social tag prediction. In Proceedings of SIGIR, pages 531--538, 2008.
[6]
A. Hotho, R. Jäschke, C. Schmitz, and G. Stumme. Information retrieval in folksonomies: Search and ranking. In The Semantic Web: Research and Applications, ESWC 2006, pages 411--426, 2006.
[7]
J. Huh, L. Jones, T. Erickson, W. A. Kellogg, R. K. E. Bellamy, and J. C. Thomas. Blogcentral: the role of internal blogs at work. In Proceedings of CHI, pages 2447--2452, 2007.
[8]
H. Kwak, C. Lee, H. Park, and S. Moon. What is twitter, a social network or a news media? In Proceedings of WWW, WWW '10, pages 591--600, 2010.
[9]
Y. Liu, M. Liu, X. Chen, L. Xiang, and Q. Yang. Automatic tag recommendation for weblogs. In Proceedings of ITCS, pages 546--549, 2009.
[10]
Y.-T. Lu, S.-I. Yu, T.-C. Chang, and J. Y.-j. Hsu. A content-based method to enhance tag recommendation. In Proceedings of IJCAI, pages 2064--2069, 2009.
[11]
C. D. Manning, P. Raghavan, and H. Schutze. Introduction to Information Retrieval. 2008.
[12]
D. R. Millen, J. Feinberg, and B. Kerr. Dogear: Social bookmarking in the enterprise. In Proceedings of CHI, pages 111--120, 2006.
[13]
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of SIGIR, pages 334--342, 2001.
[14]
X. Zhang, L. Yang, X. Wu, H. Guo, Z. Guo, S. Bao, Y. Yu, and Z. Su. sdoc: exploring social wisdom for document enhancement in web mining. In Proceedings of CIKM, pages 395--404, 2009.

Cited By

View all
  • (2013)Computational Framework for Generating Visual Summaries of Topical Clusters in Twitter StreamsSocial Networks: A Framework of Computational Intelligence10.1007/978-3-319-02993-1_9(173-199)Online publication date: 10-Dec-2013
  • (2012)Swimming against the streamzProceedings of the 21st ACM international conference on Information and knowledge management10.1145/2396761.2398478(1587-1591)Online publication date: 29-Oct-2012
  • (2012)Harnessing the crowds for smart city sensingProceedings of the 1st international workshop on Multimodal crowd sensing10.1145/2390034.2390043(17-18)Online publication date: 2-Nov-2012
  • Show More Cited By

Index Terms

  1. Folksonomy-based term extraction for word cloud generation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
    October 2011
    2712 pages
    ISBN:9781450307178
    DOI:10.1145/2063576
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 October 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. tag-boost
    2. term extraction
    3. word-cloud generation

    Qualifiers

    • Poster

    Conference

    CIKM '11
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2013)Computational Framework for Generating Visual Summaries of Topical Clusters in Twitter StreamsSocial Networks: A Framework of Computational Intelligence10.1007/978-3-319-02993-1_9(173-199)Online publication date: 10-Dec-2013
    • (2012)Swimming against the streamzProceedings of the 21st ACM international conference on Information and knowledge management10.1145/2396761.2398478(1587-1591)Online publication date: 29-Oct-2012
    • (2012)Harnessing the crowds for smart city sensingProceedings of the 1st international workshop on Multimodal crowd sensing10.1145/2390034.2390043(17-18)Online publication date: 2-Nov-2012
    • (2012)Folksonomy-Based Term Extraction for Word Cloud GenerationACM Transactions on Intelligent Systems and Technology10.1145/2337542.23375453:4(1-20)Online publication date: 1-Sep-2012
    • (2011)Personalized activity streamsProceedings of the fifth ACM conference on Recommender systems10.1145/2043932.2043966(181-188)Online publication date: 23-Oct-2011

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media