DOI: 10.1145/2390148.2390155
poster

Conceptualizing documents with Wikipedia

Published: 02 November 2012

Abstract

In this work, we discuss how to improve Wikilabel, an approach that uses the titles of Wikipedia pages to generate labels for documents, by adapting ideas from story link detection (SLD). A comparison against Elastic Net, a powerful machine learner, on real-world data shows that our approach clearly outperforms it.

References

[1] D. Carmel, H. Roitman, and N. Zwerdling. Enhancing cluster labeling using Wikipedia. In SIGIR '09, 2009.
[2] Q. Mei, X. Shen, and C. Zhai. Automatic labeling of multinomial topic models. In KDD '07, 2007.
[3] T. Nomoto. Two-tier similarity model for story link detection. In CIKM '10, 2010.
[4] T. Nomoto. Wikilabel: an encyclopedic approach to labeling documents en masse. In CIKM '11, 2011.
[5] Z. S. Syed, T. Finin, and A. Joshi. Wikipedia as an ontology for describing documents. In ICWSM '08, 2008.
[6] H. Zou and T. Hastie. Regularization and variable selection via the elastic net. J. R. Statist. Soc. B, 2005.

    Published In

    ESAIR '12: Proceedings of the fifth workshop on Exploiting semantic annotations in information retrieval
    November 2012
    28 pages
    ISBN: 9781450317177
    DOI: 10.1145/2390148

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. cluster labeling
    2. relevance feedback
    3. topic detection

    Qualifiers

    • Poster

    Conference

    CIKM'12
    Acceptance Rates

    Overall Acceptance Rate 35 of 55 submissions, 64%

    Cited By

    • (2015) Towards robust tags for scientific publications from natural language processing tools and Wikipedia. International Journal on Digital Libraries 16:1 (25-36). DOI: 10.1007/s00799-014-0132-0. Online publication date: 1-May-2015
    • (2014) Tagging Scientific Publications Using Wikipedia and Natural Language Processing Tools. Theory and Practice of Digital Libraries - TPDL 2013 Selected Workshops (16-27). DOI: 10.1007/978-3-319-14226-5_3. Online publication date: 6-Jul-2014
    • (2014) Tagging Scientific Publications Using Wikipedia and Natural Language Processing Tools. Theory and Practice of Digital Libraries -- TPDL 2013 Selected Workshops (16-27). DOI: 10.1007/978-3-319-08425-1_3. Online publication date: 6-Jul-2014
    • (2012) Report on the fifth workshop on exploiting semantic annotations in information retrieval (ESAIR'12). ACM SIGIR Forum 47:1 (38-45). DOI: 10.1145/2492189.2492196. Online publication date: 7-Jun-2012
    • (2012) Fifth workshop on exploiting semantic annotations in information retrieval. Proceedings of the 21st ACM international conference on Information and knowledge management (2772-2773). DOI: 10.1145/2396761.2398761. Online publication date: 29-Oct-2012
