skip to main content
10.1145/2663712.2666196acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Bringing Head Closer to the Tail with Entity Linking

Published:07 November 2014Publication History

ABSTRACT

With the creation and rapid development of knowledge bases, it has become easier to understand the underlying semantics of unstructured text (short or long) on the web. In this work we especially look at the impact of entity linking on search logs. Search queries follow a Zipfian distribution wherein other than few popular queries (head queries), a significant percentage of queries (tail queries) occur rarely. Given a search log, there is sufficient data to analyze head queries but insufficient data (low frequency, limited clicks) to draw any conclusions about tail queries. In this work we focus on quantifying the extent of overlap between long tail and head queries by means of entity linking. We specifically analyze the frequency distribution of entities in head and tail queries. Our analysis shows that by means of entity linking, we can indeed bridge the gap between the head and tail.

References

  1. R. Baeza-Yates, A. Gionis, F. Junqueira, V. Murdock, V. Plachouras, and F. Silvestri. The impact of caching on search engines. In SIGIR. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Ceccarelli, C. Lucchese, S. Orlando, R. Perego, and S. Trani. Dexter: an open source framework for entity linking. In ESAIR. ACM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. Downey, S. Dumais, and E. Horvitz. Heads and tails: Studies of web search with common and rare queries. In SIGIR. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Goel, A. Broder, E. Gabrilovich, and B. Pang. Anatomy of the long tail: Ordinary people with extraordinary tastes. In WSDM. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. L. Hollink, P. Mika, and R. Blanco. Web usage mining with semantic analysis. In WWW, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Milne and I. H. Witten. Learning to link with wikipedia. In CIKM. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. F. Silvestri. Mining query logs: Turning search usage data into knowledge. Foundations and Trends in Information Retrieval, 4, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Bringing Head Closer to the Tail with Entity Linking

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ESAIR '14: Proceedings of the 7th International Workshop on Exploiting Semantic Annotations in Information Retrieval
        November 2014
        52 pages
        ISBN:9781450313650
        DOI:10.1145/2663712

        Copyright © 2014 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 7 November 2014

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        ESAIR '14 Paper Acceptance Rate11of15submissions,73%Overall Acceptance Rate35of55submissions,64%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader