DOI: 10.1145/2872518.2890537
demonstration

The Knowledge Awakens: Keeping Knowledge Bases Fresh with Emerging Entities

Published: 11 April 2016

ABSTRACT

Entity search over news, social media and the Web allows users to precisely retrieve concise information about specific people, organizations, movies and their characters, and other kinds of entities. This expressive search mode builds on two major assets: 1) a knowledge base (KB) that contains the entities of interest and 2) entity markup in the documents of interest derived by automatic disambiguation of entity names (NED) and linking names to the KB. These prerequisites are not easily available, though, in the important case when a user is interested in a newly emerging entity (EE) such as new movies, new songs, etc. Automatic methods for detecting and canonicalizing EEs are not nearly at the same level as the NED methods for prominent entities that have rich descriptions in the KB.

To overcome this major limitation, we have developed an approach and prototype system that allows searching for EEs in a user-friendly manner. The approach leverages the human in the loop by prompting for user feedback on candidate entities and on characteristic keyphrases for EEs. For convenience and low burden on users, this process is supported by the automatic harvesting of tentative keyphrases. Our demo system shows this interactive process and its high usability.
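
The interactive process described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how keyphrases are harvested, so simple token counting over sentences that mention the name stands in for that step. The function names `harvest_keyphrases` and `build_emerging_entity`, the stopword list, the `confirm` callback that plays the role of the user, and the sample entity "Moon Harbor" are all hypothetical.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not from the paper).
STOPWORDS = {"a", "an", "and", "by", "for", "in", "is", "of", "on", "the", "to", "was", "with"}


def harvest_keyphrases(name, documents, top_k=5):
    """Collect tentative keyphrases for `name`.

    Stand-in heuristic: count non-stopword tokens in sentences that
    mention the name, then return the `top_k` most frequent ones.
    """
    name_lower = name.lower()
    name_tokens = set(name_lower.split())
    counts = Counter()
    for doc in documents:
        for sentence in re.split(r"[.!?]+", doc):
            lowered = sentence.lower()
            if name_lower not in lowered:
                continue  # only sentences mentioning the name contribute
            for token in re.findall(r"[a-z0-9]+", lowered):
                if token not in STOPWORDS and token not in name_tokens:
                    counts[token] += 1
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [token for token, _ in ranked[:top_k]]


def build_emerging_entity(name, tentative, confirm):
    """Keep only the tentative keyphrases that the user confirms.

    `confirm` is a callable taking a keyphrase and returning True or False;
    in a real system this would be an interactive prompt.
    """
    accepted = [kp for kp in tentative if confirm(kp)]
    return {"name": name, "keyphrases": accepted}


if __name__ == "__main__":
    docs = [
        "Moon Harbor is a new film. The weather was mild.",
        "Critics praised the film Moon Harbor.",
    ]
    tentative = harvest_keyphrases("Moon Harbor", docs, top_k=3)
    # Simulated user feedback: reject "critics", accept everything else.
    entity = build_emerging_entity("Moon Harbor", tentative, lambda kp: kp != "critics")
    print(tentative)
    print(entity)
```

Passing the user's decision in as a callback keeps the harvesting step automatic while leaving the final choice of characteristic keyphrases to the human in the loop, which mirrors the division of labor the abstract describes.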

      • Published in

        ACM Other conferences
        WWW '16 Companion: Proceedings of the 25th International Conference Companion on World Wide Web
        April 2016
        1094 pages
        ISBN:9781450341448

        Copyright © 2016 Copyright is held by the owner/author(s)

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        International World Wide Web Conferences Steering Committee

        Republic and Canton of Geneva, Switzerland

        Qualifiers

        • demonstration

        Acceptance Rates

        WWW '16 Companion Paper Acceptance Rate: 115 of 727 submissions, 16%
        Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%
