skip to main content
10.1145/1141753.1141826acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article

Search engine driven author disambiguation

Published:11 June 2006Publication History

ABSTRACT

In scholarly digital libraries, author disambiguation is an important task that attributes a scholarly work with specific authors. This is critical when individuals share the same name. We present an approach to this task that analyzes the results of automatically-crafted web searches. A key observation is that pages from rare web sites are stronger source of evidence than pages from common web sites, which we model as Inverse Host Frequency (IHF). Our system is able to achieve an average accuracy of 0.836.

References

  1. C. L. Giles, K. D. Bollacker, and S. Lawrence. CiteSeer: An automatic citation indexing system. In ACM Conf. on Digital Libraries, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. H. Han, H. Zha, and C. L. Giles. Name disambiguation in author citations using a K-way spectral clustering method. In JCDL, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. Lee, B. W. On, J. Kang, and S. Park. Effective and scalable solutions for mixed and split citation problems in digital libraries. In IQIS, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Ley. The DBLP computer science bibliography: Evolution, research issues, perspectives. In SPIRE, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Search engine driven author disambiguation

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      JCDL '06: Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
      June 2006
      402 pages
      ISBN:1595933549
      DOI:10.1145/1141753

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 11 June 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate415of1,482submissions,28%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader