skip to main content
10.1145/584792.584909acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Discovering the representative of a search engine

Published:04 November 2002Publication History

ABSTRACT

Given a large number of search engines on the Internet, it is difficult for a person to determine which search engines could serve his/her information needs. A common solution is to construct a metasearch engine on top of the search engines. Upon receiving a user query, the metasearch engine sends it to those underlying search engines which are likely to return the desired documents for the query. The selection algorithm used by a metasearch engine to determine whether a search engine should be sent the query typically makes the decision based on the search-engine representative, which contains characteristic information about the database of a search engine. However, an underlying search engine may not be willing to provide the needed information to the metasearch engine. This paper shows that the needed information can be estimated from an uncooperative search engine with good accuracy. Two pieces of information which permit accurate search engine selection are the number of documents indexed by the search engine and the maximum weight of each term. In this paper, we present techniques for the estimation of these two pieces of information.

References

  1. J. Callan, M. Connell, and A. Du. Automatic discovery of language models for text databases. In Proceedings of ACM SIGMOD, pages 479--490, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. K. Liu, C. Yu, W. Meng, W. Wu, and N. Rishe. A statistical method for estimating the usefulness of text databases. IEEE Transactions on Knowledge and Data Engineering. (to appear). Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. W. Meng, K. Liu, C. Yu, X. Wang, Y. Chang, and N. Rishe. Determining text databases to search in the internet. In VLDB, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. W. Meng, K. Liu, C. Yu, W. Wu, and N. Rishe. Estimating the usefulness of search engines. In ICDE, March 1999.Google ScholarGoogle Scholar
  5. W. Meng, C. Yu, and K. Liu. Building efficient and effective metasearch engines. ACM Computing Surveys, 34(1):48--89, March 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Robertson, S. Walker, and M. Beaulieu. Okapi at trec-7: automatic ad hoc, filtering, vlc and interactive. In Overview of the Seventh Text Retrieval Conference, 1998.Google ScholarGoogle Scholar
  7. G. Salton and M. McGill. Introduction to Modern Information Retrieval. McCraw-Hill, New York, 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513--523, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. C. Yu, K. Liu, W. Wu, W. Meng, and N. Rishe. Finding the most similar documents across multiple text databases. In Proceedings of the IEEE Conference on Advances in Digital Libraries (ADL'99), Baltimore, Maryland, May 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C. Yu, W. Meng, K. Liu, W. Wu, and N. Rishe. Efficient and effective metasearch for a large number of text databases. In Proceedings of ACM CIKM, November 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. Yu, W. Meng, W. Wu, and K. Liu. Efficient and effective metasearch for text databases incorporating linkages among documents. In Proceedings of ACM SIGMOD, pages 187--198, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Discovering the representative of a search engine

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management
        November 2002
        704 pages
        ISBN:1581134924
        DOI:10.1145/584792

        Copyright © 2002 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 4 November 2002

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate1,861of8,427submissions,22%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader