DOI: 10.1145/1571941.1572121
poster

A search engine in a few lines.: yes, we can!

Published: 19 July 2009

ABSTRACT

Many research implementations of search engines are written in C, C++, or Java. They are difficult to understand and modify because they run to at least a few thousand lines of code and contain many low-level details. In this paper, we show how to achieve a much shorter, higher-level implementation of only a few hundred lines. We accomplish this by using a high-level functional programming language, F#, and some of its features, such as sequences, pipes, and structured input and output. Using a search engine implementation as a case study, we argue that functional programming fits the domain of Information Retrieval problems much better than imperative/OO languages like C++ and Java.

Functional programming languages are ideal for rapid algorithm prototyping and data exploration in the field of Information Retrieval (IR).

Additionally, our implementation can be used as a case study in an IR course, since it is a very high-level, but nevertheless executable, specification of a search engine.
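To make the pipeline idea concrete, here is a minimal sketch of an inverted index and a conjunctive (AND) query built from small lazy stages. It is written in Python rather than F#, with generators standing in for F# sequences and nested function calls standing in for the pipe operator. It is not the authors' implementation: the names `tokenize`, `postings`, `build_index`, and `search`, the tokenization rule, and the sample documents are all assumptions made for illustration.

```python
import re
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Set, Tuple


def tokenize(text: str) -> Iterator[str]:
    """Lazily yield lowercase alphanumeric tokens (an assumed, simplistic rule)."""
    return (m.group(0) for m in re.finditer(r"[a-z0-9]+", text.lower()))


def postings(docs: Iterable[Tuple[int, str]]) -> Iterator[Tuple[str, int]]:
    """Lazily yield one (term, doc_id) pair per token of every document."""
    for doc_id, text in docs:
        for term in tokenize(text):
            yield term, doc_id


def build_index(docs: Iterable[Tuple[int, str]]) -> Dict[str, Set[int]]:
    """Consume the postings stream into a term -> set of doc ids mapping."""
    index: Dict[str, Set[int]] = defaultdict(set)
    for term, doc_id in postings(docs):
        index[term].add(doc_id)
    return dict(index)


def search(index: Dict[str, Set[int]], query: str) -> List[int]:
    """Return sorted ids of documents containing every query term."""
    terms = list(tokenize(query))
    if not terms:
        return []
    # Intersect the posting sets; an unknown term contributes an empty set.
    matches = set.intersection(*(index.get(t, set()) for t in terms))
    return sorted(matches)


# Hypothetical sample collection, for illustration only.
docs = [
    (1, "Functional programming fits information retrieval"),
    (2, "Search engines are written in C++ or Java"),
    (3, "A search engine in a few lines"),
]
index = build_index(docs)
```

With this sample collection, `search(index, "search engine")` matches only document 3, because the sketch does no stemming and therefore treats "engines" and "engine" as different terms. A real system would add ranking, stemming, and on-disk storage, none of which is attempted here.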


Published in

SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
July 2009
896 pages
ISBN: 9781605584836
DOI: 10.1145/1571941

Copyright © 2009 Copyright is held by the author/owner(s)

Publisher

Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Overall Acceptance Rate: 792 of 3,983 submissions, 20%