skip to main content
10.1145/1863879.1863881acmotherconferencesArticle/Chapter ViewAbstractPublication PagessemsearchConference Proceedingsconference-collections
research-article

Using BM25F for semantic search

Published:26 April 2010Publication History

ABSTRACT

Information Retrieval (IR) approaches for semantic web search engines have become very populars in the last years. Popularization of different IR libraries, like Lucene, that allows IR implementations almost out-of-the-box have make easier IR integration in Semantic Web search engines. However, one of the most important features of Semantic Web documents is the structure, since this structure allow us to represent semantic in a machine readable format. In this paper we analyze the specific problems of structured IR and how to adapt weighting schemas for semantic document retrieval.

References

  1. }}C. Bizer, T. Heath, K. Idehen, and T. Berners-Lee. Linked data on the web (ldow2008). In WWW, pages 1265--1266, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. }}G. Cheng and Y. Qu. Searching linked objects with falcons: Approach, implementation and evaluation. Int. J. Semantic Web Inf. Syst., 5(3):49--70, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  3. }}C. Cleverdon. The Cranfield tests on index language devices. pages 47--59, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. }}N. Craswell, H. Zaragoza, and S. Robertson. Microsoft cambridge at trec-14: Enterprise track. In TREC, 2005.Google ScholarGoogle Scholar
  5. }}M. d'Aquin, C. Baldassarre, L. Gridinoc, S. Angeletou, M. Sabou, and E. Motta. Characterizing knowledge on the semantic web with watson. In EON, pages 1--10, 2007.Google ScholarGoogle Scholar
  6. }}M. d'Aquin, M. Sabou, E. Motta, S. Angeletou, L. Gridinoc, V. Lopez, and F. Zablith. What can be done with the semantic web? an overview Watson-based applications. In SWAP, 2008.Google ScholarGoogle Scholar
  7. }}R. Delbru. SIREn: Entity retrieval system for the web of data. In Proceedings of the 3rd Symposium on Future Directions in Information Access (FDIA)., 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. }}S. Elbassuoni, M. Ramanath, R. Schenkel, M. Sydow, and G. Weikum. Language-model-based ranking for queries on RDF-graphs. In CIKM '09: Proceeding of the 18th ACM conference on Information and knowledge management, pages 977--986, New York, NY, USA, 2009. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. }}M. Fernandez, V. Lopez, E. Motta, M. Sabou, V. Uren, D. Vallet, and P. Castells. Using TREC for cross-comparison between classic IR and ontology-based search models at a Web scale. In Workshop: Semantic search workshop at 18th International World Wide Web Conference, 2009.Google ScholarGoogle Scholar
  10. }}R. V. Guha, R. McCool, and E. Miller. Semantic search. In WWW, pages 700--709, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. }}D. Harman. Overview of the first text retrieval conference (trec-1). In TREC, pages 1--20, 1992.Google ScholarGoogle Scholar
  12. }}A. Harth, A. Hogan, R. Delbru, J. Umbrich, S. O'Riain, and S. Decker. SWSE: Answers before links! In Semantic Web Challenge, 2007.Google ScholarGoogle Scholar
  13. }}D. Hawking, E. M. Voorhees, N. Craswell, and P. Bailey. Overview of the trec-8 web track. In TREC, 1999.Google ScholarGoogle Scholar
  14. }}M. Lalmas. XML Retrieval. Synthesis Lectures on Information Concepts, Retrieval, and Services. Morgan & Claypool Publishers, 2009.Google ScholarGoogle Scholar
  15. }}C. D. Manning, P. Raghavan, and H. Schtze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. }}J. Pérez-Iglesias, J. R. Pérez-Agüera, V. Fresno, and Y. Z. Feinstein. Integrating the Probabilistic Models BM25/BM25F into Lucene. CoRR, abs/0911.5046, 2009.Google ScholarGoogle Scholar
  17. }}S. E. Robertson, H. Zaragoza, and M. J. Taylor. Simple BM25 extension to multiple weighted fields. In CIKM '04, pages 42--49, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. }}M. J. Taylor, H. Zaragoza, N. Craswell, S. Robertson, and C. Burges. Optimisation methods for ranking functions with multiple parameters. In CIKM, pages 585--593, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. }}G. Tummarello and R. Delbru. Entity coreference resolution services in sindice.com: Identification on the current web of data. In IRSW, 2008.Google ScholarGoogle Scholar
  20. }}H. Wang, Q. Liu, T. Penin, L. Fu, L. Zhang, T. Tran, Y. Yu, and Y. Pan. Semplore: A scalable IR approach to search the web of data. J. Web Sem., 7(3):177--188, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. }}H. Zaragoza, N. Craswell, M. J. Taylor, S. Saria, and S. E. Robertson. Microsoft cambridge at trec 13: Web and hard tracks. In TREC, 2004.Google ScholarGoogle Scholar

Index Terms

  1. Using BM25F for semantic search

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        SEMSEARCH '10: Proceedings of the 3rd International Semantic Search Workshop
        April 2010
        75 pages
        ISBN:9781450301305
        DOI:10.1145/1863879

        Copyright © 2010 Copyright is held by the author/owner(s).

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 26 April 2010

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader