skip to main content
10.1145/2467696.2467699acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
research-article

WikiMirs: a mathematical information retrieval system for wikipedia

Published:22 July 2013Publication History

ABSTRACT

Mathematical formulae in structural formats such as MathML and LaTeX are becoming increasingly available. Moreover, repositories and websites, including ArXiv and Wikipedia, and growing numbers of digital libraries use these structural formats to present mathematical formulae. This presents an important new and challenging area of research, namely Mathematical Information Retrieval (MIR). In this paper, we propose WikiMirs, a tool to facilitate mathematical formula retrieval in Wikipedia. WikiMirs is aimed at searching for similar mathematical formulae based upon both textual and spatial similarities, using a new indexing and matching model developed for layout structures. A hierarchical generalization technique is proposed to generate sub-trees from presentation trees of mathematical formulae, and similarity is calculated based upon matching at different levels of these trees. Experimental results show that WikiMirs can efficiently support sub-structure matching and similarity matching of mathematical formulae. Moreover, WikiMirs obtains both higher accuracy and better ranked results over Wikipedia in comparison to Wikipedia Search and Egomath. We conclude that WikiMirs provides a new, alternative, and hopefully better service for users to search mathematical expressions within Wikipedia.

References

  1. http://dlmf.nist.gov/.Google ScholarGoogle Scholar
  2. http://egomath.projekty.ms.mff.cuni.cz/.Google ScholarGoogle Scholar
  3. http://search.mathweb.org/.Google ScholarGoogle Scholar
  4. https://mir.fi.muni.cz/mias/.Google ScholarGoogle Scholar
  5. http://www.latexsearch.com/.Google ScholarGoogle Scholar
  6. http://www.mathjax.org/.Google ScholarGoogle Scholar
  7. http://www.openmath.org/.Google ScholarGoogle Scholar
  8. http://www.w3.org/math/.Google ScholarGoogle Scholar
  9. A. Asperti, F. Guidi, C. Coen, E. Tassi, and S. Zacchiroli. A content based mathematical search engine: Whelp. Types for Proofs and Programs, pages 17--32, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Y. Hijikata, H. Hashimoto, and S. Nishida. Search mathematical formulas by mathematical formulas. Human Interface and the Management of Information. Designing Information Environments, pages 404--411, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Kohlhase and I. Sucan. A search engine for mathematical formulae. In Artificial Intelligence and Symbolic Computation, pages 241--253. Springer, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. B. Miller and A. Youssef. Technical aspects of the digital library of mathematical functions. Annals of Mathematics and Artificial Intelligence, 38(1):121--136, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Miner and R. Munavalli. An approach to mathematical search through query formulation and data normalization. Towards Mechanized Mathematical Assistants, pages 342--355, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Misutka and L. Galambos. Extending full text search engine for mathematical content. Towards Digital Mathematics Library. Birmingham, United Kingdom, July 27th, 2008, pages 55--67, 2008.Google ScholarGoogle Scholar
  15. T. Nguyen, S. Hui, and K. Chang. A lattice-based approach for mathematical search using formal concept analysis. Expert Systems with Applications, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. T. T. Nguyen, K. Chang, and S. C. Hui. A math-aware search engine for math question answering system. In X. wen Chen, G. Lebanon, H. Wang, and M. J. Zaki, editors, CIKM, pages 724--733. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. T. Schellenberg, B. Yuan, and R. Zanibbi. Layout-based substitution tree indexing and retrieval for mathematical expressions. In Proceedings of SPIE, volume 8297, page 82970I, 2012.Google ScholarGoogle Scholar
  18. P. Sojka and M. Liska. Indexing and searching mathematics in digital libraries. Intelligent Computer Mathematics, pages 228--243, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R. Zanibbi and D. Blostein. Recognition and retrieval of mathematical expressions. International Journal on Document Analysis and Recognition, pages 1--27, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R. Zanibbi and B. Yuan. Keyword and image-based retrieval of mathematical expressions. In IS&T/SPIE Electronic Imaging, pages 78740I--78740I. International Society for Optics and Photonics, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  21. J. Zhao, M.-Y. Kan, and Y. L. Theng. Math information retrieval: user requirements and prototype implementation. In Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries, pages 187--196. ACM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. WikiMirs: a mathematical information retrieval system for wikipedia

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      JCDL '13: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries
      July 2013
      480 pages
      ISBN:9781450320771
      DOI:10.1145/2467696

      Copyright © 2013 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 22 July 2013

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      JCDL '13 Paper Acceptance Rate28of95submissions,29%Overall Acceptance Rate415of1,482submissions,28%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader