Skip to main content

A Critical Survey of Mathematical Search Engines

  • Conference paper
  • First Online:
Book cover Computational Intelligence, Communications, and Business Analytics (CICBA 2018)

Abstract

Traditional text retrieval systems cannot effectively search for mathematical expressions because it may contain formulae ranging from simple symbols to complex structures. In the area of math retrieval system, index data structure and document representation play a vital role in ranking and relevancy of results.The paper investigates the current math aware search engines to provide a critical overview of their relative strengths and limitations and to explore the current challenges related to the field.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Sojka, P.: Exploiting semantic annotations in math information retrieval. In: Proceedings of the Fifth Workshop on Exploiting Semantic Annotations in Information Retrieval. ESAIR 2012, pp. 15–16. ACM, New York (2012)

    Google Scholar 

  2. Pathak, A., Pakray, P., Sarkar, S., Das, D., Gelbukh, A.: MathIRs: retrieval system for scientific documents. Computación y Sistemas 21(2), 253–265 (2017). http://www.redalyc.org/articulo.oa?id=61551628007

  3. Kohlhase, M., Sucan, I.: A search engine for mathematical formulae. In: Calmet, J., Ida, T., Wang, D. (eds.) AISC 2006. LNCS (LNAI), vol. 4120, pp. 241–253. Springer, Heidelberg (2006). https://doi.org/10.1007/11856290_21

    Chapter  Google Scholar 

  4. W3C: Mathematical Markup Language. https://www.w3.org/TR/WD-math-980106/. Accessed 12 Feb 2018

  5. Latex A Document Preparation System. https://www.latex-project.org. Accessed 12 Feb 2018

  6. Openmath. http://www.openmath.org/. Accessed 12 Feb 2018

  7. Omdoc. http://www.omdoc.org/. Accessed 12 Feb 2018

  8. Archambault, D., Moço, V.: Canonical MathML to Simplify Conversion of MathML to Braille Mathematical Notations. In: Miesenberger, K., Klaus, J., Zagler, W.L., Karshmer, A.I. (eds.) ICCHP 2006. LNCS, vol. 4061, pp. 1191–1198. Springer, Heidelberg (2006). https://doi.org/10.1007/11788713_172

    Chapter  Google Scholar 

  9. Zanibbi, R., Blostein, D.: Recognition and retrieval of mathematical expressions. Int. J. Doc. Anal. Recogn. 15(4), 331–357 (2012). https://doi.org/10.1007/s10032-011-0174-4

    Article  Google Scholar 

  10. Graf, P.: Substitution tree indexing. In: Hsiang, J. (ed.) Rewriting Techniques and Applications, pp. 117–131. Springer, Heidelberg (1995). https://doi.org/10.1007/3-540-59200-8_52

    Chapter  Google Scholar 

  11. Miner, R., Munavalli, R.: An approach to mathematical search through query formulation and data normalization. In: Kauers, M., Kerber, M., Miner, R., Windsteiger, W. (eds.) Calculemus/MKM -2007. LNCS (LNAI), vol. 4573, pp. 342–355. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-73086-6_27

    Chapter  MATH  Google Scholar 

  12. Mišutka, J., Galamboš, L.: Extending full text search engine for mathematical content. In: Proceedings of the DML 2008, Towards Digital Mathematics Library, Birmingham, UK, 27th July 2008, pp. 55–67. Masaryk University, Brno (2008). Zbl 1170.68488)

    Google Scholar 

  13. Mišutka, J., Galamboš, L.: System description: EgoMath2 as a tool for mathematical searching on wikipedia.org. In: Davenport, J.H., Farmer, W.M., Urban, J., Rabe, F. (eds.) CICM 2011. LNCS (LNAI), vol. 6824, pp. 307–309. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22673-1_30

    Chapter  Google Scholar 

  14. Pineau, D.C.: Math-Aware Search Engines: Physics Applications and Overview, CoRR, vol. abs/1609.03457 (2016). (http://arxiv.org/abs/1609.03457)

  15. Libbrecht, P., Melis, E.: Methods to access and retrieve mathematical content in ActiveMath. In: Iglesias, A., Takayama, N. (eds.) ICMS 2006. LNCS, vol. 4151, pp. 331–342. Springer, Heidelberg (2006). https://doi.org/10.1007/11832225_33

    Chapter  MATH  Google Scholar 

  16. Sojka, P., Líška, M.: The art of mathematics retrieval. In: Proceedings of the ACM Conference on Document Engineering, DocEng 2011, pp. 57–60. Association of Computing Machinery, Mountain View, CA September 2011. https://doi.org/10.1145/2034691.2034703

  17. Oliveira, R.M., Gonzaga, F.B., Barbosa, V.C. Xexéo, G.B.: A distributed system for SearchOnMath based on the Microsoft BizSpark program, CoRR, vol. abs/1711.04189 (2017). http://arxiv.org/abs/1711.04189

  18. Borbinha, J., Bouche, T., Nowiński, A., Sojka, P.: Project EuDML – a first year demonstration. In: Davenport, J.H., Farmer, W.M., Urban, J., Rabe, F. (eds.) CICM 2011. LNCS (LNAI), vol. 6824, pp. 281–284. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22673-1_21

    Chapter  Google Scholar 

  19. Eu-DML. https://eudml.org/. Accessed 12 Feb 2018

  20. Hu, X., Gao, L., Lin, X., Tang, Z., Lin, X., Baker, J.B.: WikiMirs: a mathematical information retrieval system for wikipedia. In: 2013 Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital libraries (JCDL) (2013)

    Google Scholar 

  21. Wang, Y., Gao, L., Wang, S., Tang, Z., Liu, X., Yuan, F.: WikiMirs 3.0: a hybrid MIR system based on the context, structure and importance of formulae in a document. In: JCDL (2015)

    Google Scholar 

  22. Apache Lucene Core. https://lucene.apache.org/core/. Accessed 17 Feb 2018

  23. Sojka, P., Líška, M.: Indexing and searching mathematics in digital libraries. In: Davenport, J.H., Farmer, W.M., Urban, J., Rabe, F. (eds.) CICM 2011. LNCS (LNAI), vol. 6824, pp. 228–243. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22673-1_16

    Chapter  MATH  Google Scholar 

  24. Springer Innovations: LaTexSearch.com. https://www.springer.com/in/partners/society-zone-issues/springer-innovations-latexsearch-com/4516. Accessed 17 Feb 2018

  25. Kohlhase, M., Matican, B.A., Prodescu, C.-C.: MathWebSearch 0.5: scaling an open formula search engine. In: Jeuring, J., et al. (eds.) CICM 2012. LNCS (LNAI), vol. 7362, pp. 342–357. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-31374-5_23

    Chapter  Google Scholar 

  26. Anca, S.: Natural Language and Mathematics Processing for Applicable Theorem Search. Master’s Thesis, Jacobs University (2009)

    Google Scholar 

  27. Grigore, M., Wolska, M., Kohlhase, M.: Towards context-based disambiguation of mathematical expressions. In: The Joint Conference of ASCM, pp. 262–271 (2009)

    Google Scholar 

  28. Libbrecht, P., Melis, E.: Semantic search in LeActiveMath. In: First WebALT Conference and Exhibition, pp. 97–110, Technical University of Eindhoven, Netherlands (2006)

    Google Scholar 

  29. Sylwestrzak, W., Borbinha, J., Bouche, T., Nowiski, A.W., Sojka, P.: EuDML towards the european digital mathematics library architecture and design, pp. 11–26, Masaryk University Press (2010)

    Google Scholar 

  30. Stalnaker, D., Zanibbi, R.: Math expression retrieval using an inverted index over symbol pairs. In: Document Recognition and Retrieval XXII, San Francisco, California, USA, 11–12 February 2015

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sourish Dhar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dhar, S., Roy, S., Das, S.K. (2019). A Critical Survey of Mathematical Search Engines. In: Mandal, J., Mukhopadhyay, S., Dutta, P., Dasgupta, K. (eds) Computational Intelligence, Communications, and Business Analytics. CICBA 2018. Communications in Computer and Information Science, vol 1031. Springer, Singapore. https://doi.org/10.1007/978-981-13-8581-0_16

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-8581-0_16

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-8580-3

  • Online ISBN: 978-981-13-8581-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics