Determining Empirical Characteristics of Mathematical Expression Use

  • Conference paper
Mathematical Knowledge Management (MKM 2005)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3863)

Abstract

Many processes in mathematical computing try to use knowledge of the most desired forms of mathematical expressions. This occurs, for example, in symbolic computation systems, when expressions are simplified, and in mathematical document recognition, when formula layout is analyzed. The decision about which forms are the most desired, however, has typically been left to the guesswork or prejudices of a small number of system designers.

This paper observes that, on a domain-by-domain basis, certain expressions are actually used much more frequently than others. On the hypothesis that actual usage is the best measure of desirability, this paper begins to quantify empirically the use of common expressions in the mathematical literature. We analyze all 20,000 mathematical documents from the mathematical arXiv server from 2000-2004, the period corresponding to the new mathematical subject classification. We report on the process by which these documents are analyzed, through conversion to MathML, and present first empirical results on the most common aspects of mathematical expressions by subject classification. We use the notion of a weighted dictionary to record the relative frequency of subexpressions, and explore how this information may be used for further processes, including deriving common patterns of expressions and probability measures for symbol sequences.
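To make the idea of a weighted dictionary concrete, the following is a minimal sketch, not the authors' implementation. It assumes expressions are available as presentation MathML strings and that a subexpression is any subtree of the markup. The function names (`canonical`, `add_expression`, `relative_frequencies`) and the string key format are hypothetical choices made for illustration only.

```python
from collections import Counter
import xml.etree.ElementTree as ET


def canonical(node):
    """Return a compact string key for a MathML subtree (illustrative format)."""
    tag = node.tag.split('}')[-1]  # drop an XML namespace prefix, if present
    children = list(node)
    if not children:
        # Leaf element: keep its text content, e.g. "mi(x)".
        return f"{tag}({(node.text or '').strip()})"
    # Inner element: combine the keys of its children, e.g. "msup[mi(x),mn(2)]".
    return f"{tag}[{','.join(canonical(c) for c in children)}]"


def add_expression(weights, mathml):
    """Count every subtree of one MathML expression in the weighted dictionary."""
    root = ET.fromstring(mathml)
    for node in root.iter():  # visits the root and all of its descendants
        weights[canonical(node)] += 1


def relative_frequencies(weights):
    """Convert raw counts into relative frequencies that sum to 1."""
    total = sum(weights.values())
    return {key: count / total for key, count in weights.items()}


# Two small example expressions (assumed input, not data from the paper).
examples = [
    "<math><msup><mi>x</mi><mn>2</mn></msup></math>",
    "<math><mrow><msup><mi>x</mi><mn>2</mn></msup><mo>+</mo><mi>y</mi></mrow></math>",
]

weights = Counter()
for expr in examples:
    add_expression(weights, expr)

freqs = relative_frequencies(weights)
print(weights.most_common(3))
```

Keeping one such dictionary per subject classification would allow the relative frequencies to be compared across domains, which is the kind of per-domain comparison the abstract describes.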

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

So, C.M., Watt, S.M. (2006). Determining Empirical Characteristics of Mathematical Expression Use. In: Kohlhase, M. (ed.) Mathematical Knowledge Management. MKM 2005. Lecture Notes in Computer Science, vol 3863. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11618027_24

  • DOI: https://doi.org/10.1007/11618027_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31430-1

  • Online ISBN: 978-3-540-31431-8

  • eBook Packages: Computer Science, Computer Science (R0)
